February 19, 2024

Summary: https://zhuanlan.zhihu.com/p/617807098
posted @ 2024-02-19 11:48 zhenxia-jiuyou Views(2) Comments(0) Likes(0)

February 18, 2024

Summary: 1. build kubeflow/serving container image which contains serving_model [1] # run container tensorflow/serving, the image of this container # is the ba...
posted @ 2024-02-18 22:14 zhenxia-jiuyou Views(3) Comments(0) Likes(0)
 
Summary: 1. save the trained model # in module file of tfx component trainer def _apply_preprocessing(raw_features, tft_layer): transformed_features = tft_laye...
posted @ 2024-02-18 14:45 zhenxia-jiuyou Views(1) Comments(0) Likes(0)

February 15, 2024

Summary: 1. deploy worker, parameter server on kubernetes cluster 1.1 build container image of worker, parameter server $ git clone https://github.com/tensorfl...
posted @ 2024-02-15 16:37 zhenxia-jiuyou Views(11) Comments(0) Likes(0)
 
Summary: [ERROR: tf distribute strategy parameter server: tfx component trainer: model.save(): failed to connect to all addresses] log of pod tfx-component-tra...
posted @ 2024-02-15 00:01 zhenxia-jiuyou Views(36) Comments(0) Likes(0)

February 14, 2024

Summary: [ERROR: tf distribute strategy parameter server: tfx component trainer: OutOfRangeError(), Node: 'cond/IteratorGetNext' End of sequence] log of pod tf...
posted @ 2024-02-14 23:39 zhenxia-jiuyou Views(10) Comments(0) Likes(0)
 
Summary: [ERROR: NOT_FOUND: /tfx/tfx_pv/pipelines/detect_anomolies_on_wafer_tfdv_schema/ImportExampleGen/examples/67/Split-train/data_tfrecord-00000-of-00001.g...
posted @ 2024-02-14 17:59 zhenxia-jiuyou Views(7) Comments(0) Likes(0)
 
Summary: [ERROR: Failed to start a transaction to create a new pipeline and a new pipeline version: dial tcp: lookup mysql on 10.96.0.10:53: no such host","] >...
posted @ 2024-02-14 17:29 zhenxia-jiuyou Views(13) Comments(0) Likes(0)
 
Summary: [ERROR: stuck at "INFO:tensorflow:ParameterServerStrategyV2 is now connecting to cluster with cluster_spec: ClusterSpec({'ps': ['dist-strat-example-ps...
posted @ 2024-02-14 16:17 zhenxia-jiuyou Views(4) Comments(0) Likes(0)
 
Summary: [ERROR: unknown field "spec.template.spec.nodeAffinity"] (base) maye@maye-Inspiron-5547:~/github_repository/tensorflow_ecosystem/distribution_strategy...
posted @ 2024-02-14 14:44 zhenxia-jiuyou Views(13) Comments(0) Likes(0)