Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RuntimeError: argument not found #178

Closed
beng003 opened this issue Dec 2, 2024 · 3 comments
Closed

RuntimeError: argument not found #178

beng003 opened this issue Dec 2, 2024 · 3 comments

Comments

@beng003
Copy link

beng003 commented Dec 2, 2024

Issue Type

Running

Have you searched for existing documents and issues?

Yes

OS Platform and Distribution

Linux Ubuntu 18.04

All_in_one Version

v0.11.0b0

Module type

others

Module version

dataproxy v0.2.0b0

What happend and What you expected to happen.

'''
Feature,MinValues,MaxValues
T,-4.46,295.7
H,9.71,98.2745
P,989.41,1040.58
SWR,0,877.57
"WS_10 11",0.56 ,9.59
WS_30,0.60,11.86
WS_50,0.61,12.96
WS_70,0.61,13.73
WS_80,0.61,14.03
WS_90,0.62,14.31
WS_100,0.62,14.58
WS_170,0.64,18.14
WD_10,10.36,351.24
WD_30,11.78,350.06
WD_50,12.17,350.79
WD_70,12.22,351.64
WD_80,12.03,351.97
WD_90,10.65,352.26
WD_100,9.48,352.43
WD_170,0,359.99
'''

数据读取报错,数据中不能有空格("WS_10 11"

Log output.

2024-12-03 01:23:22 INFO the jobId=ssbt, taskId=ssbt-kcsrlxoh-node-34 start ...
2024-12-03 01:23:27 INFO the jobId=ssbt, taskId=ssbt-kcsrlxoh-node-34 failed: party alice failed msg: container[secretflow] terminated state reason "Error", message: "times\":60,\"connect_retry_interval_ms\":1000,\"brpc_channel_protocol\":\"http\",\"brpc_channel_connection_type\":\"pooled\",\"recv_timeout_ms\":1200000,\"http_timeout_ms\":1200000}}'}, {'name': 'heu', 'type': 'heu', 'parties': ['alice'], 'config': '{\"mode\": \"PHEU\", \"schema\": \"paillier\", \"key_size\": 2048}'}], 'ray_fed_config': {'cross_silo_comm_backend': 'brpc_link'}}, 'sf_node_eval_param': {'domain': 'stats', 'name': 'table_statistics', 'version': '1.0.0', 'attr_paths': ['input/input_ds/features'], 'attrs': [{'is_na': False, 'ss': ['Feature', 'MinValues', 'MaxValues']}], 'inputs': [{'type': 'sf.table.individual', 'meta': {'@type': 'type.googleapis.com/secretflow.spec.v1.IndividualTable', 'line_count': '-1'}, 'data_refs': [{'uri': 'MaxMin_546400942.csv', 'party': 'alice', 'format': 'csv'}]}], 'checkpoint_uri': 'ckssbt-kcsrlxoh-node-34-output-0'}, 'sf_output_uris': ['ssbt_kcsrlxoh_node_34_output_0'], 'sf_input_ids': ['zowonftw'], 'sf_input_partitions_spec': [''], 'sf_output_ids': ['ssbt-kcsrlxoh-node-34-output-0'], 'table_attrs': [{'table_id': 'zowonftw', 'column_attrs': [{'col_name': 'Feature', 'col_type': 'feature'}, {'col_name': 'MinValues', 'col_type': 'feature'}, {'col_name': 'MaxValues', 'col_type': 'feature'}]}]}, 'task_cluster_def': '{\"parties\":[{\"name\":\"alice\", \"role\":\"\", \"services\":[{\"portName\":\"fed\", \"endpoints\":[\"ssbt-kcsrlxoh-node-34-0-fed.alice.svc\"]}, {\"portName\":\"global\", \"endpoints\":[\"ssbt-kcsrlxoh-node-34-0-global.alice.svc:24160\"]}, {\"portName\":\"inference\", \"endpoints\":[\"ssbt-kcsrlxoh-node-34-0-inference.alice.svc\"]}, {\"portName\":\"spu\", \"endpoints\":[\"ssbt-kcsrlxoh-node-34-0-spu.alice.svc\"]}]}], \"selfPartyIdx\":0, \"selfEndpointIdx\":0}', 'allocated_ports': '{\"ports\":[{\"name\":\"fed\", \"port\":24166, \"scope\":\"Cluster\", \"protocol\":\"GRPC\"}, {\"name\":\"global\", \"port\":24160, \"scope\":\"Domain\", \"protocol\":\"GRPC\"}, {\"name\":\"node-manager\", \"port\":24161, \"scope\":\"Local\", \"protocol\":\"GRPC\"}, {\"name\":\"object-manager\", \"port\":24162, \"scope\":\"Local\", \"protocol\":\"GRPC\"}, {\"name\":\"client-server\", \"port\":24163, \"scope\":\"Local\", \"protocol\":\"GRPC\"}, {\"name\":\"inference\", \"port\":24164, \"scope\":\"Cluster\", \"protocol\":\"HTTP\"}, {\"name\":\"spu\", \"port\":24165, \"scope\":\"Cluster\", \"protocol\":\"GRPC\"}]}'}
2024-12-02 17:23:24,557|alice|INFO|secretflow|entry.py:start_ray:58| ray_conf: RayConfig(ray_node_ip_address='ssbt-kcsrlxoh-node-34-0-global.alice.svc', ray_node_manager_port=24161, ray_object_manager_port=24162, ray_client_server_port=24163, ray_worker_ports=[], ray_min_worker_port=11750, ray_max_worker_port=11850, ray_gcs_port=24160)
2024-12-02 17:23:24,557|alice|INFO|secretflow|entry.py:start_ray:66| Trying to start ray head node at ssbt-kcsrlxoh-node-34-0-global.alice.svc, start command: ray start --head --include-dashboard=false --disable-usage-stats --num-cpus=32 --node-ip-address=ssbt-kcsrlxoh-node-34-0-global.alice.svc --port=24160 --node-manager-port=24161 --object-manager-port=24162 --ray-client-server-port=24163 --min-worker-port=11750 --max-worker-port=11850
2024-12-02 17:23:26,799|alice|INFO|secretflow|entry.py:start_ray:79| 2024-12-02 17:23:25,036\tINFO usage_lib.py:423 -- Usage stats collection is disabled.
2024-12-02 17:23:25,036\tINFO scripts.py:744 -- Local node IP: ssbt-kcsrlxoh-node-34-0-global.alice.svc
2024-12-02 17:23:26,699\tSUCC scripts.py:781 -- --------------------
2024-12-02 17:23:26,699\tSUCC scripts.py:782 -- Ray runtime started.
2024-12-02 17:23:26,699\tSUCC scripts.py:783 -- --------------------
2024-12-02 17:23:26,699\tINFO scripts.py:785 -- Next steps
2024-12-02 17:23:26,699\tINFO scripts.py:788 -- To add another node to this Ray cluster, run
2024-12-02 17:23:26,699\tINFO scripts.py:791 --   ray start --address='ssbt-kcsrlxoh-node-34-0-global.alice.svc:24160'
2024-12-02 17:23:26,699\tINFO scripts.py:800 -- To connect to this Ray cluster:
2024-12-02 17:23:26,699\tINFO scripts.py:802 -- import ray
2024-12-02 17:23:26,699\tINFO scripts.py:803 -- ray.init(_node_ip_address='ssbt-kcsrlxoh-node-34-0-global.alice.svc')
2024-12-02 17:23:26,699\tINFO scripts.py:834 -- To terminate the Ray runtime, run
2024-12-02 17:23:26,699\tINFO scripts.py:835 --   ray stop
2024-12-02 17:23:26,699\tINFO scripts.py:838 -- To view the status of the cluster, use
2024-12-02 17:23:26,699\tINFO scripts.py:839 --   ray status

2024-12-02 17:23:26,799|alice|INFO|secretflow|entry.py:start_ray:80| Succeeded to start ray head node at ssbt-kcsrlxoh-node-34-0-global.alice.svc.
2024-12-02 17:23:26,799|alice|INFO|secretflow|entry.py:main:572| datasource.access_directly False
sf_node_eval_param  {
  \"domain\": \"stats\",
  \"name\": \"table_statistics\",
  \"version\": \"1.0.0\",
  \"attrPaths\": [
    \"input/input_ds/features\"
  ],
  \"attrs\": [
    {
      \"ss\": [
        \"Feature\",
        \"MinValues\",
        \"MaxValues\"
      ]
    }
  ],
  \"inputs\": [
    {
      \"type\": \"sf.table.individual\",
      \"meta\": {
        \"@type\": \"type.googleapis.com/secretflow.spec.v1.IndividualTable\",
        \"lineCount\": \"-1\"
      },
      \"dataRefs\": [
        {
          \"uri\": \"MaxMin_546400942.csv\",
          \"party\": \"alice\",
          \"format\": \"csv\"
        }
      ]
    }
  ],
  \"checkpointUri\": \"ckssbt-kcsrlxoh-node-34-output-0\"
} 
2024-12-02 17:23:26,800|alice|INFO|secretflow|sf_config.py:get_sf_cluster_config:36| kuscia_task_allocated_ports: ports {
  name: \"fed\"
  port: 24166
  scope: \"Cluster\"
  protocol: \"GRPC\"
}
ports {
  name: \"global\"
  port: 24160
  scope: \"Domain\"
  protocol: \"GRPC\"
}
ports {
  name: \"node-manager\"
  port: 24161
  scope: \"Local\"
  protocol: \"GRPC\"
}
ports {
  name: \"object-manager\"
  port: 24162
  scope: \"Local\"
  protocol: \"GRPC\"
}
ports {
  name: \"client-server\"
  port: 24163
  scope: \"Local\"
  protocol: \"GRPC\"
}
ports {
  name: \"inference\"
  port: 24164
  scope: \"Cluster\"
  protocol: \"HTTP\"
}
ports {
  name: \"spu\"
  port: 24165
  scope: \"Cluster\"
  protocol: \"GRPC\"
}

2024-12-02 17:23:26,807|alice|INFO|secretflow|entry.py:domaindata_id_to_dist_data:195| domaindata_id zowonftw to 
...........
name: \"MaxMinalice3\"
type: \"sf.table.individual\"
meta {
  type_url: \"type.googleapis.com/secretflow.spec.v1.IndividualTable\"
  value: \"\
2\\022\\007Feature\\022\\tMinValues\\022\\tMaxValues*\\003str*\\005float*\\005float\\020\\377\\377\\377\\377\\377\\377\\377\\377\\377\\001\"
}
data_refs {
  uri: \"MaxMin_546400942.csv\"
  party: \"alice\"
  format: \"csv\"
  null_strs: \"\"
}

....
2024-12-02 17:23:26,807|alice|INFO|secretflow|dp_file_adapter.py:download_file:31| dataproxy sdk: start download_file[/tmp/sf_ssbt-kcsrlxoh-node-34_alice/MaxMin_546400942.csv], type[1]
2024-12-02 17:23:26,846|alice|ERROR|secretflow|entry.py:<module>:637| unexpected exception
Traceback (most recent call last):
  File \"/usr/local/lib/python3.10/site-packages/secretflow/kuscia/entry.py\", line 635, in <module>
    main()
  File \"/usr/local/lib/python3.10/site-packages/click/core.py\", line 1157, in __call__
    return self.main(*args, **kwargs)
  File \"/usr/local/lib/python3.10/site-packages/click/core.py\", line 1078, in main
    rv = self.invoke(ctx)
  File \"/usr/local/lib/python3.10/site-packages/click/core.py\", line 1434, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File \"/usr/local/lib/python3.10/site-packages/click/core.py\", line 783, in invoke
    return __callback(*args, **kwargs)
  File \"/usr/local/lib/python3.10/site-packages/secretflow/kuscia/entry.py\", line 597, in main
    sf_node_eval_param = preprocess_sf_node_eval_param(
  File \"/usr/local/lib/python3.10/site-packages/secretflow/kuscia/entry.py\", line 327, in preprocess_sf_node_eval_param
    domaindata_id_to_dist_data(
  File \"/usr/local/lib/python3.10/site-packages/secretflow/kuscia/entry.py\", line 201, in domaindata_id_to_dist_data
    download_dist_data_from_dp(
  File \"/usr/local/lib/python3.10/site-packages/secretflow/kuscia/entry.py\", line 140, in download_dist_data_from_dp
    get_file_from_dp(
  File \"/usr/local/lib/python3.10/site-packages/secretflow/kuscia/datamesh.py\", line 206, in get_file_from_dp
    dm_flight_client.download_file(download_info, output_file_path, file_format)
  File \"/usr/local/lib/python3.10/site-packages/dataproxy/dp_file_adapter.py\", line 35, in download_file
    self.data_proxy_file.download_file(
RuntimeError: argument not found
"
@beng003
Copy link
Author

beng003 commented Dec 2, 2024

想问一下问题出在什么地方?怎么修改可以实现读取带空格的数据?
Example Image

目的是正确读取这种数据
Example Image

@zimu-yuxi
Copy link

在kuscia容器中cat /home/kuscia/var/logs/kuscia.log | grep 'Read domaindata' 看下

@beng003
Copy link
Author

beng003 commented Dec 3, 2024

在kuscia容器中cat /home/kuscia/var/logs/kuscia.log | grep 'Read domaindata' 看下

类型识别错误,已解决

@beng003 beng003 closed this as completed Dec 3, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants