important-london-94970
02/10/2025, 7:41 PM
  File "/metaflow/metaflow_extensions/ray/plugins/status_notifier.py", line 106, in wait_for_task_completion
    if time.time() - status.timestamp > heartbeat_timeout:
       ~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~
TypeError: unsupported operand type(s) for -: 'float' and 'NoneType'
Running:
• metaflow==2.13
• metaflow-ray==0.1.3
square-wire-39606
02/10/2025, 7:55 PM
important-london-94970
02/10/2025, 8:00 PM
@retry did eventually work on the last retry
hallowed-glass-14538
02/10/2025, 8:25 PM
square-wire-39606
02/10/2025, 8:26 PM
None
important-london-94970
02/10/2025, 8:27 PM
hallowed-glass-14538
02/10/2025, 8:30 PM
num_parallel you are running?
important-london-94970
02/10/2025, 8:32 PM
hallowed-glass-14538
02/10/2025, 8:37 PM
hallowed-glass-14538
02/11/2025, 1:42 AM
@environment(vars={"METAFLOW_RAY_DEBUG_MODE":"true"}) on the @step that has @metaflow_ray and this will make it publish debug logs to stderr for the worker tasks. Can you try it out and potentially share the logs if you face this issue again?
important-london-94970
02/11/2025, 1:43 PM
metaflow-ray==0.1.4 and that fixed my issue!
I'm seeing the fix in the logs too.
[@metaflow_ray] Task 0 status: running with timestamp 1739281146.9187553
08:39:12 [@metaflow_ray] Task 0 status: unreachable with timestamp None
08:39:12 [@metaflow_ray] Task 0 unreachable
08:39:12 [@metaflow_ray] Task 0 still unreachable after 0.0 seconds
08:39:13 [@metaflow_ray] Task 0 status: running with timestamp 1739281152.0030744
08:39:14 [@metaflow_ray] Task 0 status: running with timestamp 1739281152.0030744
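The original traceback and these logs point at the same situation: a task can briefly report a status whose timestamp is None, and subtracting None from time.time() raises the TypeError. Below is a minimal sketch of a guard for that case. It is not the actual metaflow-ray 0.1.4 change; TaskStatus and heartbeat_expired are hypothetical names used only for illustration, and treating a missing timestamp as "not expired" is an assumption.

```python
import time
from dataclasses import dataclass
from typing import Optional


@dataclass
class TaskStatus:
    # Hypothetical stand-in for the status object seen in the traceback.
    state: str
    timestamp: Optional[float]  # None when no heartbeat could be read


def heartbeat_expired(status: TaskStatus,
                      heartbeat_timeout: float,
                      now: Optional[float] = None) -> bool:
    """Return True only if a heartbeat exists and is older than the timeout."""
    if status.timestamp is None:
        # Nothing to compare against: report "not expired" and leave it to
        # the caller to track how long the task has been unreachable.
        return False
    current = time.time() if now is None else now
    return current - status.timestamp > heartbeat_timeout


# A None timestamp no longer raises; it is simply not treated as expired.
unreachable = heartbeat_expired(TaskStatus("unreachable", None), 60.0)
# A heartbeat 100 seconds old with a 60 second timeout counts as expired.
stale = heartbeat_expired(TaskStatus("running", 100.0), 60.0, now=200.0)
```

The optional now argument is there only so the check can be exercised with fixed values; a caller would normally omit it.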
important-london-94970
02/11/2025, 1:43 PM