mysterious-tailor-22852
12/17/2023, 12:02 AM--no-auto-emit-argo-events
command because otherwise, it generates an error (the error is in the image)
I would like to better understand how the Metaflow UI knows when a pipeline fails or not, if using the no auto emit argo event is related to my problem, and if anyone knows how to solve these issues. I would be very grateful. Regards.dry-beach-38304
12/17/2023, 9:09 AMfuture-carpet-9485
12/17/2023, 8:03 PMdry-beach-38304
12/18/2023, 12:38 PMRUN_INACTIVE_CUTOFF_TIME
.dry-beach-38304
12/18/2023, 12:38 PMHEARTBEAT_THRESHOLD
as well.bulky-afternoon-92433
12/18/2023, 12:45 PMRUN_INACTIVE_CUTOFF_TIME
(in seconds) to a rough estimate on how long tasks can take to start. Default for this is 6 minutes in the recent releases, but a while ago it was still quite optimistically set to 1minutebulky-afternoon-92433
12/18/2023, 12:55 PMPOST
to the metadata service for the running task
• metadata service updates heartbeat for task, and the associated run for the task
• once the metadata service updates a task record in the database with a new heartbeat, a postgres table trigger sends a message for this to any listeners
• the MetaflowUI backend is listening to these updates, and is running its own 'keepalive' process for running runs/tasks, which are refreshed based on received heartbeats
◦ if a keepalive fails, the real status will be inferred and broadcast to any open UI websocket that has subscribed to the affected resource
As at the moment the only source of info for run level freshness comes from executing tasks, this is there reason we need a threshold for run level failures, instead of being able to provide completely accurate fail statusesmysterious-tailor-22852
12/20/2023, 12:48 PM