  • d

    dry-umbrella-11948

    07/02/2025, 3:01 PM
    Hey, is there a good way to make an ArgoEvent published in a test/user namespace not trigger e.g. the production flow but only flows in that same namespace?
    ✅ 1
  • l

    lively-lunch-9285

    07/02/2025, 3:04 AM
    Lineage update: This week, I'm going all in on this POC to show data lineage between Airflow and Metaflow. If anyone here has ever used DataHub or OpenLineage instrumentation before, I'd love to talk here.
    airflow spin 2
    excited monkey 2
    👀 2
  • b

    brief-kite-90012

    06/30/2025, 9:29 AM
    Hi! I have a question about using metaflow_ray on Argo Workflows. I've installed JobSet in the same namespace as my Argo Workflows runner and added the permissions to my service account, but I'm currently hitting an error about the webhook service path. Is there any way I can modify this value? I can't find it in the Metaflow configs or anywhere else. Thanks in advance!
  • s

    straight-dog-3982

    06/27/2025, 7:21 AM
    Hi! In my k8s deployment, I have tried to change the datastore location, now I have this in my helm chart:
    metaflow-ui:
      uiBackend:
        metaflowDefaultDatastore: "s3"
        metaflowDatastoreSysRootS3: "s3://metaflow"
        metaflowS3EndpointURL: "http://minio.caic:9000"
    plus AWS_SECRET_* values through env. Previously I had DefaultDatastore set to "local". I have restarted both deployments for metaflow-ui. Now I'm running yet another flow, and the UI still shows "Since this run uses local datastore instead of cloud services, some information may be incomplete." And of course there is no data in my MinIO bucket. Is my S3 configuration correct? Do I need to drop the database or deploy the backend service too to see the effect?
  • r

    ripe-car-38698

    06/26/2025, 5:06 PM
    Hello team! I was wondering if any of you have seen this error when loading cards from a flow:
    URL/api/flows/ComponentDemoFlow/runs/4543/steps/start/tasks/222732/cards/blank/0490a0de7c024207af081c2187dcea5d?embed=true
    returns
    [Errno 2] No such file or directory: '/root/services/ui_backend_service/ui/index.html'
    • We are using a custom DNS behind a VPN, over http
    • Recent versions for all services: metaflow-ui.uiBackend.image.tag=2.4.13-2-g70af4ed, metaflow-service.image.tag=2.4.13-2-g70af4ed, metaflow-ui.uiStatic.image.tag=1.3.5-123-g95238f8-obp
    Thank you!
  • a

    astonishing-train-18397

    06/25/2025, 7:32 PM
    Hello everyone, I set up Metaflow on-prem with Argo Events and Argo Workflows. Is there something similar to sensors that react to events, for example a file being uploaded to S3? Also, can I connect this on-prem deployment to the Outerbounds UI? I specifically mean the third option here: https://docs.outerbounds.com/outerbounds/connect-to-outerbounds/
    👍 1
  • s

    straight-dog-3982

    06/25/2025, 10:51 AM
    Hi, I hope this is the right channel for this kind of question. I've deployed Metaflow to my k8s cluster using the Helm chart (https://github.com/outerbounds/metaflow-tools/tree/master/charts/metaflow). I connected it to my PostgreSQL DB, and this part seems to work fine. I have 3 services running: metaflow-service, -ui and -ui-static. So far so good. I've set METAFLOW_DEFAULT_DATASTORE=local, as I did not find out how to set up credentials for S3/MinIO. Now, I port-forward the service (port 8080) to localhost, and with METAFLOW_SERVICE_URL=http://localhost:8080/ I'm able to run an example flow; from the logs on the service pod it seems to be connecting to the remote server, which is nice. I can even check the flow status on port 8083, so it seems this is logged correctly.
    OK, but now I cannot see anything in the UI. The UI (running through ingress) just shows "Error loading data" on an empty dashboard page, nothing about my flow and nothing more specific about the error, just "Unknown error" / "generic-error". I assume some part of my stack is misconfigured, but I can't see any hints anywhere... Is it complaining about the connection to the backend? Or is the backend complaining about actual access to the datastore? I see nothing relevant in the logs...
  • l

    lively-lunch-9285

    06/23/2025, 8:43 PM
    I'm doing a walkthrough of my work's ML Platform tomorrow if anyone wants to come. Deets in thread...
    among us party 2
  • b

    bulky-portugal-95315

    06/23/2025, 6:57 PM
    hi team, we recently updated our metaflow version to 2.5.0 and we seem to have lost metadata around the ec2-instance-id and ec2-instance-type for our batch jobs. Was this intended or moved somewhere else?
    ✅ 1
  • s

    square-wire-39606

    06/23/2025, 5:49 PM
    @here @lively-lunch-9285 and his team at Pattern (they are the biggest seller on Amazon after Amazon!) will be discussing some cool stuff they have been hacking on top of Metaflow tomorrow. Join us if you are able to!
  • a

    ambitious-evening-58240

    06/23/2025, 7:02 AM
    Hello! First of all, thank you for your work. We're using Metaflow extensively to structure the results from our library, and it's been very helpful. In our current workflow, we call various functions from elsewhere in our code within a Metaflow flow. I was wondering: is there a way to automatically treat all called functions as Metaflow steps, without needing to explicitly annotate them with @step?
  • r

    ripe-car-38698

    06/20/2025, 7:03 PM
    Hello team! Does this ring a bell? We are trying to display cards for our Metaflow flows, but we are getting an error in the Metaflow UI pod. /tmp seems to be used correctly, as I can see that some data has been written:
    /tmp/metaflow_client/gs.gs:/argo-workflows-artifacts-pilot.ComponentDemoFlow/98
    The error is in the thread. Thank you!
    ✅ 1
  • c

    chilly-monkey-19638

    06/20/2025, 6:23 AM
    Hi, while running workflows as Step Functions, we wanted to use our own image for step executions, which runs as a non-root user. But this fails with the error below. Is it possible to run as a non-root user, or does it always have to be root?
  • l

    lively-lunch-9285

    06/19/2025, 3:46 AM
    Hey! In my efforts to instrument Metaflow with OpenLineage:
    • I'm storing a value on self., e.g. self.x
    • and I want to access it on current., e.g. current.x (accessing it in the same flow run)
    But for the life of me, I can't find a way to do that. Could anyone help me out?
  • a

    acoustic-van-30942

    06/17/2025, 4:13 AM
    Hi all, I'm getting this error from my Kubernetes pod on Metaflow:
    The node was low on resource: ephemeral-storage.
    Threshold quantity: 32204283974, available: 30875356Ki.
    Container main was using 51664160Ki, request is 10240M, has larger consumption of ephemeral-storage.
    To resolve this, would I need to increase disk space or tmpfs storage?
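For reference, a small sketch of the arithmetic behind that message: it converts the Kubernetes quantities to bytes so that usage, request, and the node threshold can be compared directly. The helper name `to_bytes` is made up for illustration, and it only handles the handful of suffixes listed in the code.

```python
# Hypothetical helper (not part of Metaflow or Kubernetes): convert a
# Kubernetes-style quantity string such as "51664160Ki" or "10240M" to bytes.
SUFFIXES = {
    "Ki": 1024, "Mi": 1024**2, "Gi": 1024**3,  # binary suffixes
    "k": 1000, "M": 1000**2, "G": 1000**3,     # decimal suffixes
}

def to_bytes(quantity: str) -> int:
    # Try longer suffixes first so "Mi" is not mistaken for "M".
    for suffix in sorted(SUFFIXES, key=len, reverse=True):
        if quantity.endswith(suffix):
            return int(quantity[: -len(suffix)]) * SUFFIXES[suffix]
    return int(quantity)  # no suffix: already in bytes

# Values copied from the eviction message above.
used = to_bytes("51664160Ki")
requested = to_bytes("10240M")
available = to_bytes("30875356Ki")
threshold = to_bytes("32204283974")

print(f"container used:  {used / 1e9:.1f} GB")
print(f"container asked: {requested / 1e9:.1f} GB")
print(f"usage is {used / requested:.1f}x the request")
print(f"node below threshold: {available < threshold}")
```

Read this way, the container was using several times the ephemeral storage it requested, and the node's free space had dropped below its eviction threshold. If the step really needs that much scratch space, one thing to check is the `disk` argument of Metaflow's `@kubernetes` decorator, which, as far as I know, sets the disk request in MB; verify that against the docs for your version.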
  • f

    fast-advantage-42097

    06/16/2025, 4:13 PM
    Is there a way to use @schedule with different parameters being passed into the flow? Is Config the only option? We have an existing setup where a single Flow is being called with different parameters, and migrating it to use Config is non-trivial.
    ✅ 1
  • e

    enough-article-90757

    06/14/2025, 2:12 AM
    Are there any recommendations about setting a TTL for Metaflow objects in S3 / other object stores? https://github.com/Netflix/metaflow/issues/21 only says that the TTL needs to be longer than the lifetime of the workflow, not sure if that has changed in the last six years. Any info would be great!
    ✅ 1
  • g

    gorgeous-kangaroo-54074

    06/13/2025, 6:21 PM
    Is there a way to make the pylint checker that runs before a flow check all of the code in symlinked directories in addition to the flow itself?
  • b

    bored-vr-66208

    06/13/2025, 1:05 PM
    I just discovered Marimo notebooks! Has anyone used Marimo notebooks with Metaflow yet? https://marimo.io/
  • a

    ancient-guitar-13766

    06/12/2025, 7:41 PM
    Hey guys, is there any reason why a particular run-id would not show up in the Metaflow UI? This run-id is a resumed Argo workflow. I see the following logs:
    Metaflow 2.15.14 executing flowTrain for user:j.kollipara
    Validating your flow...
    The graph looks good!
    Running pylint...
    Pylint is happy!
    Workflow starting (run-id 1749756923772615):
    ✅ 1
  • l

    lively-lunch-9285

    06/11/2025, 11:49 PM
    I took a stab at logging lineage for a metaflow flow using OpenLineage and Marquez. The concept is super exciting.
    ooh 2
  • m

    modern-dinner-75080

    06/11/2025, 3:28 PM
    Let me know if there is a more appropriate developer channel for this type of question.
  • m

    modern-dinner-75080

    06/11/2025, 3:14 PM
    message has been deleted
  • m

    modern-dinner-75080

    06/11/2025, 3:14 PM
    I am running into several initialization challenges with a custom step decorator that can run remote tasks in Azure Batch. I tried to use batch_decorator.py from the metaflow.plugins.aws.batch module as a template (see attached code). However, when I use this in a FlowSpec, I get errors such as:
    MyAzureBatchDecorator is not callable
    ImportError: cannot import name 'my_azure_batch' from 'custom_metaflow.plugins.decorators.my_azure_decorator'
    AttributeError: 'MyAzureDecorator' object has no attribute '__func__'. Did you mean: '__doc__'?
  • l

    lively-lunch-9285

    06/10/2025, 11:45 PM
    You know, the python <flow>.py ... CLI has a lot going on. There are a lot of arguments and options you can pass in. We could totally make a VS Code extension that turns some of it into a push-button affair:
    • killing Kubernetes pods
    • resuming (and from which step? 🤔)
    • running
    • choosing your environment
    • launching a debug session that stops if you set a breakpoint
    I'm just saying, there are a TON of subcommands. And I've been guilty of asking repeat questions on these, as well as fielding tons of questions about it all.
    ✅ 2
  • c

    chilly-france-99853

    06/10/2025, 6:47 AM
    Hey good people, has anyone used API Gateway (with Lambda) to trigger a Metaflow workflow? Because of Lambda's read-only filesystem, I'm hitting a wall right now. If not, are there any other alternatives for triggering it via an endpoint? Thanks in advance!
  • s

    shy-midnight-40599

    06/06/2025, 2:44 PM
    Hi, I am using the "step-functions delete" command to delete the Step Functions and other resources created by the "step-functions create" command. It deletes only the Step Function and the EventBridge rule associated with it, but not the job definitions. Is there any reason behind this? We have a bunch of job definitions that are not used by any Step Functions, and we are trying to figure out a way to delete those automatically.
    ✅ 1
  • f

    fast-vr-44972

    06/06/2025, 12:42 PM
    Hi, how do you delete a flow completely from metaflow + argo workflow? Is it just removing the template? or something else?
    ✅ 1
  • g

    gentle-author-38571

    06/06/2025, 6:40 AM
    Hi, is it normal for Metaflow to create a local datastore in an ECS container (via AWS Batch) when MF_DATASTORE=s3 is set? I'm getting the error below:
    Creating local datastore in current directory (/opt/my_app/portfolio_calcs/.metaflow)
    Internal error
    Traceback (most recent call last):
      File "/opt/my_app/metaflow/metaflow/cli.py", line 554, in main
        start(auto_envvar_prefix="METAFLOW", obj=state)
    . . . .
      File "/opt/my_app/metaflow/metaflow/datastore/content_addressed_store.py", line 140, in load_blobs
        with open(file_path, "rb") as f:
    TypeError: expected str, bytes or os.PathLike object, not NoneType
    ✅ 1
  • q

    quick-carpet-67110

    06/05/2025, 11:37 AM
    Hey everyone! We are trying to orchestrate our Metaflow pipelines with Airflow/Cloud Composer (running on GCP), but when we upload the generated Airflow DAG into our DAGs folder, we get the parsing error shown below. Does this look familiar to anyone? We are using this command to generate the pipeline:
    python flow.py --environment=pypi --with retry airflow create flow_dag.py
    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
      File "/opt/python3.11/lib/python3.11/site-packages/pluggy/_callers.py", line 139, in _multicall
        raise exception.with_traceback(exception.__traceback__)
      File "/opt/python3.11/lib/python3.11/site-packages/pluggy/_callers.py", line 103, in _multicall
        res = hook_impl.function(*args)
              ^^^^^^^^^^^^^^^^^^^^^^^^^
      File "/etc/airflow/config/airflow_local_settings.py", line 62, in pod_mutation_hook
        and any(env_var.name == "AIRFLOW_IS_K8S_EXECUTOR_POD" for env_var in container.env)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
      File "/etc/airflow/config/airflow_local_settings.py", line 62, in <genexpr>
        and any(env_var.name == "AIRFLOW_IS_K8S_EXECUTOR_POD" for env_var in container.env)
                ^^^^^^^^^^^^
    AttributeError: 'dict' object has no attribute 'name'
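The last frame suggests that `container.env` holds plain dicts in this case, while the hook expects objects with a `.name` attribute. Below is a minimal sketch of a lookup that tolerates both shapes. `env_var_name` and `has_env_var` are hypothetical helpers, not Airflow or Metaflow APIs, and whether `airflow_local_settings.py` can be edited in a managed Cloud Composer environment is an assumption to check.

```python
from types import SimpleNamespace

def env_var_name(env_var):
    """Return the name of an env var given either a dict or an object."""
    if isinstance(env_var, dict):
        return env_var.get("name")
    return getattr(env_var, "name", None)

def has_env_var(env, wanted):
    """True if any entry in env (a list, possibly None) is named `wanted`."""
    return any(env_var_name(entry) == wanted for entry in (env or []))

# Stand-ins for the two shapes: dict entries and attribute-style objects.
as_dicts = [{"name": "AIRFLOW_IS_K8S_EXECUTOR_POD", "value": "True"}]
as_objects = [SimpleNamespace(name="AIRFLOW_IS_K8S_EXECUTOR_POD", value="True")]

print(has_env_var(as_dicts, "AIRFLOW_IS_K8S_EXECUTOR_POD"))
print(has_env_var(as_objects, "AIRFLOW_IS_K8S_EXECUTOR_POD"))
```

This only illustrates why the attribute access fails on a dict; the actual fix may belong in how the generated DAG builds its pod spec rather than in the hook.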
    👀 1
    ✅ 1