Support stateless for multi container scaling and deployment #1831

sherodtaylor · 2024-07-19T18:19:55Z

Description

I have a custom kubernetes deployment that requires stateless applications which has been a standard for a long time.

Alternative

Our internal kubernetes system doesn't support sticky sessions.

Additional context

error received:

  File "/bb/libexec/workflow-metrics-notebooks/python/lib/python3.10/site-packages/marimo/_server/api/deps.py", line 135, in require_current_session
    raise ValueError(f"Invalid session id: {session_id}")
ValueError: Invalid session id: s_p41hf3

import argparse
import logging
from typing import Any

import marimo
from fastapi import FastAPI
import uvicorn



LOG_LEVELS = ["DEBUG", "INFO", "WARNING", "ERROR", "CRITICAL"]

def parse_args(args: list[str] | None = None) -> dict[str, Any]:
    parser = argparse.ArgumentParser()
    parser.add_argument("--log-file", "-l")
    parser.add_argument("--host", default="0.0.0.0")
    parser.add_argument("--log-level", default="INFO", choices=LOG_LEVELS)
    parser.add_argument("--dir", default="./notebooks")
    parsed_args, unknown = parser.parse_known_args(args=args)
    logging.info(f'unknown args - {unknown}')
    return vars(parsed_args)



def start_server(host: str, dir: str) -> None:

    server = (
        marimo.create_asgi_app()
        .with_app(path="", root=f'{dir}/workflow-metrics.py')
    )

    # Create a FastAPI app
    app = FastAPI(host=host)
    app.mount("/", server.build())

    uvicorn.run(app, host=host, port=8080)


def main() -> None:
    args = parse_args()
    start_server(args.pop("host"), args.pop("dir"))

if __name__ == "__main__":
    main()

The text was updated successfully, but these errors were encountered:

akshayka · 2024-07-23T22:09:41Z

Copying a response from Discord, in case others are also interested:

This will be difficult since not everything is serializable so can’t be stateless. These are running programs that inherently have state. Even if we made marimo stateless, your code may not be (e.g threads, db connections, etc)

I think a better request would be a load balancer that can manage multiple instances.

Based on what you know about marimo, if you have suggestions on how marimo might one day support stateless execution, we're open to hearing them.

sherodtaylor · 2024-09-24T14:21:30Z

@akshayka one model you can follow is how Airflow serializes it's objects. It would be useful for deployed programs as programs are built to be stateless.

https://airflow.apache.org/docs/apache-airflow/stable/authoring-and-scheduling/serializers.html

akshayka · 2024-09-24T14:38:13Z

Thanks for the link. We'll likely look into this one day (perhaps we could patch globals() to hit an external cache) — I appreciate how this would make horizontal scaling very easy — but it's not on our short-term roadmap.

sherodtaylor · 2025-02-06T17:28:42Z

@akshayka any idea if there has been any prioritization for this because i'd really like to use this. It would be nice to have a redis cache of some sort

dmadisetti · 2025-02-06T17:51:06Z

Just testing my understanding, would something like --global-session (#2489) coupled with caching (https://docs.marimo.io/api/caching/#marimo.persistent_cache) address this?

edit: Updated link to persistent_cache

sherodtaylor · 2025-02-06T18:03:58Z

@dmadisetti I think that would work if there was redis integration and the global session shared in redis some how through persistant_cache maybe?

dmadisetti · 2025-02-06T18:32:18Z

I think adding redis integration to persistent_cache would be some low hanging fruit, and something we've been thinking about (just a general remote cache).

I think a persistent session becomes pretty tricky, and we'd have to choose what we want to serialize.
What stateful information would you want to persist across stateless app instantiations? UI Values, mo.State? Or this generally not needed, and a fresh session ala --global-session enough?
You could roll it yourself:

stateful_vars = load_stateful_from_redis()
my_state, my_ui_element, my_shared_object_with_varying_state = create_stateful_objs_from_dump(stateful_vars)

# e.g.
f""" Just a simple dashboard showing
{my_ui_element}
which is the same for every session
"""

with persistent_cache("result_ideally_loaded_from_redis", type="redis") as cache_attempt:
      do_app_level_computations(my_shared_object_with_varying_state)

if not cache_attempt.hit:
    dump_stateful_obj(my_shared_object_with_varying_state)
dump_ui_state(my_state, my_ui_element)

But open to other api suggestions, and how a more native solution could smooth this over

sherodtaylor · 2025-02-07T14:04:48Z

i think a persistant session is necessary as the our system doesn't support sticky sessions which means that the session needs to persist remotely as the api calls with reach out via a round robin load balancer which could hit different machines.

dmadisetti · 2025-02-07T14:57:10Z

Oh I see, so down to the request level it needs to be session agnostic.

Have you considered just using the containers to create / serve a WASM page? Getting dynamic information from unsupported libraries would then be as easy as a refresh, and it would still be interactive within WASM. Alternatively, you could also serve public/ to return dynamic information (https://docs.marimo.io/guides/wasm/?h=public#including-data) - and just do as much as you can in WASM for everything else.

Maybe true stateless execution is possible? There's a lot of web-socket and miscellaneous requests- but I feel like WASM could get you most of the way there, and maybe an expanded Islands (server side initial run) could get you further: https://docs.marimo.io/guides/exporting/?h=islands#islands-in-action

sherodtaylor changed the title ~~Support stateless deployment~~ Support stateless deployment for multi container deployment Jul 19, 2024

sherodtaylor changed the title ~~Support stateless deployment for multi container deployment~~ Support stateless for multi container scaling and deployment Jul 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support stateless for multi container scaling and deployment #1831

Support stateless for multi container scaling and deployment #1831

sherodtaylor commented Jul 19, 2024 •

edited

Loading

akshayka commented Jul 23, 2024

sherodtaylor commented Sep 24, 2024

akshayka commented Sep 24, 2024

sherodtaylor commented Feb 6, 2025

dmadisetti commented Feb 6, 2025 •

edited

Loading

sherodtaylor commented Feb 6, 2025

dmadisetti commented Feb 6, 2025

sherodtaylor commented Feb 7, 2025

dmadisetti commented Feb 7, 2025

Support stateless for multi container scaling and deployment #1831

Support stateless for multi container scaling and deployment #1831

Comments

sherodtaylor commented Jul 19, 2024 • edited Loading

Description

Suggested solution

Alternative

Additional context

akshayka commented Jul 23, 2024

sherodtaylor commented Sep 24, 2024

akshayka commented Sep 24, 2024

sherodtaylor commented Feb 6, 2025

dmadisetti commented Feb 6, 2025 • edited Loading

sherodtaylor commented Feb 6, 2025

dmadisetti commented Feb 6, 2025

sherodtaylor commented Feb 7, 2025

dmadisetti commented Feb 7, 2025

sherodtaylor commented Jul 19, 2024 •

edited

Loading

dmadisetti commented Feb 6, 2025 •

edited

Loading