Update storage driver store context metadata by jmaeagle99 · Pull Request #1399 · temporalio/sdk-python

jmaeagle99 · 2026-03-30T04:08:17Z

What was changed

Remove StorageDriverStoreContext.serialization_context in favor of a new asymmetric context information system for storage drivers.
Update the S3 driver to use the new context information system.

When context is provided, the S3 driver now generates every key in one of the following formats:

v0/ns/{namespace}/wt/{workflow_type}/wi/{workflow_id}/ri/{run_id}/d/sha256/{hash} for all workflow operations, and
v0/ns/{namespace}/at/{activity_type}/ai/{activity_id}/ri/{run_id}/d/sha256/{hash} for standalone activity operations.

The null sentinel value is used for any token that is not known during the driver store operation. For example, when starting a workflow from a client, the run ID of the workflow is not yet known, so the {run_id} token has a null value.
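The two formats above can be sketched as a small key builder. This is only an illustration under stated assumptions: `WorkflowTarget`, `ActivityTarget`, and `build_key` are hypothetical names (not the SDK's classes), the sentinel is assumed to be the literal string `null`, and `{hash}` is assumed to be the hex SHA-256 digest of the payload bytes. Any escaping of segment values is left out.

```python
from __future__ import annotations

import hashlib
from dataclasses import dataclass

NULL = "null"  # assumption: the null sentinel is the literal string "null"


@dataclass(frozen=True)
class WorkflowTarget:  # hypothetical stand-in, not the SDK class
    namespace: str
    workflow_type: str | None = None
    workflow_id: str | None = None
    run_id: str | None = None


@dataclass(frozen=True)
class ActivityTarget:  # hypothetical stand-in, not the SDK class
    namespace: str
    activity_type: str | None = None
    activity_id: str | None = None
    run_id: str | None = None


def _seg(value: str | None) -> str:
    """Return the value, or the null sentinel when it is unknown."""
    return value if value else NULL


def build_key(target: WorkflowTarget | ActivityTarget, payload: bytes) -> str:
    """Build a v0 key for the payload under the given target's key space."""
    # assumption: {hash} is the hex SHA-256 digest of the raw payload bytes
    digest = hashlib.sha256(payload).hexdigest()
    if isinstance(target, WorkflowTarget):
        identity = [
            "wt", _seg(target.workflow_type),
            "wi", _seg(target.workflow_id),
        ]
    else:
        identity = [
            "at", _seg(target.activity_type),
            "ai", _seg(target.activity_id),
        ]
    parts = ["v0", "ns", target.namespace, *identity,
             "ri", _seg(target.run_id), "d", "sha256", digest]
    return "/".join(parts)
```

For a client-side workflow start, `build_key(WorkflowTarget("default", "OrderWorkflow", "order-1"), data)` would yield a key beginning with `v0/ns/default/wt/OrderWorkflow/wi/order-1/ri/null/d/sha256/`, since the run ID is not known at that point.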

Key Segments by Operation

The overall idea is that payloads are stored in the key space of the target of the operation. The exception is when there is no target (standalone activity results, parentless workflow results); in that case the target is set to the current workflow or activity.

| Operation | Key segments | Notes |
| --- | --- | --- |
| Client: `start_workflow` | `wt`=type, `wi`=ID, `ri`=`null` | Run ID not yet assigned |
| Client: `signal_workflow` | `wt`=`null`, `wi`=ID, `ri`=run ID or `null` | Type not available from client; run ID present if handle was obtained with one |
| Client: `signal_with_start` | `wt`=type, `wi`=ID, `ri`=`null` | |
| Client: `execute_update` | `wt`=`null`, `wi`=ID, `ri`=run ID or `null` | Type not available from client; run ID present if handle was obtained with one |
| Client: `query_workflow` | `wt`=`null`, `wi`=ID, `ri`=run ID or `null` | Type not available from client; run ID present if handle was obtained with one |
| Client: schedule action | `wt`=type, `wi`=ID, `ri`=`null` | |
| Client: async activity (ID reference with workflow ID) | `wt`=`null`, `wi`=workflow ID, `ri`=run ID or `null` | Keyed under parent workflow; type not available |
| Client: async activity (ID reference, no workflow ID) | `at`=`null`, `ai`=activity ID, `ri`=`null` | Activity-keyed; type and run ID not available |
| Client: async activity (task token) | (no context segments) | No identity info in raw token |
| Workflow worker: schedule activity | `wt`=type, `wi`=ID, `ri`=run ID | Keyed under the current workflow |
| Workflow worker: start child workflow | `wt`=type, `wi`=ID, `ri`=`null` | Keyed under the child workflow |
| Workflow worker: signal external | `wt`=`null`, `wi`=ID, `ri`=`null` | Keyed under the target workflow |
| Workflow worker: continue-as-new | `wt`=type, `wi`=ID, `ri`=run ID | Keyed under the current workflow; should be target but requires broader CaN support |
| Workflow worker: result | `wt`=type, `wi`=ID, `ri`=run ID | Child workflow: keyed under the parent workflow. No parent workflow: keyed under the current workflow. |
| Activity worker: heartbeat | `wt`=type, `wi`=ID, `ri`=run ID | Keyed under the parent workflow |
| Activity worker: result | `wt`=type, `wi`=ID, `ri`=run ID | Workflow activity: keyed under the parent workflow. Standalone activity: keyed under the current activity. |

One scenario that still needs to be fixed is CaN, but that requires more work in assigning a sequence number, and the serialization context does not seem to support it correctly. I would advocate for fixing it separately from this PR.

Why?

Provide more information to storage drivers than what is permissible by SerializationContext
Create a consistent key format for the S3 driver to better allow external lifecycle management
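To illustrate the lifecycle-management point, here is a rough sketch of how an external tool might split a key back into its segments. `parse_key` is a hypothetical helper, not part of the SDK or the S3 driver. It assumes the sentinel is the literal string `null` and that no segment value contains `/`; keys without context segments (the task-token case) are simply rejected.

```python
from __future__ import annotations

NULL = "null"  # assumption: the null sentinel is the literal string "null"

# Label sequences of the two documented key shapes.
_WORKFLOW_LABELS = ["ns", "wt", "wi", "ri"]
_ACTIVITY_LABELS = ["ns", "at", "ai", "ri"]


def parse_key(key: str) -> dict[str, str | None]:
    """Split a v0 key into labeled segments, mapping the sentinel to None."""
    parts = key.split("/")
    # v0 + four label/value pairs + d/sha256/{hash} = 12 parts
    if len(parts) != 12 or parts[0] != "v0":
        raise ValueError(f"not a v0 context key: {key!r}")
    labels = parts[1:9:2]   # ns, wt|at, wi|ai, ri
    values = parts[2:10:2]  # the value following each label
    if labels not in (_WORKFLOW_LABELS, _ACTIVITY_LABELS):
        raise ValueError(f"unexpected segment labels: {labels}")
    if parts[9:11] != ["d", "sha256"]:
        raise ValueError(f"unexpected digest segment in: {key!r}")
    parsed: dict[str, str | None] = {
        label: (None if value == NULL else value)
        for label, value in zip(labels, values)
    }
    parsed["hash"] = parts[11]
    return parsed
```

A cleanup job could then, for example, group objects by the `wi` value and expire them once the corresponding workflow is past its retention period.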

Checklist

How was this tested: Unit and integration tests
Any docs updates needed? Yes

README.md

temporalio/converter/_extstore.py

drewhoskins-temporal

re: your comments on CaN -- sequence # is an interesting idea.

drewhoskins-temporal · 2026-04-01T21:29:59Z

temporalio/converter/_extstore.py

-    """The serialization context active when this store operation was initiated,
-    or ``None`` if no context has been set.
-    """
+    target: StorageDriverActivityInfo | StorageDriverWorkflowInfo | None = None


"target" struck me as odd since it's an info rather than a target. Then my second thought was that target would be "where it's stored" rather than "what's being called."
Maybe callee_info or something?

Like you said, target might not be the best name here. It implies where something should be stored, which is what the information is hinting at. It's trying to say "use this information for storing the payload in a structured manner".

Implementation-wise, the target could be the current context or the other side of the serialization boundary, so calling it caller or callee would be wrong in a handful of scenarios. For example, if this was invoked for the result of a completing workflow, it's the "caller" information if it's the top-level workflow but the "callee" information if the workflow has a parent. That is because we want to store the payload (in S3 terminology) under the key prefix of the object that needs it for replay and for lifecycle management. So the API is opinionated as to what information should be used for storing in some kind of hierarchy.

Maybe:

context_info? Feels generic.

hierarchy_info? Why is the API telling me about a hierarchy when there isn't an obvious prescription of how to generate the hierarchy? Maybe the API should be offering a visitor for prescriptive ordering and layering of the information.

drewhoskins-temporal · 2026-04-01T21:30:59Z

temporalio/converter/_extstore.py

+    For payloads being stored on behalf of an explicit target (e.g. a child
+    workflow being started, an activity being scheduled, an external workflow
+    being signaled), this is that target's identity.  When no explicit target
+    exists the current execution context (workflow or activity) is used as the


Separate caller_info field? It feels like glossing over this distinction between source and target, or caller and callee, could result in bugs. I would rather callsites explicitly say what they want to refer to.

During code review, we talked about the idea of providing both "caller" and "callee" information and felt that it was ambiguous as to how a driver author was supposed to use them, because you don't always store in the callee context.

In terms of the S3 driver, the overall pattern was that we always want to store the information in the key space that best suits replay and lifecycle management. That mostly meant the "target workflow" (e.g. child workflow, external signal, CaN, completing workflow that does have a parent) but is sometimes the "current workflow" (e.g. workflow activities, completing workflow that doesn't have a parent); sometimes it is the "current activity" (completing standalone activity) but is sometimes the "target activity" (client starts a standalone activity).

Just providing "caller" and "callee" information isn't enough; you also need to know standalone vs workflow activity. Maybe that's all the extra information that was needed, but the algorithm is not easy to implement.

We felt that this determination would be the same across drivers, if they cared to store payloads in a contextual hierarchy. So it was built into the external storage layer rather than just a concern driver authors had to think about.
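The determination described above can be sketched roughly as follows. This is a simplified illustration, not the external storage layer's actual code: `Operation`, `Scope`, and `resolve_store_scope` are hypothetical names, only the cases spelled out in this thread and in the key segment table are covered, and CaN is left out because its handling is still open.

```python
from __future__ import annotations

from enum import Enum


class Operation(Enum):  # hypothetical names for a subset of store operations
    START_CHILD_WORKFLOW = "start_child_workflow"
    SIGNAL_EXTERNAL_WORKFLOW = "signal_external_workflow"
    SCHEDULE_ACTIVITY = "schedule_activity"
    WORKFLOW_RESULT = "workflow_result"
    ACTIVITY_RESULT = "activity_result"
    START_STANDALONE_ACTIVITY = "start_standalone_activity"


class Scope(Enum):  # whose key space the payload is stored under
    TARGET_WORKFLOW = "target workflow"
    CURRENT_WORKFLOW = "current workflow"
    PARENT_WORKFLOW = "parent workflow"
    TARGET_ACTIVITY = "target activity"
    CURRENT_ACTIVITY = "current activity"


def resolve_store_scope(
    op: Operation, *, has_parent: bool = False, standalone: bool = False
) -> Scope:
    """Pick the key space for a store operation (illustrative only)."""
    if op in (Operation.START_CHILD_WORKFLOW, Operation.SIGNAL_EXTERNAL_WORKFLOW):
        # An explicit target exists: the child or the signaled workflow.
        return Scope.TARGET_WORKFLOW
    if op is Operation.SCHEDULE_ACTIVITY:
        # Workflow activities are keyed under the scheduling workflow.
        return Scope.CURRENT_WORKFLOW
    if op is Operation.WORKFLOW_RESULT:
        # A child's result belongs to its parent; otherwise use the workflow itself.
        return Scope.PARENT_WORKFLOW if has_parent else Scope.CURRENT_WORKFLOW
    if op is Operation.ACTIVITY_RESULT:
        # Standalone activities have no parent workflow to store under.
        return Scope.CURRENT_ACTIVITY if standalone else Scope.PARENT_WORKFLOW
    if op is Operation.START_STANDALONE_ACTIVITY:
        return Scope.TARGET_ACTIVITY
    raise ValueError(f"unhandled operation: {op}")
```

Note that `has_parent` and `standalone` are exactly the extra pieces of information that "caller" and "callee" alone would not convey.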

jmaeagle99 requested a review from a team as a code owner March 30, 2026 04:08

Storage driver store context metadata

5a19712

jmaeagle99 force-pushed the extstore-context-info branch from fcb36db to 5a19712 Compare March 30, 2026 04:35

jmaeagle99 added 11 commits March 30, 2026 12:07

Fix test assertions

8f14f6b

Fix test for non-deterministic ordering

50ffea2

Consolidate to single target field

f7ed149

Format

700b73c

Fix workflow activity target and remote redundant tests

31d6509

Fix assertion

9c392c8

Child workflows store payload in parent context

d80338b

Format

3d84175

Fix S3 test

ff30607

Skip standalone activities and schedules in time-skipping environment

77ae64d

Move namespace to info classes

b57c6ec

tconley1428 reviewed Apr 1, 2026

README.md (outdated, resolved)

jmaeagle99 added 3 commits April 1, 2026 10:33

Replace context var with context transform methods

a6c5ca2

Document the store target for workflow commands

e2881d0

Update readme

98677f6

jmaeagle99 requested a review from tconley1428 April 1, 2026 18:05

tconley1428 approved these changes Apr 1, 2026

jmaeagle99 commented Apr 1, 2026

temporalio/converter/_extstore.py (outdated, resolved)

Comment updates

019da3e

jmaeagle99 enabled auto-merge (squash) April 1, 2026 19:26

drewhoskins-temporal reviewed Apr 2, 2026

Merge branch 'main' into extstore-context-info

b3759a4

jmaeagle99 disabled auto-merge April 2, 2026 15:58

Merge branch 'main' into extstore-context-info

718ef3c

jmaeagle99 wants to merge 18 commits into temporalio:main from jmaeagle99:extstore-context-info
3 participants