Strategy to Improve Transform Performance #1247
Replies: 2 comments
-
@kade-robertson Hi!
Thanks for the write up and insights. I do agree TypeBox could be doing a better job here. As for the caching, it's a good optimization. However, as noted on #1150 (and as you also pointed out), adding schema caching internal to TypeBox is generally avoided. The concern isn't just about complexity (there is some), it’s also due to TypeBox being unable to make assumptions about how users will ultimately create, mutate, or reuse schemas throughout the lifecycle of an application. Keeping a global WeakMap (or Set) to track transform schematics can be a problem. If TypeBox detects a Transform on a schema during the first pass and caches that result, problems can occur if the schema is later modified in a way that changes or invalidates that transform / schema. While such mutations would be rare (and generally considered bad practice), the possibility is still non-zero, and that uncertainty makes automatic caching unsafe. TypeBox can't really assert the presence of a Transform via WeakMap via reference key alone, it needs deep introspection to know for sure. Transform optimization is a difficult problem. Previous attempts to provide internal caching have generally been met with edge cases that have proven difficult to resolve. As of today, the current TypeCheck implementation provides a https://github.com/sinclairzx81/typebox/blob/master/src/compiler/compiler.ts#L93-L96 Moving forward, TypeBox is trying to expose functions that make it feasible to implement high throughput decode external to the library (inclusive of external caching). I feel this is generally the best direction the library can go as it provides implementers more control over validation, transforms and performance (including having a means to select performance trade-offs based on the compute overhead of some of the If possible, I think it would be good to explore a external caching implementations via the current API, then discuss ways to improve that usage. The Let me know your thoughts! |
-
This makes sense to me. I had already finished implementing these before posting here; that's when the idea of upstreaming some of the implementation came up. As far as making this more feasible to implement externally, these were the things I ran into that would aid in maintaining an external Encode/Decode implementation, roughly in order of how straightforward I'd expect them to be to implement.
-
Hello!
In using TypeBox for some particularly large data, I've run into Transform types being a bit of a bottleneck on performance. Especially relative to Check performance when using a TypeCompiler, Decode ends up being too slow in its current implementation to make sense for the use case I'm looking at. Taking a look at the implementation of transforms, I noticed an opportunity for some pretty meaningful improvements.
The primary improvement is to avoid traversing a schema if it contains no transforms. Right now, the transform methods have to "walk" the entire object and schema to apply transforms, even down branches that contain nothing to apply. You can get a pretty significant improvement just by checking whether a schema contains a transform somewhere, and returning early when a branch has none. This looks like a small modification to `Visit`.

I can't share anything substantial from the dataset I had been using when originally experimenting with this, but I made a superficial schema for benchmarking.
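The early-return idea can be sketched as follows. This is a minimal, self-contained illustration, not TypeBox's actual code: `Schema`, `hasTransform`, and `visit` are simplified stand-ins for `TSchema`, `HasTransform`, and the internal `Visit`. The visitor bails out of any branch that cannot contain a transform before walking the value.

```typescript
// Simplified stand-in for TSchema, with an optional transform function
// standing in for a Transform codec's Decode step.
type Schema = {
  transform?: (value: unknown) => unknown
  properties?: Record<string, Schema>
  items?: Schema
}

// Deep check: does any node in this schema tree carry a transform?
function hasTransform(schema: Schema): boolean {
  if (schema.transform !== undefined) return true
  if (schema.items !== undefined && hasTransform(schema.items)) return true
  return Object.values(schema.properties ?? {}).some(hasTransform)
}

function visit(schema: Schema, value: unknown): unknown {
  // Early return: nothing below this branch can change the value,
  // so skip walking it entirely.
  if (!hasTransform(schema)) return value
  if (schema.items !== undefined && Array.isArray(value)) {
    const items = schema.items
    value = value.map(item => visit(items, item))
  }
  if (schema.properties !== undefined && typeof value === 'object' && value !== null) {
    const result: Record<string, unknown> = { ...(value as Record<string, unknown>) }
    for (const [key, child] of Object.entries(schema.properties)) {
      if (key in result) result[key] = visit(child, result[key])
    }
    value = result
  }
  return schema.transform ? schema.transform(value) : value
}
```

Note that when a branch has no transforms, `visit` returns the input value by reference without allocating anything, which is where the savings on "sparse" data come from.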
Generating some input data where each array on this top-level object has 10,000 items, `HasTransform` dropped the average `Decode` and `Encode` runtime from 114ms down to 68ms. On "sparse" data (where the schema has been modified to have only a single transformed field at the top level), it drops further to 26ms. This performance improvement can be even more significant depending on the level of nesting or the quantity of data that could possibly be transformed.

There is also some room for improvement still: `HasTransform` has to traverse the input schema to look for transforms, and this can run many times per schema (i.e. for an array with many items, we are forced to check the same schema for every single item). This should be pretty safe to cache within a Transform step, using a `WeakMap<TSchema, boolean>` to store the result of `HasTransform`. (This could just be implemented by passing the cache down to each function involved; a shared map is simply easier to demonstrate.)
This is faster, but the gain is less significant than before: the "dense" example from above only drops from 68ms to 53ms, and the sparse example stays the same. Personally, I'm using these together as custom Encode/Decode steps, and the change is surprisingly significant (~2 orders of magnitude faster) for a fairly complex and nested schema.

I'd be interested in opening a PR for this change (or something that achieves the same effect), as it would save me from maintaining a separate Encode/Decode implementation (which requires cloning some other files to include internal TypeBox code that doesn't get exported), and which then has to be used directly rather than via customizing Parse or the Encode/Decode methods on a `TypeCompiler` result, since those won't use the "fast" version. I noticed, however, that in #1150 you had mentioned avoiding caching previously due to complexity concerns, so maybe that aspect is something you aren't looking to bring into the main project.