Replies: 2 comments 2 replies
-
Three main approaches I have in mind so far:
(1) Codegen - generate a Playwright script or similar
(2) LLM-assisted - run a cached workflow/test with a smaller LLM, based on a plan already executed by a bigger one
(3) Embeddings - "trajectory shift detection"
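To make (3) a bit more concrete, here is a minimal sketch of what "trajectory shift detection" could look like. It assumes each step of the recorded run and of the live run has already been embedded as a numeric vector by some model; the names `cosineSimilarity` and `firstShiftedStep` and the 0.9 threshold are made up for illustration, not part of any existing API.

```typescript
// Hypothetical sketch: compare per-step embeddings of a recorded run
// against the live run and report where they start to diverge.
type Embedding = number[];

function cosineSimilarity(a: Embedding, b: Embedding): number {
  if (a.length !== b.length || a.length === 0) {
    throw new Error("embeddings must be non-empty and of equal length");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  // A zero vector has no direction; treat it as "not similar".
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Returns the index of the first step whose similarity falls below the
// threshold, or -1 if every compared step stays on the recorded trajectory.
// The default threshold is an arbitrary placeholder and would need tuning.
function firstShiftedStep(
  recorded: Embedding[],
  live: Embedding[],
  threshold = 0.9,
): number {
  const steps = Math.min(recorded.length, live.length);
  for (let i = 0; i < steps; i++) {
    if (cosineSimilarity(recorded[i], live[i]) < threshold) return i;
  }
  return -1;
}
```

A non-negative result would be the point where the cheap replay stops and the bigger LLM takes over.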
A good way to look at this problem is probably as two key components:
(2) Adaptation trigger
-
Here's a rough draft that covers a potential approach. I may have glossed over something that is more complicated than I'm thinking, so definitely call out anything that seems off.
Addressing the Core Components
Caching is used only when both validations succeed. If either check fails, the system deems the entry stale and falls back to the LLM.
The High-Level Architecture
The system consists of three layers:
Two-Tier Cache Lookup:
Integration Touch-Points
Cached Surfaces & Key Generation
Key Generation Details:
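As a rough sketch of how a key might be derived: assume the key is a hash over the URL, the instruction, and the DOM after a list of normalizers has been applied. The regex patterns, the `cacheKey` signature, and the choice of FNV-1a (a simple non-cryptographic hash standing in for whatever the real system would use) are all assumptions for illustration only.

```typescript
// Hypothetical sketch: normalize the DOM, then hash it into a cache key.
type DomNormalizer = (html: string) => string;

// Illustrative normalizers; the patterns are guesses, not the real defaults.
const stripTimestamps: DomNormalizer = (html) =>
  html.replace(
    /\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}(?:\.\d+)?(?:Z|[+-]\d{2}:\d{2})?/g,
    "",
  );

const stripDataReactIds: DomNormalizer = (html) =>
  html.replace(/\sdata-reactid="[^"]*"/g, "");

// Apply each normalizer in order, feeding the output of one into the next.
function normalizeDom(html: string, normalizers: DomNormalizer[]): string {
  return normalizers.reduce((acc, normalize) => normalize(acc), html);
}

// FNV-1a (32-bit), written with shifts so it stays within 32-bit arithmetic.
function fnv1a(input: string): string {
  let hash = 0x811c9dc5;
  for (let i = 0; i < input.length; i++) {
    hash ^= input.charCodeAt(i);
    // Equivalent to multiplying by the FNV prime 0x01000193 modulo 2^32.
    hash += (hash << 1) + (hash << 4) + (hash << 7) + (hash << 8) + (hash << 24);
  }
  return ("00000000" + (hash >>> 0).toString(16)).slice(-8);
}

// Join the parts with a separator that cannot appear in a URL.
function cacheKey(
  url: string,
  instruction: string,
  html: string,
  normalizers: DomNormalizer[],
): string {
  const normalized = normalizeDom(html, normalizers);
  return fnv1a([url, instruction, normalized].join("\u0000"));
}
```

With this shape, two snapshots of the same page that differ only in a timestamp or a `data-reactid` value produce the same key, which is the whole point of normalizing before hashing.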
The DOM Normalization Pipeline
To combat UI noise (timestamps, ads, dynamic IDs), the DOM passes through a configurable list of normalizers:
// Example configuration
startBrowserAgent({
caching: {
domNormalizers: [stripTimestamps, stripDataReactIds, /* custom user normalizer */]
}
});
Default normalizers strip common volatile elements. Users can provide custom normalizer functions.
LLM-Assisted Normalization:
Validation & Failure Strategy
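The rule stated earlier (use the cache only when both validations succeed, otherwise treat the entry as stale and fall back to the LLM) could be sketched roughly as follows. The validators are passed in as plain functions because what they check is not pinned down here. `resolveAction`, `planWithLlm`, and the step of storing the fresh LLM result back into the cache are assumptions, and everything is kept synchronous for brevity where a real implementation would be async.

```typescript
// Hypothetical sketch: validate a cached entry, fall back to the LLM if stale.
interface CacheEntry<A> {
  action: A;
}

type Validator<A> = (entry: CacheEntry<A>) => boolean;

interface Resolution<A> {
  action: A;
  source: "cache" | "llm";
}

function resolveAction<A>(
  key: string,
  cache: Record<string, CacheEntry<A>>,
  validators: Validator<A>[],
  planWithLlm: () => A,
): Resolution<A> {
  const entry = cache[key];
  if (entry) {
    // The cached action is only trusted if every validation passes.
    const allValid = validators.every((validate) => validate(entry));
    if (allValid) {
      return { action: entry.action, source: "cache" };
    }
    // Any failed check marks the entry as stale, so evict it.
    delete cache[key];
  }
  // Cache miss or stale entry: ask the LLM for a fresh action.
  const action = planWithLlm();
  // Assumption: the fresh result replaces the stale entry for next time.
  cache[key] = { action };
  return { action, source: "llm" };
}
```

Evicting on the first failed check keeps the failure path simple: a stale entry is never replayed twice, and the next run starts from whatever the LLM produced most recently.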
Invalidation Rules
-
Starting point for discussion of approaches to implementing action caching: the ability for the LLM to save a workflow or test and re-run it the same way in the future at much lower cost.