Add an agent in production example #3890
TL;DR: I like the high-level idea (which you demoed to us previously), but I hope we can remove as much of the non-critical complexity as possible to make the ZenML parts shine a bit more?
Duplication
There seems to be quite a bit of duplication between llm_utils.py and steps/analyze.py: both have LLM availability-checking functions, both implement the document analysis check, and the text cleaning and extraction functions are duplicated. llm_utils.py also looks mostly unused in the actual pipeline, but I might be wrong about that? steps/utils.py has some duplication too:
llm_utils.py and steps/utils.py have duplicate stop-word / filter lists (though not fully overlapping?).
steps/render.py, steps/evaluate.py, and app/main.py all have a bunch of inline CSS which could probably be consolidated?
There are two different definitions of DocumentRequest: in models.py and in app/main.py. This is confusing.
Showcasing ZenML
Where possible, we should try to pull the HTML / CSS code out into separate files / modules, as I think it gets in the way of understanding what ZenML is doing.
Overly complicated stuff
The multiprocessing stuff in app/main.py feels a bit complex. Maybe just use two endpoints: POST /analyze, which submits the pipeline and returns a run_id, and GET /result?run_id=..., which polls until the artifact is ready. That removes a ton of probably unnecessary complexity which doesn't really add to the learning experience for a user / reader of the guide.
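The two-endpoint shape could be sketched roughly like this. This is a pure-Python stand-in, not the PR's code: `submit_analysis` and `get_result` play the roles of the POST and GET handlers, and the simulated completion replaces the actual ZenML pipeline run.

```python
import uuid

# In-memory store: run_id -> result (None while the pipeline is still running).
RUNS = {}

def submit_analysis(document: str) -> str:
    """Role of POST /analyze: kick off the work, return a run_id immediately."""
    run_id = str(uuid.uuid4())
    RUNS[run_id] = None  # pending
    # Real app: trigger the ZenML pipeline here; a background worker would
    # fill in RUNS[run_id] when the artifact lands. We simulate completion:
    RUNS[run_id] = {"summary": f"analysed {len(document)} chars"}
    return run_id

def get_result(run_id: str) -> dict:
    """Role of GET /result?run_id=...: report pending or return the artifact."""
    if run_id not in RUNS:
        raise KeyError(run_id)
    result = RUNS[run_id]
    if result is None:
        return {"status": "pending"}
    return {"status": "done", "result": result}
```

The client then simply retries the GET until `status` flips to `done`, which keeps all the process-management out of the web layer.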
There are a bunch of places where the try...except pattern is overdone, maybe to the point where you are actually smoothing over errors that should get raised. E.g. in steps/analyze.py, analyze_document_step has a try...except block that calls perform_llm_analysis; inside perform_llm_analysis, there is another try...except block that catches a failure and then re-raises it.
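A sketch of the single-boundary alternative: the inner helper raises freely, and only the step catches, adding context exactly once. The function bodies here are illustrative stand-ins, not the PR's actual logic.

```python
def perform_llm_analysis(content: str) -> dict:
    # No try/except here: a failure should surface to the caller unchanged.
    if not content:
        raise ValueError("empty document")
    return {"summary": content[:50]}  # stand-in for the real LLM call

def analyze_document_step(content: str) -> dict:
    # One boundary, one place that adds context to the error.
    try:
        return perform_llm_analysis(content)
    except Exception as exc:
        raise RuntimeError("document analysis failed") from exc
```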
Under-complicated stuff
Wondering if it's too simplistic to be called an 'agent'. Maybe it doesn't really matter to showcase the pattern, but I guess it's stretching the limits of the word 'agent'.
Random stuff
Lots of print() statements, which makes the output quite verbose. Maybe turn them into debug logging statements? (Also, in general, prefer logging over print.)
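A minimal sketch of the print-to-logging swap; the logger name and the `analyze` function are illustrative, not from the example's code.

```python
import logging

logger = logging.getLogger("document_agent")  # illustrative name

def analyze(filename: str) -> str:
    logger.debug("Starting analysis for %s", filename)  # was: print(...)
    result = f"analysed {filename}"
    logger.info("Finished analysis for %s", filename)
    return result
```

With this, the chatty per-step messages disappear by default and come back with `logging.basicConfig(level=logging.DEBUG)` when someone actually needs them.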
Remove langfuse? It's mentioned a few times but it's never really used anywhere so it just adds confusion (and an extra dependency).
Feels like we might have wanted to use the model control plane to unify some of this behaviour / logic etc, but also maybe we don't want to foreground that so much?
try:
    # Try different ways to access the output
    if hasattr(step, "output") and step.output:
        print("Trying step.output")
        analysis_result = step.output.load()
        print(f"Successfully loaded from step.output: {type(analysis_result)}")
    elif hasattr(step.outputs, "items"):
        print("Trying step.outputs.items()")
        for output_name, output_artifact in step.outputs.items():
            print(f"Trying to load output: {output_name}, type: {type(output_artifact)}")
            analysis_result = output_artifact.load()  # type: ignore
            print(f"Successfully loaded: {type(analysis_result)}")
            break
    elif isinstance(step.outputs, dict):
        print("Outputs is dict, getting first value")
        first_output = next(iter(step.outputs.values()))
        analysis_result = first_output.load()  # type: ignore
        print(f"Successfully loaded from first output: {type(analysis_result)}")
    else:
        print(f"Unknown outputs structure: {type(step.outputs)}")
This stuff feels a bit ugly. Probably a nicer way to handle this.
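One possible cleanup, assuming the step exposes its outputs as a name-to-artifact mapping (which the quoted code already relies on): for a step with a single output, the whole hasattr-probing chain can collapse into one small helper. A sketch; the helper name is mine, and the `_Step`/`_Artifact` shapes in a real run would be ZenML's own objects.

```python
def load_single_output(step):
    """Load the only output artifact of a step, failing loudly otherwise."""
    artifacts = list(step.outputs.values())
    if len(artifacts) != 1:
        raise ValueError(f"Expected exactly one output, found {len(artifacts)}")
    return artifacts[0].load()
```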
model_version = client.get_model_version(
    "document_analysis_agent", "production"
)
We sort of just assume that this exists already?
Yeah, I think it's a pre-req.
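If it stays a pre-req, a small guard could at least fail fast with an actionable message instead of an opaque lookup error. A sketch: `require_model_version` and the "run the training pipeline" hint are mine, not part of the PR.

```python
def require_model_version(client, name: str, stage: str = "production"):
    """Fetch a model version, turning a missing pre-req into a clear error."""
    try:
        return client.get_model_version(name, stage)
    except Exception as exc:  # the client raises if the version doesn't exist
        raise RuntimeError(
            f"Model '{name}' has no '{stage}' version yet. "
            "Run the training pipeline first to register it."
        ) from exc
```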
# Run the pipeline in a separate process to avoid signal handling in worker thread
tmp_dir = tempfile.mkdtemp(prefix="doc_analysis_")
we never actually clean this up at the end, I think... maybe do that?
def perform_llm_analysis(
    content: str,
    filename: str,
    model: str = "gpt-3.5-turbo",
maybe 4o-mini instead? 3.5 turbo might actually be more expensive, even, and much less capable. or use one of the 5-mini or 5-nano to make the example a bit more future-forward?
app.add_middleware(
    CORSMiddleware,
    allow_origins=["*"],
    allow_credentials=True,
    allow_methods=["*"],
    allow_headers=["*"],
)
Maybe add a comment that this is for the dev example only, or something? Otherwise it's just a bit too permissive.
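Or gate the permissive config behind an environment variable so `"*"` only ever applies locally. A sketch under assumptions: the `APP_ENV` and `ALLOWED_ORIGINS` variable names are mine, not part of the example.

```python
import os

def cors_origins() -> list:
    """Dev gets a wildcard; any other environment must name its origins."""
    if os.environ.get("APP_ENV", "dev") == "dev":
        return ["*"]  # dev only: never ship allow_origins=["*"] with credentials
    return [o for o in os.environ.get("ALLOWED_ORIGINS", "").split(",") if o]
```

The result would then be passed as `allow_origins=cors_origins()` in the middleware call above.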
raise

def perform_deterministic_analysis(
Not sure if we ever want to be doing this? Consider removing? Feels like unnecessary complexity.
* Add ZenML introduction to README.md
* added agent example
* Update evaluation pipeline max items to 100, 125, and 125
* Add document analysis pipeline and evaluation pipeline
* Code review
* Simplify analysis result extraction from artifacts
* Refactor artifact loading and handling logic
* Add "Your First AI Pipeline" section to table of contents
* Add your first AI pipeline example for ZenML
* Add login command for ZenML dashboard
* Update Docker command for container startup
* Update Docker compose command to start containers
* Add architecture overview and screenshots
* Add figures for pipeline visualization in ZenML dashboard
* Add reasons for using pipelines in AI/ML workflows
* Revise AI pipeline benefits in quickstart guide
* Update AI pipeline architecture diagram layout
* Update flowchart orientation to top to bottom
* Add architecture overview in getting started guide
* Add example installation instructions and architecture overview
* Add whitespace and update comments for clarity
* Update AI pipeline and examples with new links and services
* Update ZenML hello-world.md with login step & new links
* Refactor quickstart README for clarity and consistency
* Add FloraCast and Retail Forecast to projects list
* Update mypy configuration to ignore missing imports
* Update ignore_missing_imports setting in pyproject.toml
* Update links in README and fix relative path in docs
* Add initial pipeline analysis and evaluation diagrams
* Images
* Delete test.html and add architecture.png image
* Optimised images with calibre/image-actions
* Update Argilla documentation links to correct URLs
* Update docs/book/getting-started/your-first-ai-pipeline.md
* Update docs/book/getting-started/your-first-ai-pipeline.md
* Add new ML pipeline examples to README.md
* Update README with new example images and links
* Add example-06 and example-07 images to user guide
* Add ZenML Pro cover image

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: Alex Strick van Linschoten <[email protected]>
(cherry picked from commit bdd3319)
Describe changes
What’s this PR about?
TL;DR version:
Adds a ready-to-use “document analysis agent” showing how ZenML pipelines can integrate with a FastAPI service and UI, plus simple docs to walk you through it.
Adds a production-ready example to the ZenML project that demonstrates how to build a document-analysis agent using FastAPI, ZenML pipelines, and a simple UI. [+3361 / −26, 39 files updated]
New in the PR:
* Example app (examples/minimal_agent_production/)
* Docs updates
* Project config tweak
Why it matters for newcomers:
This gives you a plug-and-play template showcasing how to connect ZenML pipelines, an agentic workflow, and a web UI in one cohesive example.
Pre-requisites
Please ensure you have done the following:
* My branch is based on develop and the open PR is targeting develop. If your branch wasn't based on develop, read the contribution guide on rebasing your branch to develop.
Types of changes