Structured output #75

whereisfarukk · 2025-12-12T17:39:48Z

No description provided.

.

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

…ield

.

gemini-code-assist · 2025-12-12T17:40:12Z

Summary of Changes

Hello @whereisfarukk, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly expands the capabilities of the VlmRun node by introducing a robust 'Chat Completion' feature. This enhancement allows users to leverage advanced conversational AI, supporting diverse input types like text, images, videos, and files, and providing precise control over output formatting through JSON schemas. The changes aim to broaden the node's utility for complex, multi-modal AI workflows.

Highlights

New Chat Completion Operation: A new 'Chat Completion' operation has been added to the VlmRun node, enabling users to interact with an OpenAI-compatible chat API.
Multi-modal Input Support: The new chat completion feature supports multi-modal inputs, allowing users to include text, image URLs, video URLs, and file URLs within their chat prompts.
Structured Output Control: Users can now specify the desired response format, including options for simple JSON object output or detailed JSON schema-guided responses, enhancing control over AI output.
API Integration and Error Handling: The VlmRunClient has been updated to integrate with a new /openai/chat/completions endpoint and includes enhanced error logging for better debugging.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces a 'Chat Completion' operation, enabling structured output capabilities. The implementation is comprehensive, adding new types, API service methods, and node properties. My review focuses on improving code quality by addressing code duplication, strengthening validation, and removing debug statements that could leak information. Overall, this is a great feature addition.

nodes/VlmRun/VlmRunClient.ts

gemini-code-assist · 2025-12-12T17:41:40Z

nodes/VlmRun/VlmRun.node.ts

+						// Process image URLs
+						const imageUrls: string[] = [];
+						if (inputType === 'image') {
+							const imageUrlsParam = this.getNodeParameter('imageUrls', i) as IDataObject;
+							if (imageUrlsParam && imageUrlsParam.url) {
+								const urlEntries = Array.isArray(imageUrlsParam.url) ? imageUrlsParam.url : [imageUrlsParam.url];
+								for (const entry of urlEntries) {
+									if (entry && typeof entry === 'object' && entry.url) {
+										const url = entry.url as string;
+										if (url && url.trim()) {
+											imageUrls.push(url.trim());
+										}
+									}
+								}
+							}
+						}
+
+						// Process video URLs
+						const videoUrls: string[] = [];
+						if (inputType === 'video') {
+							const videoUrlsParam = this.getNodeParameter('videoUrls', i) as IDataObject;
+							if (videoUrlsParam && videoUrlsParam.url) {
+								const urlEntries = Array.isArray(videoUrlsParam.url) ? videoUrlsParam.url : [videoUrlsParam.url];
+								for (const entry of urlEntries) {
+									if (entry && typeof entry === 'object' && entry.url) {
+										const url = entry.url as string;
+										if (url && url.trim()) {
+											videoUrls.push(url.trim());
+										}
+									}
+								}
+							}
+						}
+
+						// Process file URLs
+						const fileUrls: string[] = [];
+						if (inputType === 'file') {
+							const fileUrlsParam = this.getNodeParameter('fileUrls', i) as IDataObject;
+							if (fileUrlsParam && fileUrlsParam.url) {
+								const urlEntries = Array.isArray(fileUrlsParam.url) ? fileUrlsParam.url : [fileUrlsParam.url];
+								for (const entry of urlEntries) {
+									if (entry && typeof entry === 'object' && entry.url) {
+										const url = entry.url as string;
+										if (url && url.trim()) {
+											fileUrls.push(url.trim());
+										}
+									}
+								}
+							}
+						}


The logic for processing URLs from imageUrls, videoUrls, and fileUrls parameters is duplicated. To improve code maintainability and reduce redundancy, this logic can be extracted into a helper function.

const getUrlsFromParam = (paramName: string): string[] => { const urls: string[] = []; const param = this.getNodeParameter(paramName, i) as IDataObject; if (param && param.url) { const urlEntries = Array.isArray(param.url) ? param.url : [param.url]; for (const entry of urlEntries) { if (entry && typeof entry === 'object' && entry.url) { const url = entry.url as string; if (url && url.trim()) { urls.push(url.trim()); } } } } return urls; }; // Process URLs const imageUrls: string[] = inputType === 'image' ? getUrlsFromParam('imageUrls') : []; const videoUrls: string[] = inputType === 'video' ? getUrlsFromParam('videoUrls') : []; const fileUrls: string[] = inputType === 'file' ? getUrlsFromParam('fileUrls') : [];

gemini-code-assist · 2025-12-12T17:41:40Z

nodes/VlmRun/VlmRun.node.ts

+                                if (!message.content) {
+									throw new NodeOperationError(
+										this.getNode(),
+										'Text message content cannot be empty.',
+									);
+								}


The current validation for text message content allows strings that contain only whitespace. It would be more robust to treat such strings as empty.

if (!message.content || !message.content.trim()) { throw new NodeOperationError( this.getNode(), 'Text message content cannot be empty.', ); }

gemini-code-assist · 2025-12-12T17:41:40Z

nodes/VlmRun/VlmRun.node.ts

+
+						response = await ApiService.chatCompletion(this, messages, model, undefined, responseFormat);
+
+						console.log(JSON.stringify(response, null, 2));


This console.log statement appears to be for debugging. It should be removed before merging to avoid cluttering the logs in a production environment.

whereisfarukk and others added 18 commits December 1, 2025 20:36

feat: added chat completion

af488de

feat: added chat completion

aeae6e6

Merge branch 'main' of https://github.com/whereisfarukk/n8n-nodes-vlmrun

a92c27f

.

fix: code format issue

3bc5097

Update nodes/VlmRun/VlmRun.node.ts

8e391ce

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

fix: removed max token and response format from chat completion ui

18e2aae

feat: added video url capability

cd8753d

feat: added image ,video, file and model choosing options

a0f3559

feat: added simplify output toggle and capability

2333a06

refactor(output): enhance simplified JSON structure with additional f…

e7c6742

…ield

chore(output): update description for message content flag

3ab6e19

Merge branch 'main' of https://github.com/whereisfarukk/n8n-nodes-vlmrun

7a7910f

.

feat: Output Content as JSON added

01c85b7

feat: added user defined structured output

b694615

feat:slight modification in structured output

f3a2648

feat: added working code for structured output

4f2f924

feat: completed implementing chat completion

4224712

fix: linting issue

de4aea1

gemini-code-assist bot reviewed Dec 12, 2025

View reviewed changes

Merge branch 'main' into structured_output

3b2e66a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Structured output #75

Structured output #75

Uh oh!

whereisfarukk commented Dec 12, 2025

Uh oh!

gemini-code-assist bot commented Dec 12, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

gemini-code-assist bot Dec 12, 2025

Uh oh!

gemini-code-assist bot Dec 12, 2025

Uh oh!

gemini-code-assist bot Dec 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant


		response = await ApiService.chatCompletion(this, messages, model, undefined, responseFormat);

		console.log(JSON.stringify(response, null, 2));

Structured output #75

Are you sure you want to change the base?

Structured output #75

Uh oh!

Conversation

whereisfarukk commented Dec 12, 2025

Uh oh!

gemini-code-assist bot commented Dec 12, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

gemini-code-assist bot Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Dec 12, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant