This repository was archived by the owner on Jun 5, 2025. It is now read-only.

Add input processing pipeline + codegate-version pipeline step #91

Merged

3 commits merged into stacklok:main from the pipeline branch on Nov 27, 2024
Conversation

@jhrozek jhrozek (Contributor) commented Nov 25, 2024

This adds pipeline processing before the completion is run, where the
request can either be changed or short-circuited. The pipeline consists
of steps; for now we implement a single step, CodegateVersion, that
responds with the codegate version if the verbatim codegate-version
string is found in the input.

The pipeline also passes along a context. For now it is unused, but I
expect this is where we would store extracted code snippets etc.

To avoid import loops, we also move the BaseCompletionHandler class to
a new completion package.

Since the shortcut replies are more or less simple strings, we add yet
another package, providers/formatting, whose responsibility is to
convert the string returned by a shortcut step into the format expected
by the client, meaning either a single reply or a stream of replies in
the LLM-specific format. We use the BaseCompletionHandler as the way to
convert to the LLM-specific format.

Fixes: #93
Related: #45
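
A minimal sketch of the step interface described above (the PipelineContext, PipelineResult, and PipelineStep names and the hard-coded version string are illustrative assumptions, not the exact code in this PR):

```python
from dataclasses import dataclass, field
from typing import Any, Optional


@dataclass
class PipelineContext:
    """Carried through the steps; unused for now, intended to hold extracted code snippets etc."""
    metadata: dict[str, Any] = field(default_factory=dict)


@dataclass
class PipelineResult:
    """Either a (possibly modified) request to forward, or a shortcut text response."""
    request: Optional[dict] = None
    response: Optional[str] = None

    @property
    def shortcuts(self) -> bool:
        return self.response is not None


class PipelineStep:
    """Base class for a single input-processing step."""

    async def process(self, request: dict, context: PipelineContext) -> PipelineResult:
        raise NotImplementedError


class CodegateVersion(PipelineStep):
    """Shortcuts with the codegate version when the verbatim 'codegate-version' string is found."""

    async def process(self, request: dict, context: PipelineContext) -> PipelineResult:
        for message in request.get("messages", []):
            if "codegate-version" in str(message.get("content", "")):
                # placeholder version string for the sketch
                return PipelineResult(response="codegate version: 0.1.0")
        return PipelineResult(request=request)
```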

@lukehinds

Nice work @jhrozek !

@jhrozek jhrozek force-pushed the pipeline branch 2 times, most recently from 0717a9c to c74841e, on November 26, 2024 12:17
@jhrozek jhrozek marked this pull request as ready for review November 26, 2024 12:17
Optional[tuple[str, int]]: A tuple containing the message content and
its index, or None if no user message is found
"""
if request.get("messages") is None:
Contributor

In the case of the llama.cpp provider, the Continue plugin does not use the OpenAI request format with "messages". Instead it sends a request with "prompt"; the prompt string contains all messages, separated by tokens (im_start, im_end).

Contributor Author

Yes, I found that out. That's why I added the condition that just skips the chat request when there is no messages attribute. I'm now working on improvements to the pipeline that would convert the request to the OpenAI format and then convert the augmented OpenAI request back to the model-specific format.
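
For reference, a guard along these lines is what the comment describes; the helper name get_last_user_message is an assumption based on the docstring above, and the body is a sketch rather than the PR's exact code:

```python
from typing import Any, Optional


def get_last_user_message(request: dict[str, Any]) -> Optional[tuple[str, int]]:
    """Return the last user message content and its index, or None when the request
    has no OpenAI-style "messages" list (e.g. llama.cpp requests carrying only "prompt")."""
    messages = request.get("messages")
    if messages is None:
        return None
    for idx in range(len(messages) - 1, -1, -1):
        if messages[idx].get("role") == "user":
            return messages[idx].get("content", ""), idx
    return None
```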

pass


class PipelineProcessor:
Contributor

Since this is a sequential pipeline processor, we could call it SequentialPipelineProcessor.
In the future, we can implement ParallelPipelineProcessor, GraphPipelineProcessor, etc.

Contributor Author

Thank you, good idea. I will do the rename.
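
The rename would only touch the class name; as a rough sketch (reusing the PipelineStep, PipelineContext, and PipelineResult types from the earlier sketch, so not the PR's exact code):

```python
class SequentialPipelineProcessor:
    """Runs the configured steps in order; the first step that shortcuts wins."""

    def __init__(self, steps: list[PipelineStep]):
        self.steps = steps

    async def process_request(self, request: dict) -> PipelineResult:
        context = PipelineContext()
        result = PipelineResult(request=request)
        for step in self.steps:
            # each non-shortcutting step returns the (possibly modified) request
            result = await step.process(result.request, context)
            if result.shortcuts:
                break
        return result
```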

@@ -12,6 +12,7 @@ async def sse_stream_generator(stream: AsyncIterator[Any]) -> AsyncIterator[str]
"""OpenAI-style SSE format"""
try:
async for chunk in stream:
print(chunk)
Contributor

Do we need this print?

Contributor Author

Sorry, of course not; that was leftover debugging.
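
With the debug print dropped, the generator would look roughly like this (the chunk serialization and error handling are assumptions for the sketch, not the file's actual code):

```python
import json
from typing import Any, AsyncIterator


async def sse_stream_generator(stream: AsyncIterator[Any]) -> AsyncIterator[str]:
    """OpenAI-style SSE format, without the leftover print(chunk)."""
    try:
        async for chunk in stream:
            if not isinstance(chunk, str):
                chunk = json.dumps(chunk)
            yield f"data: {chunk}\n\n"
        yield "data: [DONE]\n\n"
    except Exception as exc:
        yield f"data: {json.dumps({'error': str(exc)})}\n\n"
```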

@lukehinds lukehinds merged commit f0c0b38 into stacklok:main Nov 27, 2024
@lukehinds lukehinds deleted the pipeline branch November 27, 2024 10:36
Successfully merging this pull request may close these issues:

Introduce generic top level handler