-
Notifications
You must be signed in to change notification settings - Fork 739
[Fearture] Support mm model close prefix cache #4459
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 21 commits
Commits
Show all changes
32 commits
Select commit
Hold shift + click to select a range
4bc170c
[Feature] support prefix cache in DP
1851e80
fix
983a8a8
Merge branch 'develop' into update_ep
ltd0924 2a9e046
Update common_engine.py
ltd0924 58fced4
Merge branch 'develop' into update_ep
ltd0924 f5733dd
Update common_engine.py
ltd0924 427cf47
Update common_engine.py
ltd0924 00bdcc2
Update common_engine.py
ltd0924 0822d63
[BugFix] fix workers more than 1
85b6990
Merge branch 'develop' into update_ep
ltd0924 0acf059
fix
667d146
Update api_server.py
ltd0924 a03dfe6
fix
141abd7
Update api_server.py
ltd0924 1f07ecd
fix
a531165
Merge branch 'develop' into update_ep
ltd0924 3670530
Merge branch 'develop' into update_ep
ltd0924 ab6f741
Merge branch 'develop' into update_ep
ltd0924 90cd313
[Fearture] Support mm model close prefix cache
e60d098
Merge branch 'develop' into update_ep
ltd0924 0053f17
Update api_server.py
ltd0924 03d9f22
Update engine_client.py
ltd0924 9ade15e
Update engine_client.py
ltd0924 7a93f0c
add test
a38a272
Merge branch 'develop' into update_ep
ltd0924 842cde7
Update test_chat.py
ltd0924 bd4ec3c
fix
b3112ba
Merge branch 'develop' into update_ep
ltd0924 33d9093
fix
334d29a
Update test_chat.py
ltd0924 6957bdc
Update test_chat.py
ltd0924 2cdcc02
Merge branch 'develop' into update_ep
Jiang-Jia-Jun File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The current service does not support processing requests containing multimodal data when prefix cache is enabled. Please send only text-based requests or disable prefix cache
报错信息改成这个吧