marko-kraemer
99e4ba0574
merge
2025-07-30 16:54:54 +02:00
marko-kraemer
696dd3722f
wip
2025-07-28 19:25:52 +02:00
marko-kraemer
4bce5d799f
wip
2025-07-28 14:08:56 +02:00
marko-kraemer
b9a4beb136
wip
2025-07-28 13:13:42 +02:00
marko-kraemer
2b31379b89
wip
2025-07-28 12:53:36 +02:00
marko-kraemer
a660919dec
wip
2025-07-28 11:53:10 +02:00
mykonos-ibiza
f4df0087b1
fix model name for anthropic model before fallback to OpenRouter
2025-07-25 12:59:08 +05:30
Krishav Raj Singh
e4a6f5a1ef
fix: auto continue response if finish_reason is length
2025-07-24 20:22:44 +05:30
Sharath
8cbaf3d44a
fix(backend): anthropic overloaded fallback failure ( #929 )
2025-07-11 18:36:15 +05:30
marko-kraemer
b0ab413866
fix: optimize blocking command execution with marker-based completion detection
...
- Use unique completion markers instead of unreliable prompt detection
- Reduce polling from 2s to 0.5s intervals for faster response
- Fix SessionExecuteRequest import path
- Commands now return immediately upon completion vs waiting full timeout
2025-07-06 20:07:26 +02:00
marko-kraemer
dc5496ae94
wip
2025-07-06 16:40:27 +02:00
marko-kraemer
6a6b9d8e85
refactor wip
2025-07-05 18:07:35 +02:00
Soumyadas15
18057a4db6
chore(dev): agent agnostic threads
2025-06-26 16:59:04 +05:30
sharath
38f4355838
fix(thread_manager): implement batching for message retrieval to enhance database performance and mitigate db limits
2025-06-22 16:42:55 +00:00
sharath
6501543814
feat(thread_manager): implement middle-out message compression to optimize message handling
2025-06-19 18:05:45 +00:00
sharath
184b54fc8f
fix(thread_manager): enhance content validation to include interactive elements
2025-06-19 14:19:48 +00:00
sharath
e8dbc55165
fix(thread_manager): add message compression by omitting messages to handle token limits
2025-06-18 21:25:32 +00:00
sharath
05a80e5f55
fix(thread_manager): adjust truncation logic and improve token compression handling
2025-06-18 07:25:29 +00:00
sharath
3507230ed0
fix(compress_messages): truncate the middle of last msg and reduce token size
2025-06-18 05:53:51 +00:00
Sharath
85cb4b032b
Update backend/agentpress/thread_manager.py
2025-06-16 14:39:13 +05:30
sharath
e7ef1c64c5
fix(thread_manager): enhance message content parsing and ensure proper compression of prepared messages
2025-06-16 09:07:47 +00:00
sharath
2a49ad8936
fix(thread_manager): improve message handling and update max_tokens calculation
2025-06-16 07:06:24 +00:00
sharath
0e41358047
fix(context): context window compression
2025-06-15 21:14:09 +00:00
sharath
c00497865f
fix(thread_manager): adjust max_tokens calculation for deepseek model
2025-06-06 18:48:55 +00:00
sharath
b5ae395fed
refactor(thread-manager): standardize max_tokens parameter in message compression methods
2025-06-06 09:18:34 +00:00
sharath
93660e4a61
feat(thread-manager): add message compression methods to handle long messages efficiently
2025-06-06 08:56:14 +00:00
Bobbie
1b269a96b6
Merge pull request #658 from escapade-mckv/prod-2a
...
Prod 2a
2025-06-06 13:21:29 +05:30
Sharath
aee5945907
hotfix(context-limit): truncate more content to allow more messages
2025-06-06 12:36:11 +05:30
sharath
11e8b4f7bd
Merge branch 'PRODUCTION' into sync/production
2025-06-05 06:48:53 +00:00
sharath
eda0c69bea
fix(context-limit-retry): don't retry when context limit reached
2025-06-05 06:46:27 +00:00
Soumyadas15
8a7c33c86b
chore(dev): thread compression for new format
2025-06-03 12:02:50 +05:30
Soumyadas15
b42e78b598
chore(dev): agent builder
2025-05-31 21:01:20 +05:30
sharath
c14a2e4cb4
fix(thread_manager): reduce maximum message content length from 300k to 200k characters to prevent context overload
2025-05-29 15:50:29 +00:00
sharath
0bfd7ecb3f
fix(thread_manager): adjust token count condition to handle default max token value
2025-05-29 15:28:53 +00:00
sharath
16ba4f7c36
fix(thread_manager): update message selection query to include only message_id and content
2025-05-29 14:20:57 +00:00
sharath
960dd671f2
feat(context-compression): add ExpandMessageTool for message expansion and integrate into agent run process
2025-05-29 13:47:31 +00:00
Soumyadas15
fb6c130136
chore(dev): fix double escape isue
2025-05-25 02:34:07 +05:30
sharath
7db395dd88
refactor(api): enhance trace handling in response processing
2025-05-24 12:50:36 +00:00
sharath
ff1670be90
feat(backend): langfuse traces
2025-05-21 00:39:28 +00:00
marko-kraemer
c6d13b5187
run, thread_run, response processor, raise errs properly
2025-05-10 03:58:57 +02:00
marko-kraemer
7ff15d8dc7
make non-anthropic model work with response example, fe padding bottom increase cuz of new chat input
2025-05-07 01:56:52 +02:00
marko-kraemer
12704f2786
fix
2025-05-05 02:53:24 +02:00
Korzhavin Ivan
68fafdd385
Refactor agent and tool configurations to support new model names and parameters
...
- Adjusted max_tokens logic based on model type, including new conditions for "gpt-4".
- Enhanced MessageTool to include detailed documentation and examples for user interactions.
- Improved ThreadManager to handle new tool choices and reasoning efforts.
- Updated LLM service to incorporate enable_thinking and reasoning_effort parameters.
- Modified chat input component to dynamically manage model selection and thinking options.
2025-05-03 14:17:46 +02:00
marko-kraemer
973861b2d2
wip
2025-05-03 02:26:25 +02:00
marko-kraemer
f9ca36efa0
wip
2025-04-23 09:13:07 +01:00
LE Quoc Dat
17e3e0e68c
Merge branch 'main' into dat_context
2025-04-18 17:01:59 +01:00
LE Quoc Dat
06f1782f4b
fix nasty bug xN system prompt
2025-04-18 09:09:09 +01:00
LE Quoc Dat
8669e40312
disable context manager
2025-04-18 07:13:44 +01:00
marko-kraemer
582f0d228b
unified message structure on sse stream & message get thread
2025-04-18 06:22:06 +01:00
LE Quoc Dat
c84ee59dc6
thinking & reasoning
2025-04-18 05:49:41 +01:00