- Enhance message streaming with more precise content and tool call handling
- Add logic to only send and store meaningful assistant messages and tool calls
- Prevent sending empty or redundant messages during stream processing
- Improve tool call and content update tracking in agent stream method
- Optimize message inclusion in recursive thread generation
- Add support for an initial non-tool response followed by tool-enabled processing
- Implement `PendingToolCall` to manage incremental tool call updates (sketched below)
- Update `Delta` and related types to support more flexible tool call streaming
- Modify agent stream processing to handle tool calls with improved state management
- Add robust handling for tool call deltas and function call arguments
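
A minimal sketch of how `PendingToolCall` might accumulate streamed fragments; the `ToolCallDelta` shape and the field names are assumptions modeled on OpenAI-style deltas, not the crate's actual types.

```rust
/// Accumulates incremental tool call fragments from a streaming response.
#[derive(Debug, Default, Clone)]
struct PendingToolCall {
    id: Option<String>,
    name: Option<String>,
    arguments: String, // JSON arguments arrive as partial string chunks
}

/// A single streamed fragment, mirroring an OpenAI-style delta.
struct ToolCallDelta {
    id: Option<String>,
    name: Option<String>,
    arguments: Option<String>,
}

impl PendingToolCall {
    /// Merge one delta into the accumulated state.
    fn apply(&mut self, delta: &ToolCallDelta) {
        if let Some(id) = &delta.id {
            self.id = Some(id.clone());
        }
        if let Some(name) = &delta.name {
            self.name = Some(name.clone());
        }
        if let Some(args) = &delta.arguments {
            self.arguments.push_str(args); // concatenate partial JSON
        }
    }

    /// A call is ready once it has an id, a name, and parseable JSON arguments.
    fn is_complete(&self) -> bool {
        self.id.is_some()
            && self.name.is_some()
            && serde_json::from_str::<serde_json::Value>(&self.arguments).is_ok()
    }
}
```
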
- Move hardcoded agent prompt to a constant `AGENT_PROMPT` in the `agent_thread.rs` file
- Update the prompt's date from January 27, 2025 to February 7, 2025
- Simplify thread initialization by referencing the new constant
- Maintain existing prompt structure and guidelines
- Implement a maximum recursion depth of 30 for agent thread processing
- Add recursion depth tracking to prevent potential infinite loops (sketched below)
- Provide user-friendly message when maximum recursion depth is reached
- Update debug logging to include current recursion depth
- Modify both synchronous and streaming thread processing methods
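
A sketch of the depth guard; `MAX_RECURSION_DEPTH` matches the stated limit of 30, while the function signature and the user-facing message are illustrative assumptions.

```rust
const MAX_RECURSION_DEPTH: u32 = 30;

async fn process_thread_with_depth(depth: u32) -> anyhow::Result<String> {
    if depth >= MAX_RECURSION_DEPTH {
        // Return a user-friendly message instead of recursing indefinitely.
        return Ok(
            "I couldn't finish this request within the allowed number of steps. \
             Please try rephrasing it."
                .to_string(),
        );
    }
    tracing::debug!(depth, "processing agent thread");
    // ... run the LLM, execute any tool calls, then call again with depth + 1 ...
    Ok(String::new())
}
```
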
- Refactor agent stream processing to use a recursive approach for handling tool calls
- Enhance tool execution with more robust error handling and result tracking
- Improve stream chunk processing with detailed state management
- Add support for recursive thread generation based on tool call results
- Implement cloning for LiteLLMClient to support stream processing tasks
- Update search data catalog and file search tools to use consistent JSON response structure
- Enforce strict response format with mandatory "results" key and comprehensive field requirements
- Add explicit guidelines for result composition and handling of empty result sets
- Improve response predictability and parsing reliability for AI-powered search tools
- Add `stream: Some(false)` to file search and data catalog tools
- Make `json_schema` optional in `ResponseFormat` serialization (sketched below)
- Enhance logging in search file tool with debug and warning messages
- Improve error context when parsing LLM JSON responses
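
Making `json_schema` optional in serialization typically relies on `skip_serializing_if`, as in this sketch; the exact struct layout is an assumption based on the OpenAI-style `response_format` shape.

```rust
use serde::{Deserialize, Serialize};

#[derive(Debug, Serialize, Deserialize)]
pub struct ResponseFormat {
    #[serde(rename = "type")]
    pub format_type: String, // e.g. "json_object" or "json_schema"
    // Omitted from the serialized payload entirely when None, so plain
    // "json_object" requests remain valid.
    #[serde(skip_serializing_if = "Option::is_none")]
    pub json_schema: Option<serde_json::Value>,
}
```
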
- Wrap agent tools in an Arc for safe concurrent access
- Modify tool addition methods to work with Arc-wrapped HashMap
- Ensure thread-safe tool registration and retrieval
- Update stream processing to use Arc-cloned tools reference
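
One way the Arc-wrapped registry could look; the tool type is simplified to a closure here, and `Arc::make_mut` for the registration path is an assumed implementation detail.

```rust
use std::collections::HashMap;
use std::sync::Arc;

// Simplified stand-in for a boxed ToolExecutor trait object.
type Tool = Arc<dyn Fn(serde_json::Value) -> serde_json::Value + Send + Sync>;

struct Agent {
    tools: Arc<HashMap<String, Tool>>,
}

impl Agent {
    fn add_tool(&mut self, name: impl Into<String>, tool: Tool) {
        // Clones the map only if a spawned task still holds a reference,
        // so registration stays cheap in the common case.
        Arc::make_mut(&mut self.tools).insert(name.into(), tool);
    }

    fn spawn_stream_task(&self) {
        let tools = Arc::clone(&self.tools); // cheap pointer copy, not a deep clone
        tokio::spawn(async move {
            if let Some(tool) = tools.get("search_data_catalog") {
                let _ = tool(serde_json::json!({ "query": "revenue" }));
            }
        });
    }
}
```
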
- Add comprehensive debug logging for agent thread processing
- Modify agent thread to use user ID for streaming subscription
- Update stream processing to include detailed tool call and content logging
- Improve error handling and visibility in stream processing
- Add user context to agent thread initialization
- Remove redundant tool execution handling in stream processing
- Add `ValueToolExecutor` to convert tool outputs to `serde_json::Value`
- Introduce `IntoValueTool` trait for easy value type conversion (sketched below)
- Update agent tool addition methods to use new value conversion mechanism
- Simplify tool registration by automatically converting tool outputs
- Remove previous manual boxing and type conversion logic
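
The adapter can be sketched as a wrapper that serializes any typed tool output into `serde_json::Value`; the trait shape is repeated inline for self-containment, and the exact signatures are assumptions.

```rust
use async_trait::async_trait;
use serde::Serialize;

#[async_trait]
pub trait ToolExecutor: Send + Sync {
    type Output: Serialize + Send;
    async fn execute(&self, params: serde_json::Value) -> anyhow::Result<Self::Output>;
}

/// Erases a tool's typed output to `serde_json::Value`.
pub struct ValueToolExecutor<T> {
    inner: T,
}

#[async_trait]
impl<T: ToolExecutor> ToolExecutor for ValueToolExecutor<T> {
    type Output = serde_json::Value;

    async fn execute(&self, params: serde_json::Value) -> anyhow::Result<Self::Output> {
        let typed = self.inner.execute(params).await?;
        Ok(serde_json::to_value(typed)?) // lossless while Output: Serialize
    }
}

/// Blanket conversion so every tool registers through one uniform type.
pub trait IntoValueTool: Sized {
    fn into_value_tool(self) -> ValueToolExecutor<Self>;
}

impl<T: ToolExecutor> IntoValueTool for T {
    fn into_value_tool(self) -> ValueToolExecutor<T> {
        ValueToolExecutor { inner: self }
    }
}
```
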
- Add comprehensive dataset search functionality using LLM for intelligent dataset matching
- Implement search across datasets with relevance ranking based on YML content
- Create structured search result output with dataset metadata
- Add robust error handling, logging, and parsing for search operations
- Include test coverage for search result validation
- Enhance tool with flexible query parameter support and detailed response messages
- Add comprehensive file search functionality using LLM for intelligent file matching
- Implement search across metric and dashboard files with relevance ranking
- Create structured search result output with file metadata
- Add robust error handling and logging for search operations
- Include test coverage for search result parsing
- Implement robust line-based content modification for metric and dashboard files (sketched below)
- Add comprehensive error handling and validation for file modifications
- Improve modification tracking with detailed modification results
- Optimize file processing with batch insertion and validation
- Add extensive test coverage for modification and validation logic
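
A sketch of line-based modification under stated assumptions: edits carry 1-indexed inclusive ranges, and applying them bottom-up keeps earlier line numbers stable. The struct and field names are illustrative.

```rust
/// One requested edit: replace lines [start_line, end_line] (1-indexed, inclusive).
struct Modification {
    start_line: usize,
    end_line: usize,
    replacement: String,
}

/// Applies modifications from the bottom up so earlier edits don't shift
/// the line numbers of later ones. Ranges are validated before any change.
fn apply_modifications(content: &str, mut mods: Vec<Modification>) -> Result<String, String> {
    let mut lines: Vec<String> = content.lines().map(str::to_string).collect();
    mods.sort_by(|a, b| b.start_line.cmp(&a.start_line));
    for m in &mods {
        if m.start_line == 0 || m.end_line > lines.len() || m.start_line > m.end_line {
            return Err(format!("invalid range {}-{}", m.start_line, m.end_line));
        }
        let replacement: Vec<String> = m.replacement.lines().map(str::to_string).collect();
        lines.splice(m.start_line - 1..m.end_line, replacement);
    }
    Ok(lines.join("\n"))
}
```
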
- Add a generic `Output` type to the `ToolExecutor` trait (sketched below)
- Update file tools to use strongly-typed output structs
- Modify agent and tool implementations to support generic output
- Improve error handling and result reporting in file-related tools
- Add more detailed status messages for file operations
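
With the generic `Output` in place (see the `ToolExecutor` sketch above), file tools can return structured results instead of strings; a hypothetical typed output:

```rust
use serde::Serialize;

/// Hypothetical typed result for a file-creation tool; field names are
/// illustrative, not the crate's actual struct.
#[derive(Debug, Serialize)]
pub struct CreateFilesOutput {
    pub created: Vec<String>,          // files written successfully
    pub failed: Vec<(String, String)>, // (file name, error) pairs
    pub message: String,               // detailed status for the calling agent
}
```
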
- Rename file type modules from `metric_file` and `dashboard_file` to `metric_yml` and `dashboard_yml`
- Modify metric files migration to use a verification enum instead of boolean
- Update messages_to_files junction table with UUID primary key and additional timestamp columns
- Adjust database models to support new file and message structures
- Refactor file creation utility to use new model structures
- Extend Dataset model and schema to include optional database_identifier field
- Update dataset creation and deployment routes to handle new database_identifier parameter
- Modify dataset DDL generation to use database_identifier for schema resolution when available
- Add support for parsing Snowflake timestamp structs with epoch and fraction fields
- Implement handling of Snowflake timestamp logical types (with and without timezone)
- Enhance JSON value processing to detect and convert Snowflake timestamp objects
- Add error handling and logging for timestamp parsing
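
A sketch of the detect-and-convert step, assuming Snowflake encodes timestamps as `{"epoch": seconds, "fraction": nanoseconds}` objects; the timezone-aware logical type would differ only in output formatting.

```rust
use chrono::DateTime;
use serde_json::Value;

/// Detects a Snowflake-style `{"epoch": ..., "fraction": ...}` object and
/// converts it to an RFC 3339 string; any other value passes through.
fn normalize_snowflake_timestamp(value: &Value) -> Value {
    if let Value::Object(map) = value {
        if let (Some(epoch), Some(fraction)) = (
            map.get("epoch").and_then(Value::as_i64),
            map.get("fraction").and_then(Value::as_i64),
        ) {
            // Assumes `fraction` is nanoseconds within the second.
            if let Some(dt) = DateTime::from_timestamp(epoch, fraction as u32) {
                return Value::String(dt.to_rfc3339());
            }
        }
    }
    value.clone()
}
```
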
* xAxisTickInterval
* fix(visualization): Add x-axis time unit configuration for bar and combo charts
- Extend chart configuration to support optional x-axis time unit
- Update modify visualization agent to dynamically set x-axis time interval
- Modify bar, line, and combo chart prompts to include the x_axis_time_unit parameter
* only use valid time units
* refactor(visualization): Simplify x-axis time unit configuration
- Modify modify_visualization_agent to extract and remove x-axis time unit more efficiently
- Update global styling result structure for x-axis time interval
- Adjust format_label_prompt comment to clarify date format default behavior
---------
Co-authored-by: Nate Kelley <nate@buster.so>
- Restructure deploy_datasets function to separate concerns
- Improve column and relationship validation in dataset deployment
- Enhance error handling and validation result generation
- Add support for more comprehensive column and relationship checks
- Refactor validation logic to handle multiple error types
- Add support for capturing source type (table, view, materialized view)
- Improve column metadata queries for Postgres, MySQL, BigQuery, and Snowflake
- Include more comprehensive column information during dataset import
- Extend DatasetColumnRecord to include source_type field
- Add comprehensive column validation before dataset deployment
- Validate existence of all required columns in source database
- Simplify column type and nullability retrieval
- Enhance error reporting for missing columns
- Update deployment logic to use pre-validated column information
- Add detailed validation error logging in CLI
- Improve type compatibility checks in dataset validation
- Modify deployment process to handle and report validation errors more comprehensively
- Add Hash derive for Verification enum
- Update API and CLI to support more informative validation results
- Introduce helper functions for processing string and JSON values
- Implement case-insensitive string and JSON value transformations
- Add robust timestamp parsing with error handling
- Enhance Snowflake query result processing with consistent data normalization
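
The helpers might look like this sketch; lowercasing as the normalization rule is an assumption based on the case-insensitive transformations mentioned above.

```rust
use serde_json::Value;

/// Lowercases a string for case-insensitive comparison and storage.
fn normalize_string(s: &str) -> String {
    s.to_lowercase()
}

/// Recursively lowercases every string inside a JSON value, leaving
/// numbers, booleans, and nulls untouched.
fn normalize_json(value: Value) -> Value {
    match value {
        Value::String(s) => Value::String(s.to_lowercase()),
        Value::Array(items) => Value::Array(items.into_iter().map(normalize_json).collect()),
        Value::Object(map) => {
            Value::Object(map.into_iter().map(|(k, v)| (k, normalize_json(v))).collect())
        }
        other => other,
    }
}
```
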
- Refactor stored values processing in dataset deployment to use background task
- Add `StoredValueColumn` struct to encapsulate column processing details
- Implement `process_stored_values_background` for parallel and resilient value storage (sketched below)
- Add logging for successful and failed stored value processing
- Update CLI to handle optional SQL definitions and improve file processing
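
`StoredValueColumn` and `process_stored_values_background` are named above; their fields and bodies in this sketch are illustrative assumptions.

```rust
use tokio::task;

/// Columns flagged for value storage, captured before the deploy returns.
#[derive(Debug, Clone)]
struct StoredValueColumn {
    dataset_id: uuid::Uuid,
    column_name: String,
}

/// Fire-and-forget: deployment responds immediately while values are
/// extracted and stored in the background; one failure affects one column.
fn process_stored_values_background(columns: Vec<StoredValueColumn>) {
    task::spawn(async move {
        for col in columns {
            match store_column_values(&col).await {
                Ok(n) => tracing::info!(column = %col.column_name, rows = n, "stored values"),
                Err(e) => tracing::warn!(column = %col.column_name, error = %e, "stored values failed"),
            }
        }
    });
}

async fn store_column_values(_col: &StoredValueColumn) -> anyhow::Result<usize> {
    // ... query distinct values, generate embeddings, and insert them ...
    Ok(0)
}
```
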
* fix: add stored values support for dataset columns
This commit introduces stored values functionality for dataset columns, including:
- Adding a `stored_values` flag to column deployment requests
- Implementing a mechanism to store column values during dataset deployment
- Updating data analyst and SQL generation agents to leverage stored values
- Creating a new utility module for stored values search and management
* refactor(stored_values): improve stored values implementation and schema management
This commit enhances the stored values functionality with several key improvements:
- Update schema and table creation to use organization ID as schema name
- Modify stored values storage to include column ID
- Improve value extraction and embedding generation process
- Remove unnecessary distance calculation in search results
- Clean up unused values_engine module
- Added implementation for creating metric files with database insertion
- Introduced `create_metric_file()` and `create_dashboard_file()` functions
- Updated `CreateFilesTool` to handle different file types
- Added `data_metadata` field to `MetricFile` struct
- Implemented basic file type validation and creation logic
- Added robust error handling for JSON parameter parsing in `CreateFilesTool`
- Updated file type description to clarify naming conventions
- Prepared for file creation logic implementation
- Moved file-related types to a new `file_types` module
- Removed redundant `types.rs` file
- Removed individual tool files for bulk file modifications, file creation, file opening, data catalog search, file search, and sending to user
- Created a new `file_tools` module in `tools/mod.rs` to centralize file-related tool implementations
- Commented out individual tool module imports in preparation for future implementation
- Simplified the tools module structure for better organization and maintainability
- Added environment variable-based LLM client initialization in `Agent::new()`
- Introduced `add_tool()` and `add_tools()` methods for more flexible tool registration
- Implemented new `get_name()` method for `ToolExecutor` trait
- Added comprehensive test cases for Agent with and without tools
- Updated `AgentThread` with a convenient constructor method
- Temporarily commented out unused tool modules
- Added debug print in LiteLLM client for response logging
- Renamed existing messages table to `messages_deprecated`
- Created new `messages` table with updated schema and additional indexes
- Updated Diesel schema to reflect new table structure and relationships
- Added new foreign key constraints for threads and users
- Prepared for migration of existing message data
- Created `messages_to_files` junction table to link messages with dashboard and metric files
- Added foreign key constraints and indexes for efficient file-message relationships
- Updated Diesel schema to include new `messages_to_files`, `dashboard_files`, and `metric_files` tables
- Removed unnecessary timestamp triggers from migration files
- Added support for retrieving and passing data source type during SQL agent generation
- Updated SQL generation prompts to include data source type in system messages
- Modified `generate_sql_agent` function to fetch and utilize data source type information
- Improved SQL generation context by dynamically incorporating data source type details
- Renamed `search_datasets.rs` to `search_data_catalog.rs`
- Updated `mod.rs` to reflect the new module and tool name
- Removed the placeholder `SearchDatasetsTool` implementation
- Prepared for future implementation of data catalog search functionality
- Relocated `ToolExecutor` trait from `agent/types.rs` to `tools/mod.rs`
- Updated import paths in `agent/agent.rs` to reflect new trait location
- Simplified `types.rs` by removing unused imports and trait definition
- Prepared for more comprehensive tool management in the tools module
- Renamed `Thread` to `AgentThread` for clarity
- Modified `ToolExecutor` to return `serde_json::Value` instead of `String`
- Updated message processing to handle new message structures
- Improved content handling in streaming and tool call scenarios
- Simplified message content extraction and serialization
- Uncommented and activated module implementations for `LiteLLMClient` and `Agent`
- Updated test cases to use new message and tool call structures
- Simplified test assertions for chat completion and tool call scenarios
- Restored module exports in `mod.rs` files
- Commented out imports and module exports for `LiteLLMClient` and `Agent`
- Removed active module implementations in `mod.rs` files
- Simplified type definitions and test cases
- Prepared for potential future reimplementation or restructuring
- Simplified message and tool structures in `types.rs`
- Updated `agent.rs` to use new type structures
- Enhanced error handling in agent tests with detailed error printing
- Updated `.cursorrules` with debugging recommendation
- Removed redundant fields and improved type flexibility
- Streamlined serialization and deserialization of AI client types
- Updated `types.rs` to support more complex message and response structures
- Added support for multi-content messages with `Content` struct
- Introduced optional fields and improved serialization handling
- Enhanced type flexibility for AI client responses
- Updated test cases to reflect new type structures
- Added `Default` implementation for `LiteLLMClient`
- Improved support for tool calls and streaming responses
- Updated `.env.example` with new LLM configuration variables
- Modified `LiteLLMClient` to support API key and base URL from environment variables
- Enhanced client initialization to use env vars with optional parameter overrides (sketched below)
- Added comprehensive test case for environment variable configuration
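
Initialization along these lines; the environment variable names (`LLM_API_KEY`, `LLM_BASE_URL`) and the fallback URL are assumptions.

```rust
use std::env;

pub struct LiteLLMClient {
    api_key: String,
    base_url: String,
}

impl LiteLLMClient {
    /// Explicit arguments win; otherwise fall back to environment variables.
    pub fn new(api_key: Option<String>, base_url: Option<String>) -> Self {
        let api_key = api_key
            .or_else(|| env::var("LLM_API_KEY").ok())
            .unwrap_or_default();
        let base_url = base_url
            .or_else(|| env::var("LLM_BASE_URL").ok())
            .unwrap_or_else(|| "http://localhost:4000".to_string());
        Self { api_key, base_url }
    }
}
```
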
- Updated test cases to use `mockito::Server::new_async()` (sketched below)
- Added request body validation in test mocks
- Improved test coverage for chat completion and streaming scenarios
- Updated .cursorrules with detailed testing guidelines
- Added dev dependencies for Mockito and Tokio async testing
- Updated .cursorrules with async test requirement
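
A sketch of an async test in this style; the endpoint path, model name, and response body are illustrative, while `Server::new_async`, `Matcher::PartialJson`, and `create_async` follow the mockito 1.x API.

```rust
#[cfg(test)]
mod tests {
    #[tokio::test]
    async fn chat_completion_hits_mock_endpoint() {
        let mut server = mockito::Server::new_async().await;

        // Validate the request body in addition to returning a canned response.
        let mock = server
            .mock("POST", "/chat/completions")
            .match_body(mockito::Matcher::PartialJson(serde_json::json!({
                "model": "gpt-4o"
            })))
            .with_status(200)
            .with_body(r#"{"choices":[{"message":{"role":"assistant","content":"hi"}}]}"#)
            .create_async()
            .await;

        // A client pointed at server.url() would be exercised here.
        let response = reqwest::Client::new()
            .post(format!("{}/chat/completions", server.url()))
            .json(&serde_json::json!({ "model": "gpt-4o", "messages": [] }))
            .send()
            .await
            .unwrap();

        assert!(response.status().is_success());
        mock.assert_async().await;
    }
}
```
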
- Expanded utils modules with new serde_helpers and tools
- Added new AI client module for LiteLLM
* feat: enhance deploy command with skip_dbt option
- Updated the Deploy command to accept a `skip_dbt` boolean argument, allowing users to bypass the dbt run during deployment.
- Refactored the deploy function to conditionally execute the dbt command based on the `skip_dbt` flag, improving deployment flexibility.
* Refactor query engine and CLI commands for improved functionality and error handling
- Updated `get_bigquery_columns` and `get_snowflake_columns` functions to enhance column name handling and ensure proper error reporting.
- Modified `get_snowflake_client` to accept a database ID for better connection management.
- Enhanced the `deploy` command in the CLI to include additional parameters (`path`, `data_source_name`, `schema`, `env`) for more flexible deployments.
- Improved error handling and reporting in the `deploy` function, including detailed summaries of deployment errors and successful file processing.
- Updated `get_model_files` to accept a directory path and added checks for file existence, enhancing robustness.
- Adjusted model file structures to include schema information and refined the upload process to handle optional parameters more effectively.
These changes collectively improve the usability and reliability of the query engine and deployment process.
* Update dataset DDL generation to include optional YML file content
- Modified `generate_sql_agent` to append optional YML file content to dataset DDL
- Ensures more comprehensive dataset representation during SQL agent generation
- Handles cases where YML file might be present or absent gracefully
- Added `html-escape` crate to `Cargo.toml` for HTML escaping.
- Updated email template processing to escape HTML in message and button text, preventing potential XSS vulnerabilities.
- Modified test cases to include HTML content in email parameters, ensuring proper handling and escaping.
This change improves security by sanitizing user input in email communications.
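
The escaping step likely resembles this sketch; `render_email_message` and its parameters are hypothetical, while `html_escape::encode_text` is the crate's real entry point for text escaping.

```rust
/// Escapes user-supplied template values so they cannot inject markup
/// into the email HTML. `encode_text` escapes `<`, `>`, `&`, etc.
fn render_email_message(raw_message: &str, raw_button_text: &str) -> (String, String) {
    let message = html_escape::encode_text(raw_message).to_string();
    let button_text = html_escape::encode_text(raw_button_text).to_string();
    (message, button_text)
}
```
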
* fix permission check on post_dataset rest
* refactor: enhance dataset overview access lineage and permission checks
- Updated the `get_dataset_overview` function to conditionally add default access lineage based on user roles and existing access paths.
- Simplified the logic for adding user roles to the lineage, ensuring clarity and maintainability.
- Improved handling for the `RestrictedQuerier` role to include checks for existing access before adding default lineage, enhancing permission accuracy.
- Streamlined code by removing redundant checks and consolidating role handling, optimizing overall readability.
* feat: Enhance permission group handling and data retrieval
- Introduced a new `PermissionGroupInfo` struct to encapsulate detailed information about permission groups, including user and dataset counts.
- Updated the `get_permission_group` and `list_permission_groups` functions to improve data retrieval and error handling.
- Refactored SQL queries in `list_permission_groups` to include additional joins for counting users and datasets associated with permission groups, enhancing the overall functionality and clarity of the API.
- Streamlined code for better readability and maintainability, ensuring consistent handling of user and permission group data.
* refactor: Improve dataset access handling and permission checks
- Enhanced the `get_restricted_user_datasets` and `get_restricted_user_datasets_with_metadata` functions to include additional permission checks for dataset groups and permission groups.
- Consolidated SQL queries to ensure proper filtering of deleted records and improved clarity in dataset retrieval logic.
- Introduced new joins and filters to handle dataset group permissions, ensuring accurate access control for users.
- Streamlined code for better readability and maintainability, enhancing overall functionality in dataset access management.
* fix: Update SQL migration and seed data for user attributes
- Modified the SQL migration to specify the schema for the `users` table, ensuring clarity in the update statement.
- Adjusted the seed data for `users_to_organizations` to change the `organization_id` from 'public' to 'none', reflecting a more accurate state for user roles and organization associations.
- Ensured consistency in the formatting of SQL insert statements for better readability.
* fix: Prevent users from updating their own profiles
- Added a check in the `update_user_handler` to prevent users from updating their own information, returning an error if they attempt to do so.
- This change enhances security by ensuring that users cannot modify their own records, which could lead to unauthorized changes.
* refactor: Simplify dashboard permission queries by removing team-based joins
- Removed left joins with `teams_to_users` table in dashboard permission queries
- Simplified permission checks to only filter by direct user ID
- Updated queries in `get_user_dashboard_permission`, `get_bulk_user_dashboard_permission`, and `list_dashboards_handler`
- Streamlined SQL query logic for more direct and efficient permission checks
- Updated the `get_snowflake_client` function to no longer require `warehouse_id` and `database_id`, simplifying the connection process.
- This change enhances flexibility in client initialization and aligns with recent updates to Snowflake API handling.
- Downgraded the `base64` crate version in `Cargo.toml` from `0.22.1` to `0.21`.
- Refactored the `snowflake_query` function in `snowflake_query.rs` to improve data type handling, including support for additional Arrow data types and enhanced null value checks.
- Updated the `route_to_query` function in `query_router.rs` to use mutable `snowflake_client` for better state management during query execution.
- Improved error handling for closing the Snowflake client session, ensuring proper logging of any issues encountered.
- Streamlined the `get_dataset_overview` function to improve access control for the `RestrictedQuerier` role, ensuring more precise permission checks.
- Updated the `get_user_information` function to optimize dataset processing, categorizing datasets based on direct access and permission group access.
- Removed redundant code and improved readability by consolidating logic for user roles, enhancing maintainability.
- Enhanced lineage tracking for datasets, providing a clearer representation of user permissions across different access types.
- Enhanced the `get_dataset_overview` function to refine access control for the `RestrictedQuerier` role, allowing for more granular permission checks based on various access paths.
- Updated the `get_user_information` function to streamline dataset processing, ensuring that datasets are categorized correctly based on direct access and permission group access.
- Removed redundant code and improved readability by consolidating logic for user roles, enhancing maintainability of both functions.
- Improved lineage tracking for datasets, providing a clearer representation of user permissions across different access types.
- Introduced a default access lineage for users, ensuring consistent representation of user permissions.
- Simplified the addition of user roles to the lineage, consolidating logic for WorkspaceAdmin, DataAdmin, Querier, and Viewer roles.
- Enhanced lineage tracking for RestrictedQuerier role to include direct dataset access and permission group lineage, improving granularity of dataset permissions.
- Removed redundant code related to dataset and permission group lineage, optimizing readability and maintainability of the `get_dataset_overview` function.
- Simplified access control logic for datasets based on user roles, consolidating conditions for WorkspaceAdmin, DataAdmin, Querier, Viewer, and RestrictedQuerier.
- Enhanced dataset lineage tracking to provide clearer representation of user permissions across various dataset access types.
- Removed redundant code related to dataset processing, improving readability and maintainability of the `get_user_information` function.
- Ensured that datasets are correctly categorized based on direct access, permission group access, and organization datasets, optimizing the overall data retrieval process.
- Updated `get_dataset_overview` to include dataset group access and permission group to dataset group access, improving the granularity of dataset permissions.
- Introduced new queries to fetch dataset groups and their associated permissions, enhancing the dataset overview for users.
- Refactored `get_user_information` to concurrently retrieve dataset groups and permission group datasets, optimizing performance with `tokio::spawn`.
- Enhanced lineage tracking for datasets, allowing for better representation of user permissions across dataset groups and permission groups.
- Improved error handling during database queries to ensure robust data retrieval.
- Added new structs `DatasetLineage` and `DatasetInfo` to represent dataset details and lineage.
- Updated `UserResponse` to include a list of datasets associated with the user.
- Refactored `get_user_information` function to concurrently fetch user info, direct datasets, permission group datasets, and organization datasets using `tokio::spawn` for improved performance (sketched below)
- Implemented logic to compile datasets based on direct access and permission group access, including lineage tracking for better data representation.
- Enhanced error handling during database queries to ensure robust user information retrieval.
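
The fan-out pattern described above, as a sketch; the `fetch_*` helpers are hypothetical stand-ins for the real queries.

```rust
use uuid::Uuid;

async fn fetch_user_info(_id: Uuid) -> anyhow::Result<()> { Ok(()) }
async fn fetch_direct_datasets(_id: Uuid) -> anyhow::Result<()> { Ok(()) }
async fn fetch_permission_group_datasets(_id: Uuid) -> anyhow::Result<()> { Ok(()) }
async fn fetch_organization_datasets(_id: Uuid) -> anyhow::Result<()> { Ok(()) }

async fn get_user_information(user_id: Uuid) -> anyhow::Result<()> {
    // Spawn the four independent queries so they run concurrently.
    let user = tokio::spawn(fetch_user_info(user_id));
    let direct = tokio::spawn(fetch_direct_datasets(user_id));
    let groups = tokio::spawn(fetch_permission_group_datasets(user_id));
    let org = tokio::spawn(fetch_organization_datasets(user_id));

    // Each `??` unwraps the JoinError first, then the query's own Result.
    user.await??;
    direct.await??;
    groups.await??;
    org.await??;
    Ok(())
}
```
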
- Introduced a new `DatasetToDatasetGroup` struct to represent the relationship between datasets and dataset groups, including fields for timestamps and optional deletion.
- Updated the database schema to include `updated_at` and `deleted_at` fields for the `datasets_to_dataset_groups` table, enhancing data tracking capabilities.
- Refactored the routing in `mod.rs` to include a nested router for assets, improving the organization of dataset group routes.
- Introduced new PUT routes for managing users and dataset groups in the assets module.
- Updated the router to support PUT requests for `/users`, `/dataset_groups`, and `/datasets`, enhancing the API's functionality for resource updates.
- Improved modularity by organizing related routes within the assets module.
- Changed the parameter in the SQL query from `user.id` to `user_id` for consistency with the updated user ID parameter naming convention.
- Enhanced the SQL query to count distinct dataset permissions and utilize `bool_or` for identity checks, improving accuracy and performance.
- Cleaned up the grouping in the SQL query by removing unnecessary fields, streamlining the data retrieval process.
- Added a new `assets` module to organize related routes.
- Updated the routing in `mod.rs` to nest the `assets` router under the `/:permission_group_id` path, enhancing the structure and clarity of the API.
- Maintained existing routes for managing permission groups while improving modularity.
- Introduced a new `organization_id` field in the `DatasetGroupPermission` struct to associate permissions with specific organizations.
- Updated the `put_dataset_groups_handler` to include `organization_id` when creating or updating dataset group permissions, enhancing the API's capability to manage permissions at the organizational level.
- Improved SQL query formatting for better readability in the handler.
- Introduced a new `TeamInfoRole` enum to represent user roles within teams, replacing the previous boolean `assigned` field.
- Updated the `list_teams` handler to return team roles instead of assignment status, improving clarity on user roles.
- Refactored the `put_teams` handler to support role-based assignments, allowing for more granular control over team memberships.
- Added new PUT routes for dataset groups and permission groups in the user assets router, enhancing API capabilities.
- Improved SQL queries for team assignments to utilize role information, streamlining database interactions.
- Updated all user-related route handlers to use `user_id` instead of `id` for better clarity and consistency.
- Modified the routing definitions in `mod.rs` to reflect the new parameter naming convention.
- Enhanced the `list_permission_groups` function to accept `user_id` as a parameter, improving clarity in the handler's signature.
- Ensured all relevant functions now consistently handle the `user_id` parameter, streamlining the codebase and improving maintainability.