1. Introduce the concept of "collection_name" to docstore and vector store. Each collection can be viewed similarly to a table in a SQL database. It allows better organizing information within this data source.
2. Move the `Index` and `Source` tables from the application scope into the index scope. For each new index created by user, these tables should increase accordingly. So it depends on the index, rather than the app.
3. Make each index responsible for the UI components in the app.
4. Construct the File UI page.
* feat: Add installers for linux, windows, and macos
* docs: Update README
* pre-commit fix styles
* Update installers and README
* Remove env vars check and fix paths
* Update installers:
* Remove start.py and move install and launch part back to .sh/.bat
* Add conda deactivate
* Make messages more informative
* Improve kotaemon based on insights from projects (#147)
- Include static files in the package.
- More reliable information panel. Faster & not breaking randomly.
- Add directory upload.
- Enable zip file to upload.
- Allow setting endpoint for the OCR reader using environment variable.
* feat: Add installers for linux, windows, and macos
* docs: Update README
* pre-commit fix styles
* Update installers and README
* Remove env vars check and fix paths
* Update installers:
* Remove start.py and move install and launch part back to .sh/.bat
* Add conda deactivate
* Make messages more informative
* Make macOS installer runable and improve Windows, Linux installers
* Minor fix macos commands
* installation should pause before exit
* Update Windows installer: add a new label to exit function with error
* put install_dir to .gitignore
* chore: Add comments to clarify the 'end' labels
---------
Co-authored-by: Duc Nguyen (john) <trungduc1992@gmail.com>
Co-authored-by: ian <ian@cinnamon.is>
- Include static files in the package.
- More reliable information panel. Faster & not breaking randomly.
- Add directory upload.
- Enable zip file to upload.
- Allow setting endpoint for the OCR reader using environment variable.
* add albert to git-secret
* update readme
* Limit llama-index version
* Langchain upgrade their wikipedia tool name
---------
Co-authored-by: trducng <trungduc1992@gmail.com>
* Allow customizing the base application
* Make the core llms and embeddings customizable
* Make the settings, reasoning and index customizable
* Import from langchain_openai
- Migrate the MVP into kotaemon.
- Preliminary include the pipeline within chatbot interface.
- Organize MVP as an application.
Todo:
- Add an info panel to view the planning of agents -> Fix streaming agents' output.
Resolve: #60Resolve: #61Resolve: #62
Refactor the `kotaemon/pipelines` module to `kotaemon/indices`. Create the VectorIndex.
Note: currently I place `qa` to be inside `kotaemon/indices` since at the moment we only have `qa` in RAG. At the same time, I think `qa` can be an independent module in `kotaemon/qa`. Since this can be changed later, I still go at the 1st option for now to observe if we can change it later.
* enforce Document as IO
* Separate rerankers, splitters and extractors (#85)
* partially refractor importing
* add text to embedding outputs
---------
Co-authored-by: Nguyen Trung Duc (john) <trungduc1992@gmail.com>