Releases: Marker-Inc-Korea/AutoRAG
v0.3.9
What's Changed
- Edit documentation about data schema and descriptions by @vkehfdl1 in #905
- autorag —version by @vkehfdl1 in #913
- [Hotfix] fix hf space url at README.md by @bwook00 in #917
- ✨ feat: improve sample size handling in Validator class by @hongsw in #912
- Fix error that missing init of huggingface llm and ollama by @vkehfdl1 in #919
- Fix: added table_html variable initialization by @effortprogrammer in #920
- enhanced documentation at custom LLM models by @vkehfdl1 in #923
- just return original texts when there is no corresponding tokenizer a… by @vkehfdl1 in #931
- add Arxiv citation for our paper by @vkehfdl1 in #932
- delete tqdm by @bwook00 in #934
- add demojize with emoji package by @rjwharry in #935
- Release/v0.3.9 by @vkehfdl1 in #936
New Contributors
- @effortprogrammer made their first contribution in #920
Full Changelog: v0.3.8...v0.3.9
v0.3.8
What's Changed
- Feature/docker deploy push by @hongsw in #887
- Edit stream API endpoint and add instructions deploying kotaemon to fly.io by @vkehfdl1 in #891
- �Delete trial path logic at parse & chunk + add detail docs & tutorial at docs by @bwook00 in #894
- Feature/#892 by @rjwharry in #895
- Add documentation for custom_query_gen and make_custom_gen_gt function by @rjwharry in #897
- Edit api routes url by @eduumach in #899
- Add test code for query expansion with vectordb by @vkehfdl1 in #902
- Add progress bar by @bwook00 in #903
- dump version 0.3.8 by @vkehfdl1 in #904
New Contributors
Full Changelog: v0.3.7...v0.3.8
v0.3.7
What's Changed
- fix the error and release 0.3.5-rc1 by @vkehfdl1 in #842
- Add Huggingface Space at README.md by @bwook00 in #847
- Add new Sample YAML file by @bwook00 in #848
- Fix README.md by @Jake-Song in #850
- Add AWS Bedrock llm and upgrade VERSION 0.3.6 by @bwook00 in #856
- Add roadmap and other badges at README.md by @vkehfdl1 in #862
- Add use multimodal feature at llama parse by @bwook00 in #868
- ✨ feat: Update supporting nodes and modules information in index.md by @hongsw in #859
- Add External VectorDB Connections by @vkehfdl1 in #872
- Release/v0.3.7 by @vkehfdl1 in #883
New Contributors
- @Jake-Song made their first contribution in #850
Full Changelog: v0.3.5...v0.3.7
v0.3.5
What's Changed
- Run validation at the start_trial by @vkehfdl1 in #826
- AutoRAG api version & api docker container + gpu version docker container by @vkehfdl1 in #823
- Add FlashRank Reranker module by @bwook00 in #818
- set the fixed port number of the panel dashboard by @vkehfdl1 in #827
- change stream to astream, and add non-async stream function by @vkehfdl1 in #835
- add setup python at sphinx.yml by @vkehfdl1 in #836
- Change recency filter parameter name to threshold_datetime from threshold by @vkehfdl1 in #837
- Release/v0.3.5 by @vkehfdl1 in #838
- [Hotfix] name change Konlpy at chunk_full.yaml by @bwook00 in #840
Full Changelog: v0.3.4...v0.3.5
v0.3.4
What's Changed
- Add OpenVINO Reranker module by @bwook00 in #808
- Properly truncate to 8000 tokens when we use OpenAI Embeddings by @vkehfdl1 in #812
- Refactor API server with streaming and passage return by @vkehfdl1 in #810
- ✨ feat: Added Docker push workflow, Dockerfile updates, and build script by @hongsw in #807
- Add VoyageAI Reranker module by @bwook00 in #809
- calculate the right cosine similarity score at the get_id_scores by @vkehfdl1 in #816
- 日本語対応 by @wooheum-xin in #814
- Add Mixedbread AI Reranker Module by @bwook00 in #805
- Release/v0.3.4 by @vkehfdl1 in #813
New Contributors
- @wooheum-xin made their first contribution in #814
Full Changelog: v0.3.3...v0.3.4
v0.3.3
What's Changed
- [Parse Bug] Fix only parse the first page of the whole pdf files by @bwook00 in #783
- [Parse Bug] Add non-table exists page to use clova.py by @bwook00 in #784
- Prevent error that httpx uses different event loop at method chaining on the QA by @vkehfdl1 in #785
- add deepeval metrics by @Eastsidegunn in #750
- Release/v0.3.3 by @vkehfdl1 in #803
Full Changelog: v0.3.2...v0.3.3
v0.3.2
v0.3.1
What's Changed
- Add toctree by @bwook00 in #745
- Fix minor errors at the documentations by @vkehfdl1 in #747
- add effective_order at bleu as True by @vkehfdl1 in #748
- add passage dependency filter at data creation by @vkehfdl1 in #751
- Add Passage Dependency at README.md by @bwook00 in #761
- docs: update data_format.md by @eltociear in #772
- change the README and tutorial of deploying the result. by @vkehfdl1 in #769
- Windows support (partially) AutoRAG by @vkehfdl1 in #766
- Feature/hongsw/671 dockerfile Add Dockerfile and Docker configuration for AutoRAG production environment by @hongsw in #763
- Add total three evolving methods to QA creation by @vkehfdl1 in #767
- Possible error when the QA retrieval_gt shape will be different by @vkehfdl1 in #774
- dump version 0.3.1 by @vkehfdl1 in #776
New Contributors
- @eltociear made their first contribution in #772
Full Changelog: v0.3.0...v0.3.1
v0.3.0
What's Changed
- Refactoring to v3.0 for efficient deployment by @vkehfdl1 in #727
- resolve vllm error by @vkehfdl1 in #735
- Change data creation package names to v0.3 by @vkehfdl1 in #740
- Add more yaml file by @bwook00 in #743
- Update README for v 0.3.0 by @bwook00 in #739
- dump version 0.3.0 by @vkehfdl1 in #741
Full Changelog: v0.2.18...v0.3.0
🚀 AutoRAG v0.3.0 is Here! 🚀
We're thrilled to introduce AutoRAG v0.3.0, packed with new features and key improvements. Here’s what’s new:
1. Improved Response Time for Deployment
In earlier versions, the response time during deployment was slow, making it difficult to use optimized RAG pipeline. With v0.3.0, we've significantly reduced the response time, making deployment much more efficient for user-facing services.
2. Re-designed Data Creation Process
Data creation is an essential part of optimizing RAG pipelines, and we've made the process much smoother. In earlier versions, this feature was still in its early stages. Now, in v0.3.0, you can build the data creation process within AutoRAG.
We’ve added AutoParse and AutoChunk, allowing you to configure, parse, and chunk your data using a single YAML file. You can also easily compare different methods to refine your pipeline. Whether you build QA datasets with LLMs or manually, this structure offers a human-in-the-loop process to help you create and manage your data.
Check out the detailed guide on data creation.
3. Python & Library Support Updates
- Python 3.9 is no longer supported. Please upgrade to Python 3.10.
- AutoRAG now works with LangChain 0.3, LlamaIndex 0.11, Pydantic v2, and OpenAI o1 models.
Share Your Feedback
Your insights help us improve AutoRAG! Let us know how these updates impact your workflow and what you’d like to see in future versions.
Join Discord server now!
Thank you for being part of the AutoRAG journey!
v0.2.18
What's Changed
- change add_file_name language notation by @bwook00 in #717
- Ingest bm25_tokenizer and embedidng only in the strategy of other modules by @vkehfdl1 in #716
- OpenAI o1 model compatibility by @vkehfdl1 in #719
- Compatible with Langchain version 0.3.0 by @bwook00 in #724
- Release/v0.2.18 by @vkehfdl1 in #726
Full Changelog: v0.2.17...v0.2.18