kotaemon/tests/test_ingestor.py
Duc Nguyen (john) 5a9d6f75be Migrate the MVP into kotaemon (#108)
- Migrate the MVP into kotaemon.
- Preliminary include the pipeline within chatbot interface.
- Organize MVP as an application.

Todo:

- Add an info panel to view the planning of agents -> Fix streaming agents' output.

Resolve: #60
Resolve: #61 
Resolve: #62
2024-01-10 15:28:09 +07:00

16 lines
463 B
Python

from pathlib import Path
from kotaemon.indices.ingests import DocumentIngestor
from kotaemon.indices.splitters import TokenSplitter
def test_ingestor_include_src():
dirpath = Path(__file__).parent
ingestor = DocumentIngestor(
pdf_mode="normal",
text_splitter=TokenSplitter(chunk_size=50, chunk_overlap=10),
)
nodes = ingestor(dirpath / "resources" / "table.pdf")
assert type(nodes) is list
assert nodes[0].relationships