Overview
Sources
Section titled “Sources”Sources are the knowledge foundation of Lattice. Add documents, URLs, and repositories to build a searchable knowledge base that grounds AI responses.
Supported Source Types
| Type | Description | Processing |
|---|---|---|
pdf | PDF documents | Text extraction, chunking, embeddings |
url | Web pages | HTML parsing, content extraction |
github | GitHub repositories | README, code files, documentation |
youtube | YouTube videos | Transcript extraction |
google_docs | Google Documents | API-based content retrieval |
markdown | Markdown files | Direct parsing |
text | Plain text | Direct chunking |
artifact | Promoted artifacts | Converted from Studio |
Auto-Classification
Sources are automatically categorized based on content analysis:
requirements- SLAs, PRDs, RFPs, specificationsresearch- Academic papers, blog posts, transcriptsvendor- Pricing pages, model cards, API documentationarchitecture- System designs, diagrams, technical specsbenchmarks- Leaderboards, evaluations, comparisonstutorial- Guides, tutorials, learning content
Processing Pipeline
- Ingestion: Content fetched/uploaded
- Extraction: Text extracted from format
- Chunking: Split into semantic chunks (~1000 tokens)
- Embedding: Vector embeddings generated (if enabled)
- Indexing: Full-text search index updated