Overview
A well-organized knowledge base improves agent accuracy, reduces hallucinations, and makes maintenance easier. Follow these practices to get the most from your knowledge base. They are based on real-world usage patterns and help your agents provide accurate, helpful responses.
Content Quality
Use Clear, Concise Language
Why:
- Clear language is easier for AI to understand and extract
- Concise content reduces token usage
- Structured information improves retrieval accuracy
Structure Information Logically
Why:
- Structured content is easier to parse
- Lists and headings improve readability
- Clear organization helps AI find relevant information
Include Specific Details
Why:
- Specific details provide accurate answers
- Vague information leads to generic responses
- Precise data reduces follow-up questions
Avoid Ambiguity
Why:
- Clear conditions prevent confusion
- Ambiguous statements lead to incorrect answers
- Specific rules help agents make accurate decisions
Naming Conventions
Be Descriptive
Good:
- “Return Policy - Electronics 2024”
- “FAQ - Shipping Information”
- “Product Catalog - Q1 2024”
Bad:
- “Returns”
- “Doc1”
- “Policy”
- “file_final_v2”
Why:
- Descriptive names make searching easier
- Clear names help identify content quickly
- Consistent naming improves organization
Include Dates for Versioned Content
Good:
- “Employee Handbook - January 2024”
- “Product Catalog - Q1 2024”
- “Return Policy - 2024”
Why:
- Dates help identify current vs. outdated content
- Version tracking prevents confusion
- Easy to identify what needs updating
Use Consistent Naming
Why:
- Consistent patterns make organization clear
- Related items are easier to find
- Filtering and searching work better
Include Context
Good:
- “Return Policy - Electronics”
- “Return Policy - Clothing”
- “Return Policy - Software”
Why:
- Context helps agents choose the right information
- Prevents confusion between similar topics
- Improves retrieval accuracy
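The naming guidance in this section can be checked mechanically before an item is saved. The following is a minimal sketch, not a platform feature: `check_item_name`, the set of vague names, and the regular expressions are all hypothetical and only mirror the example names listed above.

```python
import re

# Hypothetical heuristics; adjust them to your own naming convention.
VAGUE_NAMES = {"returns", "policy", "faq", "document", "untitled"}
FILE_RESIDUE = re.compile(r"(^doc\d*$)|(_final)|(_v\d+$)", re.IGNORECASE)
HAS_DATE = re.compile(r"\b(20\d{2}|Q[1-4])\b")


def check_item_name(name: str, versioned: bool = False) -> list[str]:
    """Return a list of naming problems; an empty list means the name passes."""
    problems = []
    cleaned = name.strip()
    # Single-word or generic names give no topic or context.
    if cleaned.lower() in VAGUE_NAMES or len(cleaned.split()) < 2:
        problems.append("too vague: add the topic and context")
    # Names such as "Doc1" or "file_final_v2" look like raw file names.
    if FILE_RESIDUE.search(cleaned):
        problems.append("looks like a leftover file name")
    # Versioned content should carry a year or quarter.
    if versioned and not HAS_DATE.search(cleaned):
        problems.append("versioned content should include a date")
    return problems
```

For example, `check_item_name("Return Policy - Electronics 2024", versioned=True)` returns an empty list, while `check_item_name("Returns")` reports the name as too vague.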
Organization
Group Related Content
Strategy:
- Create separate items for different topics
- Don’t combine unrelated information
- Use clear names to show relationships
Why:
- Focused items improve retrieval accuracy
- Easier to update specific topics
- Better organization reduces maintenance effort
Keep Items Focused
Good:
- One item per topic
- Focused, specific content
- Clear purpose for each item
Bad:
- Everything in one large item
- Multiple unrelated topics combined
- Unclear item purpose
Why:
- Focused items are easier to retrieve
- Specific content improves accuracy
- Easier to maintain and update
Break Down Large Content
Strategy:
- Split long documents into chapters/sections
- Create separate items for major topics
- Keep items under 2,000 words when possible
Why:
- Smaller items are easier to process
- Better retrieval accuracy
- Faster processing times
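The splitting strategy above can be sketched as a small helper that groups paragraphs into chunks of roughly 2,000 words. This is an illustration under stated assumptions: `split_into_items` is a hypothetical name, paragraphs are assumed to be separated by blank lines, and the word count is a plain whitespace split. Text items also have their own character limits, listed under Content Size Guidelines.

```python
def split_into_items(text: str, max_words: int = 2000) -> list[str]:
    """Group paragraphs into chunks of about max_words words.

    A single paragraph longer than max_words is kept whole, so a chunk
    can exceed the limit in that case.
    """
    chunks, current, count = [], [], 0
    for paragraph in text.split("\n\n"):
        words = len(paragraph.split())
        if words == 0:
            continue  # skip empty paragraphs
        # Start a new chunk when this paragraph would exceed the limit.
        if current and count + words > max_words:
            chunks.append("\n\n".join(current))
            current, count = [], 0
        current.append(paragraph.strip())
        count += words
    if current:
        chunks.append("\n\n".join(current))
    return chunks
```

Splitting on paragraph boundaries rather than at a fixed word offset keeps each chunk readable on its own. Splitting by heading or chapter, as the strategy suggests, is usually better still when the document has clear structure.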
Update Regularly
Schedule:
- Review the knowledge base quarterly
- Remove outdated items
- Update changed information
- Archive old versions
Why:
- Outdated information leads to incorrect answers
- Regular updates maintain accuracy
- A clean knowledge base improves performance
Content Size Guidelines
Text Items
Recommended:
- Optimal: 100-2,000 characters
- Maximum: 5,000 characters
- Minimum: 50 characters
Best Practices:
- Break long content into multiple items
- Keep each item focused on one topic
- Use multiple items for comprehensive coverage
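The character limits above translate directly into a pre-save check. The sketch below assumes the platform counts characters the way Python's `len` does, which may not hold exactly; `classify_text_length` and its return labels are hypothetical.

```python
# Limits taken from the guidelines above.
MIN_CHARS, MAX_CHARS = 50, 5000
OPTIMAL_MIN, OPTIMAL_MAX = 100, 2000


def classify_text_length(text: str) -> str:
    """Classify a text item by length: too_short, too_long, optimal, or acceptable."""
    n = len(text)  # assumption: the platform counts characters like len()
    if n < MIN_CHARS:
        return "too_short"
    if n > MAX_CHARS:
        return "too_long"
    if OPTIMAL_MIN <= n <= OPTIMAL_MAX:
        return "optimal"
    return "acceptable"
```

An item classified as `too_long` is a candidate for splitting into several focused items.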
File Items
Recommended:
- Optimal: Under 5MB per file
- Maximum: 21MB per file
- Strategy: Split large documents into chapters
Best Practices:
- Use well-formatted documents
- Ensure text is extractable (not just images)
- Avoid complex layouts
- Include clear headings and structure
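File sizes can be checked locally before upload. This is a rough sketch: it assumes the 5MB and 21MB limits are binary megabytes (1,048,576 bytes), which the guidelines do not specify, and `classify_file_size` and `check_file` are hypothetical helper names.

```python
from pathlib import Path

MB = 1024 * 1024  # assumption: limits are measured in binary megabytes
OPTIMAL_BYTES = 5 * MB
MAX_BYTES = 21 * MB


def classify_file_size(size_bytes: int) -> str:
    """Classify a file size as optimal, above_optimal, or too_large."""
    if size_bytes > MAX_BYTES:
        return "too_large"
    if size_bytes >= OPTIMAL_BYTES:
        return "above_optimal"  # allowed, but a candidate for splitting
    return "optimal"


def check_file(path: str) -> str:
    """Classify a file on disk by its size."""
    return classify_file_size(Path(path).stat().st_size)
```

The check covers size only. Whether the text in a file is extractable still needs to be verified by opening the document.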
URL Items
Recommended:
- Optimal: Focused pages over entire websites
- Maximum: 100 URLs per item
- Strategy: Group related URLs together
Best Practices:
- Choose content-heavy pages
- Avoid pages with excessive navigation/ads
- Use sitemaps for large-scale imports
- Monitor failed URLs regularly
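One way to apply both the grouping strategy and the 100-URL limit is to group URLs by host and first path segment, then cap each group. The sketch below uses that heuristic as an assumption about what "related" means; `group_urls` is a hypothetical helper, and other grouping keys may suit your site better.

```python
from collections import defaultdict
from urllib.parse import urlparse

MAX_URLS_PER_ITEM = 100  # limit from the guidelines above


def group_urls(
    urls: list[str], max_per_item: int = MAX_URLS_PER_ITEM
) -> list[tuple[str, list[str]]]:
    """Group URLs by host and first path segment, then cap each group."""
    groups = defaultdict(list)
    # dict.fromkeys removes duplicates while keeping the original order.
    for url in dict.fromkeys(u.strip() for u in urls if u.strip()):
        parsed = urlparse(url)
        segments = [s for s in parsed.path.split("/") if s]
        key = f"{parsed.netloc}/{segments[0]}" if segments else parsed.netloc
        groups[key].append(url)
    batches = []
    for key, members in groups.items():
        # Split any group larger than the per-item limit.
        for start in range(0, len(members), max_per_item):
            batches.append((key, members[start:start + max_per_item]))
    return batches
```

Each returned pair holds a group key, which can seed the item name, and the URLs for one item.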
Content Types by Use Case
Customer Support
Best Content Types:
- Text items for FAQs
- File items for policy documents
- URL items for support articles
Best Practices:
- Group by topic (shipping, returns, payments)
- Use consistent naming
- Keep content current
Product Information
Best Content Types:
- File items for product manuals
- URL items for product pages
- Text items for specifications
Best Practices:
- One item per product or product line
- Include version numbers
- Update when products change
Technical Documentation
Best Content Types:
- File items for technical manuals
- URL items for API documentation
- Text items for quick references
Best Practices:
- Organize by topic or feature
- Include version information
- Link related documentation
Company Policies
Best Content Types:
- Text items for key policies
- File items for comprehensive handbooks
- URL items for policy pages
Best Practices:
- One policy per item
- Include effective dates
- Archive old versions
Maintenance Practices
Regular Reviews
Weekly:
- Check for failed items
- Review processing status
- Monitor storage usage
Monthly:
- Review unused items
- Update outdated content
- Clean up duplicates
Quarterly:
- Comprehensive audit
- Remove obsolete items
- Optimize organization
- Review naming conventions
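The duplicate cleanup step in these reviews can be partly automated for text items. The sketch below is one possible approach, assuming you can export item names and text into a dictionary; `find_duplicates` is a hypothetical helper and only catches text that is identical after case and whitespace normalization.

```python
import hashlib
from collections import defaultdict


def find_duplicates(items: dict[str, str]) -> list[list[str]]:
    """Group item names whose text matches after case/whitespace normalization."""
    by_digest = defaultdict(list)
    for name, text in items.items():
        normalized = " ".join(text.lower().split())
        digest = hashlib.sha256(normalized.encode("utf-8")).hexdigest()
        by_digest[digest].append(name)
    # Only groups with more than one name are duplicates.
    return [names for names in by_digest.values() if len(names) > 1]
```

Near-duplicates with reworded content are not detected by this approach and still need a manual review.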
Storage Management
Monitor Usage:
- Check storage regularly
- Delete unused items
- Compress large files
- Upgrade plan if needed
Clean Up:
- Remove failed items that can’t be fixed
- Archive old items instead of deleting
- Use inactive status for temporary content
Quality Assurance
Test Content:
- Verify items process successfully
- Test retrieval in agents
- Check for accuracy
- Review error messages
Monitor Performance:
- Track which items are used
- Identify unused items
- Review agent responses
- Update based on feedback
Error Prevention
Before Creating Items
Checklist:
- ✅ Verify files open correctly
- ✅ Check file sizes (max 21MB)
- ✅ Ensure URLs are accessible
- ✅ Remove password protection
- ✅ Verify text length (50-5,000 chars)
- ✅ Check storage quota
During Creation
Best Practices:
- Use descriptive names
- Validate content before saving
- Monitor processing status
- Review error messages immediately
After Creation
Verification:
- Confirm PROCESSED status
- Review item details
- Test in agent if needed
- Fix any errors promptly
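If items are created by a script, the status check after creation can be automated by polling. The sketch below is deliberately independent of any real API: `get_status` is a callable you would supply yourself, `PROCESSED` comes from the verification step above, and the `FAILED` status name and the `TIMEOUT` return value are assumptions made for this example.

```python
import time
from typing import Callable


def wait_until_processed(
    get_status: Callable[[str], str],
    item_id: str,
    timeout_s: float = 120.0,
    interval_s: float = 5.0,
) -> str:
    """Poll get_status until the item is PROCESSED or FAILED, or time runs out."""
    deadline = time.monotonic() + timeout_s
    while True:
        status = get_status(item_id)
        if status in ("PROCESSED", "FAILED"):  # "FAILED" is an assumed status name
            return status
        if time.monotonic() >= deadline:
            return "TIMEOUT"  # value defined by this sketch, not by the platform
        time.sleep(interval_s)
```

A result other than `PROCESSED` is the cue to review the item's error message and fix or delete the item.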
URL Management
Single URLs
Best Practices:
- Use focused, content-heavy pages
- Avoid pages with lots of navigation
- Choose pages that are likely to remain stable
- Test URLs before adding
Sitemap Imports
Best Practices:
- Use for large-scale imports
- Review fetched URLs before adding
- Group related URLs together
- Monitor processing status
Limitations:
- Maximum 100 URLs per item
- Some sites may block scraping
- Large sitemaps take time
URL Maintenance
Regular Tasks:
- Monitor failed URLs
- Remove inaccessible URLs
- Update changed URLs
- Test URLs periodically
Performance Optimization
Retrieval Accuracy
Improve Accuracy:
- Use focused, specific items
- Include relevant keywords in names
- Structure content logically
- Keep items appropriately sized
Processing Speed
Optimize Processing:
- Keep files under 5MB when possible
- Use well-formatted documents
- Avoid complex layouts
- Split large content into smaller items
Storage Efficiency
Maximize Storage:
- Delete unused items
- Remove failed items
- Compress files before uploading
- Use inactive status for temporary content
Common Mistakes to Avoid
❌ Too Much Content in One Item
Problem:
- Large items are harder to retrieve accurately
- Slower processing times
- Difficult to maintain
Solution:
- Break into multiple focused items
- Keep items under 2,000 words
- One topic per item
❌ Vague or Ambiguous Content
Problem:
- Leads to incorrect answers
- Confuses agents
- Requires follow-up questions
Solution:
- Use specific, clear language
- Include exact details
- Avoid ambiguity
❌ Poor Naming
Problem:
- Hard to find items
- Difficult to organize
- Confusing for team members
Solution:
- Use descriptive names
- Follow consistent patterns
- Include context
❌ Outdated Content
Problem:
- Incorrect information
- Confusing responses
- Poor user experience
Solution:
- Regular reviews
- Update promptly
- Archive old versions
❌ Ignoring Failed Items
Problem:
- Clutters knowledge base
- Wastes storage
- Confusing status
Solution:
- Review failed items weekly
- Fix or delete promptly
- Monitor for patterns
Advanced Tips
Content Versioning
Strategy:
- Include dates in names
- Archive old versions
- Keep current version active
- Document changes
Multi-Language Support
Strategy:
- Create separate items per language
- Use language in name
- Organize by language
- Keep translations synchronized
Seasonal Content
Strategy:
- Use inactive status for off-season
- Activate when needed
- Don’t delete seasonal content
- Update dates in names
Collaborative Management
Strategy:
- Establish naming conventions
- Document organization structure
- Regular team reviews
- Clear ownership
Measuring Success
Key Metrics
Usage Metrics:
- Items used in agents
- Items not used (candidates for cleanup)
- Processing success rate
Quality Metrics:
- Agent response accuracy
- User satisfaction
- Error rates
Performance Metrics:
- Processing times
- Storage usage
- Retrieval accuracy
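The usage metrics above are simple to compute from an export of your items. The sketch below assumes a hypothetical `Item` record with a status and a count of agents that reference it. Only the `PROCESSED` status name comes from this guide; the field names and the `usage_metrics` helper are illustrative.

```python
from dataclasses import dataclass


@dataclass
class Item:
    name: str
    status: str          # "PROCESSED" per this guide; other values are assumptions
    used_by_agents: int  # number of agents that reference the item


def usage_metrics(items: list[Item]) -> dict:
    """Compute processing success rate, items in use, and cleanup candidates."""
    total = len(items)
    processed = sum(1 for item in items if item.status == "PROCESSED")
    unused = [item.name for item in items if item.used_by_agents == 0]
    return {
        "processing_success_rate": processed / total if total else 0.0,
        "items_in_use": total - len(unused),
        "cleanup_candidates": unused,
    }
```

Tracking these numbers at each review makes it easier to see whether cleanup work is having an effect.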
Continuous Improvement
Process:
- Monitor metrics regularly
- Identify areas for improvement
- Implement changes
- Measure impact
- Iterate
Next Steps
- Creating Items: Apply these practices when creating items
- Managing Items: Organize your knowledge base effectively
- Status Lifecycle: Monitor and maintain items