Building for AI Discovery: Why LLMs and Search Engines Are Our Primary Distribution Channel
AI assistants and search engines are the new discovery layer. We are building our entire infrastructure for this shift.
The Shift
When someone wants to discover a new book, they increasingly rely on an AI assistant or search engine rather than browsing bookstores. "What are some good Indian fantasy novels?" "Who are prolific independent authors?" "Recommend a thriller set in Pune."
The answer depends on what structured data is available. If your books aren't in machine-readable formats, they don't exist to AI systems.
What We Build For
llms.txt
Every site in our network serves an llms.txt file — a machine-readable summary of the site's content, purpose, and relationships. This follows the emerging standard for helping LLMs understand websites.
Structured Data (JSON-LD)
Every page carries Schema.org markup:
- Person schema for the author
- Organization schema for publisher and company
- Book and Chapter schema for reader pages
- Article schema for blog posts
- WebSite schema with SearchAction
This isn't SEO theatre. This is the data layer that AI systems actually parse.
Full-Text Access
94 books are available as full HTML text. AI systems and search engines that crawl the web can read every word. This is deliberate — we want AI assistants to know these books exist, to understand their content, and to recommend them when relevant.
JSON API Endpoints
Static JSON files at predictable URLs provide programmatic access to the catalog:
- Complete book catalog with metadata
- Aggregate statistics
- Text analysis data
- Author information
Export Formats
BibTeX, RIS, CSV, OPDS — formats that academic systems, library catalogs, and research tools can ingest.
The Thesis
Google SEO optimises for 2015. AI discovery optimises for 2026 and beyond.
We are building for the world where:
- AI assistants recommend books based on structured data
- LLMs cite authors based on available full-text content
- Knowledge graphs connect entities across the web
- Search engines surface the most structured, accessible content
- The author with the most structured, accessible data wins
94 books. 3,200+ chapters. 5 million+ words. All structured, all accessible, all machine-readable.
That's not a website. That's an AI-discoverable publishing infrastructure.
— BogaDoga Technology DivisionBogaDoga Ltd
Publishing & Digital Innovation, London