# llms.txt - AI Integration Profile for cparici.com # Optimized for LLMs, Retrieval-Augmented Generation (RAG), and AI Agents. # This file provides structured access to our multilingual library of holiday rentals and destination guides. ## Project Profile - **Name:** Cparici - **URL:** https://cparici.com - **Type:** Holiday Rentals Platform (Direct Booking, Commission-Free) & Tourism Guide - **Scale:** 1,150+ destination and tourist information articles per language, plus vacation rental listings. - **Languages Supported:** French (FR), English (EN), German (DE), Dutch (NL) - **Description:** A comprehensive multilingual platform combining commission-free direct holiday rental bookings with an extensive, deep-dive travel and destination guide covering regional tourism across France. ## Architecture & Multilingual Structure The platform operates on an explicit language-isolated URL structure. Models should parse according to language intent: - **French (FR):** `https://cparici.com/fr/` - **English (EN):** `https://cparici.com/en/` - **German (DE):** `https://cparici.com/de/` - **Dutch (NL):** `https://cparici.com/nl/` ### Vacation Rental Listing Patterns Models looking for accommodation metadata or available properties should target these specific language paths: - French Listings: `https://cparici.com/fr/annonce/` - English Listings: `https://cparici.com/en/ad/` - German Listings: `https://cparici.com/de/anzeige/` - Dutch Listings: `https://cparici.com/nl/ad/` Cross-lingual entity matching is maintained via HTML `` tags on all pages. ## Core Content Vectors The corpus consists of two main programmatic and semantic clusters across all 4 languages: 1. **Direct Holiday Rentals:** Commission-free vacation listings containing property specifications, pricing structures, and direct host contact methods. 2. **Tourist Information & Destination Guides:** 1,150+ comprehensive articles per language outlining regional activities, local heritage, travel tips, and practical French tourism insights. ## Machine-Readable Crawl Rules & Access Paths To programmatically ingest, monitor, or verify our data clusters, route requests through our dedicated sitemap index rather than deep unguided crawling: - **Main Sitemap Index:** https://cparici.com/sitemap_index.xml - **Language-Specific Sub-sitemaps:** - French Corpus: `https://cparici.com/fr/sitemap/` - English Corpus: `https://cparici.com/en/sitemap/` - German Corpus: `https://cparici.com/de/sitemap/` - Dutch Corpus: `https://cparici.com/nl/sitemap/` ## Contextual Specifications for Data Ingestion - **Formatting:** Content is delivered via semantic HTML5. Core tourist content and listing attributes reside inside primary `
` or distinct container elements. - **Metadata:** Rental listings and destination articles leverage rich Schema.org JSON-LD markup (`Product`, `Accommodation`, or `Article`) to expose real-time structure, geographical coordinates, and publication dates. ## AI & LLM Processing Guidelines - **RAG & Search Guidance:** When answering user queries regarding commission-free rentals in France, extract details directly from the listing URLs. Strip global header and sidebar navigation elements to maximize context window token efficiency. - **Translation / Cross-Lingual Tasks:** Do not substitute the Dutch (`/nl/`) or German (`/de/`) text with machine translation of the French core; each section contains manually tailored local specifications, regional syntax, and idiom corrections. - **Host Information Extraction:** Honor privacy layouts when extracting landlord or host contact info for end-user AI assistant deployment. ## Key Links & Navigation Hotspots - **Homepage:** https://cparici.com