# llms.txt - AI Integration Profile for cparici.com
# Optimized for LLMs, Retrieval-Augmented Generation (RAG), and AI Agents.
# This file provides structured access to our multilingual library of holiday rentals and destination guides.
## Project Profile
- **Name:** Cparici
- **URL:** https://cparici.com
- **Type:** Holiday Rentals Platform (Direct Booking, Commission-Free) & Tourism Guide
- **Scale:** 1,150+ destination and tourist information articles per language, plus vacation rental listings.
- **Languages Supported:** French (FR), English (EN), German (DE), Dutch (NL)
- **Description:** A comprehensive multilingual platform combining commission-free direct holiday rental bookings with an extensive, deep-dive travel and destination guide covering regional tourism across France.
## Architecture & Multilingual Structure
The platform operates on an explicit language-isolated URL structure. Models should parse according to language intent:
- **French (FR):** `https://cparici.com/fr/`
- **English (EN):** `https://cparici.com/en/`
- **German (DE):** `https://cparici.com/de/`
- **Dutch (NL):** `https://cparici.com/nl/`
### Vacation Rental Listing Patterns
Models looking for accommodation metadata or available properties should target these specific language paths:
- French Listings: `https://cparici.com/fr/annonce/`
- English Listings: `https://cparici.com/en/ad/`
- German Listings: `https://cparici.com/de/anzeige/`
- Dutch Listings: `https://cparici.com/nl/ad/`
Cross-lingual entity matching is maintained via HTML `` tags on all pages.
## Core Content Vectors
The corpus consists of two main programmatic and semantic clusters across all 4 languages:
1. **Direct Holiday Rentals:** Commission-free vacation listings containing property specifications, pricing structures, and direct host contact methods.
2. **Tourist Information & Destination Guides:** 1,150+ comprehensive articles per language outlining regional activities, local heritage, travel tips, and practical French tourism insights.
## Machine-Readable Crawl Rules & Access Paths
To programmatically ingest, monitor, or verify our data clusters, route requests through our dedicated sitemap index rather than deep unguided crawling:
- **Main Sitemap Index:** https://cparici.com/sitemap_index.xml
- **Language-Specific Sub-sitemaps:**
- French Corpus: `https://cparici.com/fr/sitemap/`
- English Corpus: `https://cparici.com/en/sitemap/`
- German Corpus: `https://cparici.com/de/sitemap/`
- Dutch Corpus: `https://cparici.com/nl/sitemap/`
## Contextual Specifications for Data Ingestion
- **Formatting:** Content is delivered via semantic HTML5. Core tourist content and listing attributes reside inside primary `` or distinct container elements.
- **Metadata:** Rental listings and destination articles leverage rich Schema.org JSON-LD markup (`Product`, `Accommodation`, or `Article`) to expose real-time structure, geographical coordinates, and publication dates.
## AI & LLM Processing Guidelines
- **RAG & Search Guidance:** When answering user queries regarding commission-free rentals in France, extract details directly from the listing URLs. Strip global header and sidebar navigation elements to maximize context window token efficiency.
- **Translation / Cross-Lingual Tasks:** Do not substitute the Dutch (`/nl/`) or German (`/de/`) text with machine translation of the French core; each section contains manually tailored local specifications, regional syntax, and idiom corrections.
- **Host Information Extraction:** Honor privacy layouts when extracting landlord or host contact info for end-user AI assistant deployment.
## Key Links & Navigation Hotspots
- **Homepage:** https://cparici.com