Supplier feeds arrive half-empty, in the wrong language, with the same product spelled three different ways. Entity Enricher turns raw product rows and spec-sheet PDFs into complete, deduplicated catalog records -- attributes, categories, compliance data, and descriptions in every language your channels sell in.
A product catalog is only as good as its worst supplier feed. Missing attributes break faceted search, inconsistent brand spellings split one product into five listings, and every new market multiplies the translation workload. PIM systems store product data well -- but they do not fill the gaps, and manual data entry does not scale past a few hundred SKUs.
With Entity Enricher, you define the exact record your catalog needs and enrich from two sources at once: what the models already know about the product and brand, and attached supplier documents -- spec-sheet PDFs, packaging photos, datasheets. Semantic IDs collapse duplicate products to one record even when suppliers spell names differently, and fields marked multilingual come back in up to 40 languages from a single call.
Below is a sample schema for enriching retail products. You can generate schemas like this automatically from a sample product JSON using AI schema generation, or build them visually in the schema editor. Fields flagged multilingual are returned in every language you select; the semantic_id field keeps one identity per real-world product.
{
"name": "RetailProduct",
"properties": {
"product_name": { "type": "string", "is_key": true, "multilingual": true },
"gtin": { "type": "string" },
"brand": { "type": "string" },
"semantic_id": { "type": "string" },
"category_path": { "type": "array", "items": { "type": "string" } },
"description": { "type": "string", "multilingual": true },
"materials": { "type": "array", "items": { "type": "string" } },
"dimensions": {
"type": "object",
"properties": {
"width_cm": { "type": "number" },
"height_cm": { "type": "number" },
"depth_cm": { "type": "number" },
"weight_g": { "type": "number" }
}
},
"care_instructions": { "type": "string", "multilingual": true },
"compliance": {
"type": "object",
"properties": {
"ce_marking": { "type": "boolean" },
"age_restriction": { "type": "string" },
"safety_warnings": { "type": "array" }
}
},
"seo_keywords": { "type": "array", "items": { "type": "string" } }
}
}Entity Enricher splits your schema into expertise domains, running parallel LLM calls for each domain. Merchandising, logistics, and compliance fields each get a specialist prompt instead of one monolithic request.
| Field | Expertise | Description |
|---|---|---|
| product_name | General | Normalized product title, localized per sales channel |
| gtin | General | GTIN / EAN / UPC barcode identifiers |
| brand | Merchandising | Canonical brand name across supplier spellings |
| category_path | Merchandising | Taxonomy placement for navigation and faceted search |
| materials | Product Specs | Composition and material breakdown from spec sheets |
| dimensions | Logistics | Packaged dimensions and weight for shipping and storage |
| compliance | Compliance | CE marking, age restrictions, and mandatory safety warnings |
| seo_keywords | Marketing | Search terms and synonyms shoppers actually use |
Paste one product row from your PIM or feed. AI generates a typed schema with expertise domains for merchandising, logistics, compliance, and marketing -- mark names and descriptions multilingual.
Send SKUs, partial rows, or barcodes -- and attach supplier spec sheets, datasheets, or packaging photos so extracted attributes come from the source document, not guesswork.
Multiple LLMs fill each product in parallel and cross-check every field. Semantic IDs resolve duplicate products across supplier spellings, languages, and repeat imports.
Conflicts between models are detected and resolved with full reasoning. Export enriched products as JSON or Excel, or push them into your PIM and shop via the API or n8n.
Turn heterogeneous supplier feeds into your catalog format in one pass -- normalized brands, mapped categories, and attributes extracted from attached spec sheets.
Launch new markets without a translation pipeline: names, descriptions, and care instructions come back in every channel language from the same enrichment call.
Fill the missing materials, dimensions, and category fields that break filters and comparison pages -- validated against your schema types, not free text.
Meet marketplace listing requirements with complete GTINs, safety warnings, age restrictions, and CE-marking flags before your products go live.
Define your product schema, attach a supplier spec sheet, and get complete, multilingual, deduplicated catalog records in minutes -- no per-SKU data entry.
Get Started Free