cdek.ru
May 27, 2026
Database dump from CDEK (СДЭК), a major Russian logistics and courier delivery company. The data contains customer and shipment records including full names in Russian/Cyrillic, internal IDs, order/shipment numbers (ИМ-prefixed), email addresses, branch/pickup point codes, and UUIDs. The dataset is approximately 18 GB and contains hundreds of millions of records spanning senders, recipients, and corporate clients.
Data found in this dataset
Source files
Expand any file to inspect its column headers and the LLM's field-mapping reasoning, recorded during ingestion.
CDEK_BF__data__base.txt3 columns134,595,857 rows
File structure
Format: CSV·Delimiter: tab·Has header: no·Quote: "
| Source column | Mapped field | Confidence | LLM assessment |
|---|---|---|---|
| 3 | fullName | high | [3] contains full names in Cyrillic and Latin (e.g. 'ПАВЛОВ АЛЕКСАНДР', 'Студенихина Валерия Александровна', 'Zhao JIan'), appears to be sender/company name field |
| 4 | fullName | high | [4] contains full names in Cyrillic and Latin (e.g. 'Павлов Александр', 'Полякова Елена Александровна', 'Chao Yu'), appears to be recipient name field |
| 7 | high | [7] contains email addresses (e.g. '[email protected]', '[email protected]', '[email protected]') |
Notes: Tab-delimited file with no header row. 8 columns total. Column 0: internal numeric ID (skip). Column 1: UUID or legacy hex ID (skip). Column 2: shipment/order number with ИМ- or other prefixes (skip). Column 3: sender full name (often company names mixed with personal names). Column 4: recipient full name. Column 5: internal numeric ID (skip). Column 6: branch/pickup point code (skip). Column 7: email address (sparse). Many rows are shipment-only records with nulls in name/email fields.