telcel.com
Jan 1, 2010
In approximately 2010, a breach of the Mexican wireless telecommunications company Telcel exposed sensitive information on over 36 million customers. Compromised data includes full names, phone numbers, and physical addresses (street, neighborhood, city, state). The dataset contains 36,653,022 records in semicolon-delimited CSV format.
Source files
Each file below lists its column headers and the LLM's field-mapping reasoning, recorded during ingestion.
Telcel2010__Info.txt · 17 rows
File structure
Notes: Pre-LLM auto-detection: free-form text with visible emails / phones
Telcel2010__data__36M-Mexico-Telcel-Telecommunications-Company-Mobile-Broadband-csv-2010.csv · 10 columns · 3,146,655 rows
File structure
Format: CSV·Delimiter: semicolon·Has header: yes·Quote: "
| Source column | Mapped field | Confidence | LLM assessment |
|---|---|---|---|
| 0 | phone | high | [0] Header 'Campo1', values are 10-digit numbers matching the Mexican phone number format (e.g., a 612xxxxxxx pattern, consistent with the La Paz, Baja California Sur area code) |
| 1 | fullName | high | [1] Header 'Campo2', values are full person names in Spanish (e.g., 'CATALINO RODRIGUEZ ALVARADO', 'NORMA RRFUGIO AVILEZ RODRIGUEZ') |
| 2 | skip | high | [2] Header 'Campo3', contains mixed noise: IDs, placeholder values ('SIN', '0', 'NA', 'DESC', 'NO INDICA'), some appear to be truncated identifiers or document numbers; not reliable PII |
| 3 | address1 | high | [3] Header 'Campo4', values are street addresses with street names, numbers, and apt/interior designations (e.g., 'ENCINAS Y EMILIANO ZAPATA #2695', 'RIO PANUCO Y RIO BALZA SIN Int.SIN') |
| 4 | address2 | high | [4] Header 'Campo5', values are neighborhood/colony names (e.g., 'LOS OLIVOS LA RINCONADA', 'VILLA DEL ENCANTO', 'LAGUNA AZUL') |
| 5 | city | high | [5] Header 'Campo6', values are city names (e.g., 'LA PAZ', 'TIJUANA'), including Baja California Sur municipality names |
| 6 | state | high | [6] Header 'EdoSep', values are Mexican state names or abbreviations ('BAJA CALIFORNIA SUR', 'BCS', 'OAXACA', 'SINALOA', blank entries) |
| 7 | city | high | [7] Header 'MpoSep', values are municipality names within states (secondary city reference) |
| 8 | state | high | [8] Header 'EdoCof', values are state names or codes (appears to be duplicate/confirmation state field) |
| 9 | city | high | [9] Header 'MpoCof', values are municipality names (secondary municipality reference) |
Notes: Telcel 2010 Mexican telecom breach. 10 columns total, semicolon-delimited. Campo1, Campo2, and Campo4 through Campo6 contain the core PII (phone, full name, address details). Campo3 contains mixed noise and placeholder identifiers that are not reliable PII and should be skipped. Columns 6-9 (EdoSep, MpoSep, EdoCof, MpoCof) contain duplicate or confirmation geographic data, so state and municipality information is repeated across columns 5-9, apparently for redundancy or data-quality purposes.
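The column mapping in the table above can be illustrated with a minimal Python sketch. It is not the ingestion pipeline's actual code. The names `FIELD_MAP` and `map_row` are hypothetical, the sample row is synthetic placeholder text, and the rule of keeping the first non-blank value when several columns map to the same field (`city`, `state`) is an assumption made here for illustration.

```python
import csv
import io

# Column index -> mapped field, copied from the mapping table.
# None marks a column that is skipped (Campo3).
FIELD_MAP = {
    0: "phone",
    1: "fullName",
    2: None,
    3: "address1",
    4: "address2",
    5: "city",
    6: "state",
    7: "city",
    8: "state",
    9: "city",
}


def map_row(row):
    """Map one parsed CSV row to named fields.

    Assumption: when several columns map to the same field,
    the first non-blank value is kept and later ones are ignored.
    """
    record = {}
    for index, value in enumerate(row):
        field = FIELD_MAP.get(index)
        value = value.strip()
        if field is None or not value:
            continue
        record.setdefault(field, value)
    return record


# Synthetic placeholder data: the header names come from the table,
# the values are made up and do not correspond to any real record.
SAMPLE = (
    "Campo1;Campo2;Campo3;Campo4;Campo5;Campo6;EdoSep;MpoSep;EdoCof;MpoCof\n"
    '0000000000;"NOMBRE EJEMPLO";SIN;"CALLE EJEMPLO #1";"COLONIA EJEMPLO";'
    "CIUDAD;;MUNICIPIO;ESTADO;MUNICIPIO\n"
)

# Semicolon delimiter and double-quote quoting, as in the recorded file structure.
reader = csv.reader(io.StringIO(SAMPLE), delimiter=";", quotechar='"')
header = next(reader)  # the file has a header row
records = [map_row(row) for row in reader]
print(records[0])
```

In the synthetic row the first state column (EdoSep) is blank, so the value is taken from EdoCof instead. This shows how the duplicated geographic columns could act as fallbacks under the first-non-blank assumption.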