facebook.com
Aug 1, 2016
A 1.5 million record extract of Facebook user data containing email addresses, first names, last names, and profile URLs. This file was distributed as part of a larger multi-part compilation of over 100GB of leaked databases traded on dark web markets (Hansa Market onion link referenced). The compilation index lists dozens of major breaches across social media, gaming, dating, and other sectors. The Facebook-specific CSV contains plaintext email-to-profile mappings with no passwords.
Data found in this dataset
Source files
Expand any file to inspect its column headers and the LLM's field-mapping reasoning, recorded during ingestion.
All_BIG_Database_Leak_in_3_part_MORE_THAN_100GB_OF_DATA.txt1 rows
File structure
Notes: Pre-LLM auto-detection: free-form text with visible emails / phones
Facebook-1.5mil_-_Name__URL__Email.csv4 columns1,432,396 rows
File structure
Format: CSV·Delimiter: comma·Has header: no·Quote: "
| Source column | Mapped field | Confidence | LLM assessment |
|---|---|---|---|
| 0 | high | [0] All values contain @ symbol and are valid email addresses | |
| 1 | firstName | high | [1] Values are common given names (Charles, Cezmi, Gevorg, Alandria, Fred, Roland, etc.) |
| 2 | lastName | high | [2] Values are surnames (H, Düzce, Ayvazyan, Zeigler, Doner, Bobinger, etc.) |
| 3 | skip | high | [3] Facebook profile URLs, not PII mapping field |
Notes: Facebook 2016 breach extract: 1.5M records with email-to-profile mappings. Format is 4-column CSV with no header row. Column 3 (profile URLs) contains non-PII data and is skipped. Some rows have incomplete data (missing lastName or URL).