zoosk.com
May 1, 2020
Breach of Zoosk, an online dating platform. The dataset contains records with registration/activity dates (spanning 2012–2015), usernames, email addresses, and MD5-hashed passwords. This data is consistent with the widely reported Zoosk breach that surfaced around 2020, containing approximately 30 million records originally collected from the platform.
Data found in this dataset
Source files
Expand any file to inspect its column headers and the LLM's field-mapping reasoning, recorded during ingestion.
zoosk.txt4 columns57,554,881 rows
File structure
| Source column | Mapped field | Confidence | LLM assessment |
|---|---|---|---|
| 0 | skip | high | Registration/activity dates in DD-MM-YYYY format (e.g., 03-11-2014, 24-05-2015) |
| 1 | username | high | Alphanumeric usernames or numeric IDs (e.g., pdaddy, 1981821, chux13) |
| 2 | high | Email addresses containing @ symbol (e.g., gorostitaldea@hotmail.com, fleik@aon.at) | |
| 3 | password | high | 32-character hexadecimal strings consistent with MD5 hashes (e.g., afaa2e055ff3c8d8eac45e91b6eb3f77) |
Notes: Standard Zoosk dating platform combo list format: date;username;email:md5_password. Some records contain additional numeric identifiers embedded before email addresses (skip fields), e.g., '174800092:asibeyi_33@hotmail.com:0174800092:' - these appear to be internal IDs or reference numbers and should be disregarded. All passwords are MD5-hashed.