
appen.com

Mar 1, 2019

Records: 5,888,237
Files: 1
Added: May 30, 2026

Database dump from Appen, a data annotation and AI training data company. The dump contains user account records including names, email addresses, bcrypt-hashed passwords, authentication tokens, phone numbers, company affiliations, sign-in metadata, and account creation timestamps. The data file is named after CrowdFlower, which rebranded as Figure Eight in 2018 and was acquired by Appen in 2019. Records span from approximately 2014 to early 2019.

Data found in this dataset

Email, skip, fullName, phone


Source files

Each file is listed with its column headers and the LLM's field-mapping reasoning, recorded during ingestion.

Appen_BF__data__crowdflower.txt
38 columns, 5,888,237 rows

File structure

| Source column | Mapped field | Confidence | LLM assessment |
| --- | --- | --- | --- |
| 0 | skip | high | UUID identifier field (id) |
| 1 | fullName | high | Full names of users (e.g., 'Denis kemei', 'Brian Ward') |
| 2 | email | high | Email addresses with @ symbol (e.g., '[email protected]') |
| 3 | password | high | bcrypt-hashed passwords starting with $2a$12$ prefix |
| 4 | skip | high | reset_password_token - security token, not PII |
| 5 | skip | high | reset_password_sent_at - timestamp metadata |
| 6 | skip | high | remember_created_at - timestamp metadata |
| 7 | skip | high | sign_in_count - internal counter |
| 8 | skip | high | current_sign_in_at - timestamp metadata |
| 9 | skip | high | last_sign_in_at - timestamp metadata |
| 10 | skip | high | current_sign_in_ip - login metadata |
| 11 | skip | high | last_sign_in_ip - login metadata |
| 12 | skip | high | failed_attempts - internal counter |
| 13 | skip | high | unlock_token - security token |
| 14 | skip | high | locked_at - timestamp metadata |
| 15 | skip | high | authentication_token - API/session token |
| 16 | skip | high | salt - password salt for hashing |
| 17 | skip | high | created_at - timestamp |
| 18 | skip | high | updated_at - timestamp |
| 19 | skip | high | email_verified_at - timestamp |
| 20 | skip | high | email_verification_sent_at - timestamp |
| 21 | skip | high | email_verification_token - security token |
| 22 | skip | medium | unverified_email - duplicate of verified email, metadata |
| 23 | phone | medium | phone_number field, mostly empty in samples |
| 24 | skip | high | company - employer affiliation, not core PII field |
| 25 | skip | high | email_subscriber - boolean flag |
| 26 | skip | high | title - honorific prefix (Mr, Mrs, Ms, Dr, Prof, Rev), no PII field exists per instructions |
| 27 | skip | high | roles_updated_at - timestamp |
| 28 | skip | high | quick_sign_up - boolean flag |
| 29 | skip | high | internal_contributor_created_at - timestamp |
| 30 | skip | high | external_contributor_created_at - timestamp |
| 31 | skip | high | requestor_created_at - timestamp |
| 32 | skip | high | resend_verification_email_count - counter |
| 33 | skip | high | identity_id - internal identifier |
| 34 | skip | high | disabled_at - timestamp |
| 35 | skip | high | terms_of_service_accepted_at - timestamp |
| 36 | skip | high | current_team_id - internal team identifier |
| 37 | skip | high | api_team_id - internal team identifier |

Notes: This is a pipe-delimited database dump of Appen's user accounts table, not a traditional combo list. Extracted PII: fullName, email, bcrypt passwords, and optional phone. Most fields are timestamps, tokens, or internal metadata. The 'title' field contains honorifics (Mr, Dr, etc.), which are skipped per instructions because no PII field type exists for them. The phone_number field is present but mostly empty in the sample data.