aitype.com
Jun 1, 2014
Breach of AIType, an Android AI keyboard application. The dataset contains user records including IP addresses, city, country, device brand and model, device ID, Android version, app package name, username (email addresses), geographic coordinates, GCM push notification IDs, and user language preferences. The data appears to be a MongoDB export from AIType's backend infrastructure.
Data found in this dataset
Source files
Expand any file to inspect its column headers and the LLM's field-mapping reasoning, recorded during ingestion.
AIType_BF__data__aitype.txt1 column75,014,733 rows
File structure
Format: CSV·Delimiter: comma·Has header: yes·Quote: "
| Source column | Mapped field | Confidence | LLM assessment |
|---|---|---|---|
| 17 | username | high | [17] header 'userName', values are valid email addresses |
Notes: Only username (email) is PII. All other columns are internal IDs, device info, timestamps, or non-PII metadata per breach context. Columns like 'ip', 'city', 'country', 'location' are excluded by rules (IP addresses and geographic coordinates are not mapped as PII fields in this schema).