← All datasets

comelec.gov.ph

Mar 27, 2016

100,479,164
Records
12
Files
Jun 2, 2026
Added

A breach of the Commission on Elections (COMELEC) of the Philippines, exposing the entire Philippine voter registration database. The archive contains voter registration records (new_id_released.txt, web_id_onhand.txt, web_id_disapproved.txt), overseas absentee voter data (overseas_absentee_all.txt, overseas_absentee_scratch.txt), geographic reference codes, embassy and country codes, web application user accounts with hashed passwords (dbadmin_usersinformation.txt), and internal system user accounts (fum_users.txt). The data includes full names, dates of birth, addresses, fingerprint data, voter identification numbers (VINs), passport numbers, and biometric information for millions of Filipino voters including overseas absentee voters.

Data found in this dataset

EmailFirst nameLast nameMiddle nameUsernameAddressCityStateCountryGenderSuffixskipphonefullNamedobzipssnaddress2

Search this dataset

Scoped to this dataset. Fill any combination — results match if any field hits.

Source files

Expand any file to inspect its column headers and the LLM's field-mapping reasoning, recorded during ingestion.

code_tables.txt
0 rows

File structure

Notes: The provided text is NOT DATA. It contains reference documentation: registration type codes, absentee disapproval codes, disapproval codes, system removal codes, change codes, and 'Pages' codes with their descriptions. This is a data dictionary/reference guide explaining what various codes mean in the COMELEC breach dataset, not a structured data file with actual voter records or PII. No columns to map. The actual data files (new_id_released.txt, web_id_onhand.txt, overseas_absentee_all.txt, dbadmin_usersinformation.txt, fum_users.txt, etc.) are not provided in this submission.

dbadmin_usersinformation.txt
5 columns11 rows

File structure

Source columnMapped fieldConfidenceLLM assessment
1usernamehigh[1] header 'username', values are login identifiers (alvin, chief.sungahid, rose.gomez, etc.)
3lastNamehigh[3] header 'lastname', values are surnames (GENOTA, SUNGAHID, GOMEZ, etc.)
4firstNamehigh[4] header 'firstname', values are given names (ALVIN, ROMEL, ROSEMARIE, etc.)
5middleNamehigh[5] header 'maternalname', values are middle/maternal names (VILLANUEVA, UMALI, FERNANDEZ, etc.)
7passwordhigh[7] header 'password', values are hashed passwords (V12L6tEVPpv4JxIz40LS+/Llnic=, U2y5D2athOMZaN0ph1qp/+bAp28=, etc.)

Notes: This is a system user account database from fum_users.txt (internal COMELEC web application user accounts). Columns 0 (Id), 2 (userrole), 6 (nickname), 8 (status), 9 (connected), and 10 (lastconnection) are skipped as non-PII (internal IDs, status flags, connection timestamps). 11 columns total, 5 contain PII.

embassy_country_codes_ref.txt
4 columns0 rows

File structure

Format: CSV·Delimiter: pipe·Has header: yes·Quote: "

Source columnMapped fieldConfidenceLLM assessment
0countryhigh[0] header 'country', values are full country names
1countryhigh[1] header 'mailcountry', values are country codes representing mailing country
2skiphigh[2] header 'embassy', values are embassy location names - geographic reference data, not PII
3skiphigh[3] header 'mailembassy', values are embassy codes - geographic reference data, not PII

Notes: This file contains geographic reference data (countries and embassies) for the COMELEC overseas absentee voter system. Only country fields map to PII; embassy and location codes are non-PII reference data. The 'mailcountry' column contains country codes and maps to the country field as it represents actual country information.

fum_users.txt
52 columns2,559 rows

File structure

Source columnMapped fieldConfidenceLLM assessment
0skiphighsequential ID numbers (email_id)
1emailhighemail addresses with @comelec.gov.ph domain
2skiphighdepartment codes (ECAD, FINANCE, etc.)
3skiphighinternal classification codes (0-9, A, B, etc.)
4usernamehighuser login identifiers
5passwordhighbcrypt hashed passwords ($2a$15$ format), many empty (not set)
6skiphighboolean flag (is_passwdchanged)
7skipmediumregion code
8skipmediumprovince code
9skipmediummunicipality code
10skipmediummunicipality value
11skipmediumregion value
12skipmediumprovince value
13skipmediumembassy code
14skipmediumembassy value
15countrymediumcountry code
16skipmediumcountry value
17skipmediumcontinent code
18skipmediumcontinent value
19skipmediumvisit number
20skiphightimestamp (date_updated)
21skipmediumperson flag
22skipmediumlatitude coordinate
23skipmediumlongitude coordinate
24skipmediumquery district
25skipmediumtotal registered
26skipmediumbarangay list
27skipmediumdistrict number
28skipmediumper day slots
29skipmediumper hour slots
30skipmediumper schedule slots
31skipmediumtotal doc table
32suffixmediumtitle/suffix (ATTY., etc.)
33lastNamehighsurname of OIC
34firstNamehighgiven name of OIC
35middleNamehighmiddle name of OIC
36skipmediumadditional name field
37genderhighgender (M/F)
38phonehightelephone number
39phonehighalternate telephone number
40address1highoffice address
41skipmediumposition office code
42skipmediumdesignation office
43emailhighalternate email address
44skipmediumis_acting flag
45skipmediumis_dfa_emailadd flag
46skipmediumtotal_formtype
47skipmediumextract_formtype
48skipmediumtoextract_formtype
49skipmediumblocksatsun flag
50skipmediumis_activepost flag
51skipmediumis_delete flag

Notes: Web application user accounts export from COMELEC database. Contains COMELEC staff/administrator credentials with bcrypt-hashed passwords. Includes personal information for Office-In-Charge (OIC) staff members including names, contact info, addresses, and gender. Many password fields are empty (users without set passwords). Columns 7-31 contain geographic/administrative codes. Columns 32-42 contain personnel information for supervisory staff.

geo_codes_ref.txt
0 rows
Schema not yet detected for this file. Headers and field mappings will appear here once the LLM analysis completes.
new_id_released.txt
28 columns19,435,262 rows

File structure

Source columnMapped fieldConfidenceLLM assessment
6lastNamehigh[6] header 'LASTNAME', values are common Filipino surnames (DEMABILDO, TABAMO, CASTRO)
7firstNamehigh[7] header 'FIRSTNAME', values are given names (ROSEMARIE, MERCY, ORESTES)
8middleNamehigh[8] header 'MATERNALNAME', values are maternal/middle names (PAJARILLO, LOPEZ, HERNANDEZ)
9genderhigh[9] header 'SEX', values are M/F gender codes
11fullNamehigh[11] header 'SPOUSENAME', contains spouse's full names for married individuals
12address1high[12] header 'RESSTREET', values are street addresses (VILLA ANGELA SUBD. BRGY. PILAR, PUROK 7)
16cityhigh[16] header 'RESCITY', values are city names (HINIGARAN, INITAO)
17statehigh[17] header 'RESPROVINCE', values are province names (NEGROS OCCIDENTAL, MISAMIS ORIENTAL)
23lastNamehigh[23] header 'FLASTNAME', father's last name—still a surname/searchable name identifier
24firstNamehigh[24] header 'FFIRSTNAME', father's first name—still a name identifier
25middleNamehigh[25] header 'FMATERNALNAME', father's maternal name
26lastNamehigh[26] header 'MLASTNAME', mother's last name—still a surname identifier
27firstNamehigh[27] header 'MFIRSTNAME', mother's first name—still a name identifier
28middleNamehigh[28] header 'MMATERNALNAME', mother's maternal name
29lastNamehigh[29] header 'REPLASTNAME', representative/contact last name
30firstNamehigh[30] header 'REPFIRSTNAME', representative/contact first name
31middleNamehigh[31] header 'REPMATERNALNAME', representative/contact maternal name
32dobhigh[32] header 'DOBYEAR', birth year component (1959, 1986, 1983)
33dobhigh[33] header 'DOBMONTH', birth month component (02, 10, 01)
34dobhigh[34] header 'DOBDAY', birth day component (10, 24, 11)
35cityhigh[35] header 'BIRTHCITY', city of birth (CITY OF PARANÁ QUE, INITAO)
36statehigh[36] header 'BIRTHPROVINCE', province of birth (NATIONAL CAPITAL REGION, MISAMIS ORIENTAL)
37countryhigh[37] header 'CITIZENSHIP', country/citizenship code (B for Filipino)
40ziphigh[40] header 'COUNTRYRES', country of residence; also appears to encode postal/geographic codes
59ssnmedium[59] header 'TIN', Tax Identification Number (9-digit numeric identifier similar to SSN)
62lastNamehigh[62] header 'PASSPORTPLACE', passport place contains names; values like PASSPORTLOST, PASSPORTNB indicate passport data related to identity
71cityhigh[71] header 'REGCITY', registration city (residence registration)
72statehigh[72] header 'REGPROVINCE', registration province

Notes: COMELEC 2016 Philippine voter registration database. File contains full voter records with names (last, first, middle, maternal/patronymic), dates of birth (split into year/month/day), addresses, birthplace, citizenship, passport/identification numbers, family member names (parents, spouse, representative), and biometric/fingerprint data. Total columns: 95+. Columns mapped: 26 PII fields including personal identification data, family relationships, and location information. Fingerprint and biometric data columns (FINGER_INFO, FINGER_TOPO_COORD, QUALITY, MATCHING_FINGER, etc.) were excluded as they are encoded/binary. Internal administrative columns (IDs, timestamps, flags, status codes) were excluded per EXCLUSION RULES.

overseas_absentee_all.txt
37 columns1,763,311 rows

File structure

Source columnMapped fieldConfidenceLLM assessment
6lastNamehigh[6] header 'LASTNAME', values are family names (DE JESUS, CABANALAN, CHAN)
7firstNamehigh[7] header 'FIRSTNAME', values are given names (LEOPOLDO, TITO, ROSEMARIE)
8middleNamehigh[8] header 'MATERNALNAME', values are maternal/middle names (BARTOLOME, AMOYAN, CONSTANTINO)
9genderhigh[9] header 'SEX', values are M/F gender codes
12address1high[12] header 'RESSTREET', residential street addresses (3 HORN BILL, 1351 LEGASPI ST.)
15cityhigh[15] header 'RESCITY', city names (MEYCAUAYAN CITY, ALIMODIAN)
16statehigh[16] header 'RESPROVINCE', province/state names (BULACAN, ILOILO)
21address1high[21] header 'ABROADSTREET', overseas street addresses
22ziphigh[22] header 'ABROADZIP', postal codes (249969, 13066)
24cityhigh[24] header 'ABROADCITY', overseas city names (MACAU, SENTRUM)
25countryhigh[25] header 'ABROADCOUNTRY', country codes (QA, NO, SG, HK, KW)
39emailhigh[39] header 'EMAIL', values contain @ symbol (anamarieamador@yahoo.com, pauneml@cpchem)
51lastNamehigh[51] header 'FLASTNAME', father's last name (DE JESUS, CABANALAN)
52firstNamehigh[52] header 'FFIRSTNAME', father's first name (LIWANAG, CALIXTO)
53middleNamehigh[53] header 'FMATERNALNAME', father's maternal name
54lastNamehigh[54] header 'MLASTNAME', mother's last name (DE JESUS, AMOYAN)
55firstNamehigh[55] header 'MFIRSTNAME', mother's first name (PRISCILA, ROSARIO)
56middleNamehigh[56] header 'MMATERNALNAME', mother's maternal name (B., LACSON)
57lastNamehigh[57] header 'REPLASTNAME', representative last name
58firstNamehigh[58] header 'REPFIRSTNAME', representative first name
59middleNamehigh[59] header 'REPMATERNALNAME', representative maternal name
60dobhigh[60] header 'DOBYEAR', birth year component (1951, 1964, 1962)
61dobhigh[61] header 'DOBMONTH', birth month component (12, 12, 10)
62dobhigh[62] header 'DOBDAY', birth day component (27, 14, 11)
63cityhigh[63] header 'BIRTHCITY', birth city names
64statehigh[64] header 'BIRTHPROVINCE', birth province/state
130address1high[130] header 'MAILSTREET', mailing street address
131ziphigh[131] header 'MAILZIP', mailing postal code
132cityhigh[132] header 'MAILCITY', mailing city
133countryhigh[133] header 'MAILCOUNTRY', mailing country
135address1high[135] header 'REPSTREET', representative street
136cityhigh[136] header 'REPBARANGAY', representative barangay (area/subdivision)
137cityhigh[137] header 'REPCITY', representative city
138statehigh[138] header 'REPPROVINCE', representative province
168countryhigh[168] header 'CONTINENT', continent identifier
169countryhigh[169] header 'COUNTRY', country name
170cityhigh[170] header 'POST', postal/city reference

Notes: Philippine COMELEC voter registration database. File contains overseas absentee voter records with full PII: names (voter, parents, representatives), dates of birth (year/month/day components), addresses (residential, overseas, mailing, birth), email, and geographic locations. 171 total columns; mapped 30 PII columns containing searchable personal information. Columns not listed are non-PII (application IDs, registration codes, biometric fingerprint data, processing flags, timestamps, profession codes, physical characteristics like height/weight, internal system fields).

overseas_absentee_scratch.txt
69 columns138,928 rows

File structure

Source columnMapped fieldConfidenceLLM assessment
0skiphighFORM_ID - internal identifier
1skiphighAPP_TYPE - application type code
2skiphighREGISTRATION - registration status code
3lastNamehighLASTNAME field
4firstNamehighFIRSTNAME field
5middleNamehighMATERNALNAME field (mother's maiden name, used as middle name)
6genderhighSEX field containing M/F values
7skiphighMARITALSTATUS - not PII
8skiphighSPOUSENAME - relationship data, not direct voter PII
9address1highRESSTREET - residential street address
10skiphighRESPRECINCTCODE - precinct code
11skiphighRESREGION - region code
12skiphighRESBARANGAY - barangay code
13cityhighRESCITY - residential city
14statehighRESPROVINCE - residential province
15skiphighVILLAGE - address component
16skiphighCITY - duplicate city field
17skiphighPROVINCE - duplicate province field
18emailhighEMAIL field containing email addresses
19skiphighABROADSTATUS - overseas voter status code
20skiphighABROADSTATUSSPECIF - status specification
21lastNamehighFLASTNAME - father's last name
22firstNamehighFFIRSTNAME - father's first name
23middleNamehighFMATERNALNAME - father's maternal name
24lastNamehighMLASTNAME - mother's last name
25firstNamehighMFIRSTNAME - mother's first name
26middleNamehighMMATERNALNAME - mother's maternal name
27skiphighDOBYEAR - date of birth year (separate component)
28skiphighDOBMONTH - date of birth month
29skiphighDOBDAY - date of birth day
30skiphighBIRTHCITY - birth city
31skiphighBIRTHPROVINCE - birth province
32skiphighCITIZENSHIP - citizenship status
33skiphighNATURALIZATIONDATE - date field
34skiphighCERTIFICATENB - certificate number
35countryhighCOUNTRYRES - country of residence
36skipmediumCITYRESYEAR - years in city
37skipmediumCITYRESMONTH - months in city
38skiphighPROFESSION - occupation
39skiphighSECTOR - employment sector
40skiphighHEIGHT - physical characteristic
41skiphighWEIGHT - physical characteristic
42skiphighDISABLED - disability status
43skiphighASSISTEDBY - assistance indicator
44skiphighTIN - tax ID (not voter ID)
45skiphighPASSPORTNB - passport number (encrypted/masked when present)
46skiphighPASSPORTPLACE - passport place
47skiphighPASSYEAR - passport year
48skiphighPASSMONTH - passport month
49skiphighPASSDAY - passport day
50skiphighREGBARANGAY - registration barangay
51skiphighREGREGION - registration region
52skiphighREGCITY - registration city
53skiphighREGPROVINCE - registration province
54skiphighREG_DATE - registration date
55skiphighSTATIONID - station identifier
56skiphighLOCAL_ID - local identifier
57skiphighANNEXTYPE - annex type code
58skiphighANNEXRECORD - annex record
59skiphighCREATE_TIME - timestamp
60skiphighUPDATE_TIME - timestamp
61phonehighCONTACTNUMBER - phone contact number
62skiphighREFERENCENUMBER - voter reference number
63skiphighEMAIL_ID - email identifier
64skiphighUPDATED_DATETIME - timestamp
65skiphighIS_FRONTPAGE - boolean flag
66skiphighIS_REPRINT - boolean flag
67skiphighIS_OV - overseas voter flag
68skiphighIS_COUNTED - ballot counted flag

Notes: Philippine COMELEC voter registration database. Contains full voter registration records with personal identification data. Fields 3-5 are voter's name info; fields 21-26 contain biometric relative information (parents). Some name fields appear encrypted/hashed (base64-encoded values) in certain records. DOB stored as separate year/month/day fields (27-29). Phone numbers sometimes contain multiple entries separated by 'or'. Email field frequently empty. This is structured voter registry data, not a combo list.

web_id_disapproved.txt
11 columns13,001,017 rows

File structure

Source columnMapped fieldConfidenceLLM assessment
0skiphigh[0] FORM_ID - internal form identifier, numeric/auto-generated
4lastNamehigh[4] LASTNAME header, values are surnames (DEL ROSARIO, VALEROS, BARRERAS, etc.)
5firstNamehigh[5] FIRSTNAME header, values are given names (CORAZON, FERDINAND, ANALYN, etc.)
6middleNamehigh[6] MATERNALNAME header - maternal/middle name (CAGALPIN, MARIANO, SAWADAN, etc.)
7genderhigh[7] SEX header, values are F/M or S/M/W (single/married/widow indicators mixed with gender)
9address1high[9] RESSTREET - residential street address (BUENDIA BIGNAY I SARIAYA, etc.)
13cityhigh[13] CITY header, values are city names (SARIAYA, BANGUED, etc.)
14statehigh[14] PROVINCE header, values are province names (QUEZON, ABRA, etc.)
19dobhigh[19] DOBYEAR - birth year component (1971, 1969, 1993, etc.)
20dobhigh[20] DOBMONTH - birth month component (09, 04, 01, etc.)
21dobhigh[21] DOBDAY - birth day component (19, 10, 23, etc.)

Notes: Philippine voter registration database (COMELEC 2016 breach). File contains delimited voter records with 40 total columns. Excluded: APP_TYPE, REGISTRATION, ABSENTIA, MARITALSTATUS, RESPRECINCT, RESPRECINCTCODE, VILLAGE, RESBARANGAY, RESCITY, RESPROVINCE, BIRTHCITY, BIRTHPROVINCE, DISABLED (status flag), VINP1/VINP2/VINP3/VINCONTROLCODE (voter ID numbers), REG_DATE, UPDATE_TIME (timestamps), DISAPPROVED (status flag), LOCAL_ID, GOV_ID, APPLICATION_ID, PAGES_DESCR, ID, N_ID (all internal identifiers). Date of birth mapped across three separate columns (DOBYEAR, DOBMONTH, DOBDAY) all coded as dob field since they collectively represent DOB.

web_id_onhand.txt
121 columns5,752,070 rows

File structure

Format: CSV·Delimiter: pipe·Has header: yes·Quote: none

Source columnMapped fieldConfidenceLLM assessment
1skiphigh[1] APPLICATION_ID - internal application identifier, auto-generated numeric
2skiphigh[2] FORM_ID - internal form identifier, system reference
3skiphigh[3] APP_TYPE - application type code (L, H)
4skiphigh[4] ABSENTEE - status flag (L, O, V, R, N)
5skiphigh[5] REGISTRATION - registration status code
6lastNamehigh[6] LASTNAME - surname values (TABAMO, CASTRO, SADE, etc.)
7firstNamehigh[7] FIRSTNAME - given name values (MERCY, ORESTES, WALING WALING, etc.)
8middleNamehigh[8] MATERNALNAME - maternal/middle name values (LOPEZ, HERNANDEZ, SACAY, etc.)
9genderhigh[9] SEX - values are F/M (Female/Male)
10skiphigh[10] MARITALSTATUS - marital status code (S, M, W, etc.)
11skiphigh[11] SPOUSENAME - spouse full name (non-voter PII, mixed with empty)
12skiphigh[12] SPOUSEFIRSTNAME - spouse first name (non-voter PII)
13skiphigh[13] SPOUSELONGNAME - spouse long name (non-voter PII)
14address1high[14] RESSTREET - residential street address
15skiphigh[15] VILLAGE - village/barangay subdivision (geographic, not direct PII)
16cityhigh[16] CITY - city name (INITAO, BALAGTAS, etc.)
17statehigh[17] PROVINCE - province name (MISAMIS ORIENTAL, BULACAN, etc.)
18skiphigh[18] RESPRECINCT - precinct code, numeric identifier
19skiphigh[19] RESPRECINCTCODE - precinct code suffix
20skiphigh[20] RESBARANGAY - barangay code numeric identifier
21skiphigh[21] RESCITY - city code numeric identifier
22skiphigh[22] RESPROVINCE - province code numeric identifier
23address1medium[23] ABROADSTREET - overseas/abroad street address
24zipmedium[24] ABROADZIP - overseas postal/zip code
25skiphigh[25] ABSENTIA - absentia flag status
26citymedium[26] ABROADCITY - overseas city of residence
27countryhigh[27] ABROADCOUNTRY - overseas country of residence
28skiphigh[28] ABROADPERIOD - period of overseas residence (duration)
29skiphigh[29] ABROADRESCONT - residential contact status overseas
30countryhigh[30] REGCOUNTRY - registration country
31skiphigh[31] REGEMBASSY - embassy code identifier
32address1medium[32] MAILSTREET - mailing address street
33zipmedium[33] MAILZIP - mailing postal code
34citymedium[34] MAILCITY - mailing city
35countrymedium[35] MAILCOUNTRY - mailing country
36skiphigh[36] MAILEMBASSY - mailing embassy code
37address1medium[37] REPSTREET - representative/authorized agent street address
38skiphigh[38] REPBARANGAY - representative barangay code
39citymedium[39] REPCITY - representative city
40statemedium[40] REPPROVINCE - representative province
41emailhigh[41] EMAIL - email addresses with @ symbol
42skiphigh[42] ABROADSTATUS - status flag for overseas voters
43skiphigh[43] ABROADSTATUSSPECIF - specific status details (non-PII descriptor)
44skiphigh[44] LASTENTRYDATE - timestamp of last entry, not DOB
45skiphigh[45] ABSREGISTERED - registration status flag
46skiphigh[46] OLDPRECINCT - old precinct code identifier
47skiphigh[47] OLDREGBARANGAY - old barangay code
48skiphigh[48] OLDREGCITY - old city code
49skiphigh[49] OLDREGPROVINCE - old province code
50skiphigh[50] OLDREGDATE - old registration date, not DOB
51lastNamehigh[51] FLASTNAME - father's last name (family lineage, still personal)
52firstNamehigh[52] FFIRSTNAME - father's first name
53middleNamehigh[53] FMATERNALNAME - father's maternal name
54lastNamehigh[54] MLASTNAME - mother's last name
55firstNamehigh[55] MFIRSTNAME - mother's first name
56middleNamehigh[56] MMATERNALNAME - mother's maternal name
57lastNamehigh[57] REPLASTNAME - representative last name
58firstNamehigh[58] REPFIRSTNAME - representative first name
59middleNamehigh[59] REPMATERNALNAME - representative maternal name
60dobhigh[60] DOBYEAR - birth year (1949-1987 values)
61dobhigh[61] DOBMONTH - birth month component (01-12)
62dobhigh[62] DOBDAY - birth day component (01-31)
63citymedium[63] BIRTHCITY - city of birth
64statemedium[64] BIRTHPROVINCE - province of birth
65skiphigh[65] CITIZENSHIP - citizenship status (B=Filipino, etc.)
66skiphigh[66] NATURALIZATIONDATE - naturalization date (non-DOB timestamp)
67skiphigh[67] CERTIFICATENB - certificate number, internal ID
68skiphigh[68] COUNTRYRES - country of residence code
69skiphigh[69] CITYRESYEAR - year moved to city (duration, not DOB)
70skiphigh[70] CITYRESMONTH - month moved to city
71skiphigh[71] PROFESSION - occupation descriptor
72skiphigh[72] SECTOR - employment sector code
73skiphigh[73] HEIGHT - physical height (biometric, not PII identifier)
74skiphigh[74] WEIGHT - physical weight (biometric, not PII identifier)
75skiphigh[75] MARKS - physical marks/distinguishing features (biometric)
76skiphigh[76] DISABLED - disability status flag
77skiphigh[77] ASSISTEDBY - assisted by designation (operational flag)
78skiphigh[78] OLD_VIN - old voter ID number, internal identifier
79skiphigh[79] VINP1 - voter ID part 1 (internal reference)
80skiphigh[80] VINP2 - voter ID part 2
81skiphigh[81] VINP3 - voter ID part 3
82skiphigh[82] VINCONTROLCODE - voter ID control code
83skiphigh[83] TIN - tax identification number (financial, not standard PII)
84skiphigh[84] PASSPORTLOST - passport lost status flag
85skiphigh[85] PASSPORTNB - passport number (travel document, not voter PII for mapping)
86skiphigh[86] PASSPORTPLACE - passport issuance place (descriptor)
87skiphigh[87] PASSYEAR - passport year (non-DOB timestamp)
88skiphigh[88] PASSMONTH - passport month
89skiphigh[89] PASSDAY - passport day
90skiphigh[90] REGBARANGAY - registration barangay code
91skiphigh[91] REGCITY - registration city code
92skiphigh[92] REGPROVINCE - registration province code
93skiphigh[93] REG_DATE - registration date (non-DOB timestamp)
94skiphigh[94] INTERNAME - internal operator name (system staff, not voter)
95skiphigh[95] OFFICERNAME - officer name (system staff, not voter)
96skiphigh[96] OPERNAME - operator name (system staff, not voter)
97skiphigh[97] STATIONID - station identifier code
98skiphigh[98] CDID - CD identifier (internal)
99skiphigh[99] SETID - set identifier (internal)
100skiphigh[100] PRINT_FLAG - printing flag status
101skiphigh[101] FINGER_INFO - fingerprint data (biometric, not searchable PII)
102skiphigh[102] FINGER_TOPO_COORD - fingerprint topographical coordinates (biometric)
103skiphigh[103] QUALITY - fingerprint quality metric
104skiphigh[104] MATCHING_FINGER - matching finger code (biometric)
105skiphigh[105] TRANSFER_STATUS - transfer status flag
106skiphigh[106] TRANSFER_UPDATE_TIME - transfer update timestamp
107skiphigh[107] PAGES_DESCR - pages description (document metadata)
108skiphigh[108] LOCAL_ID - local identifier code
109skiphigh[109] CREATE_TIME - record creation timestamp
110skiphigh[110] UPDATE_TIME - record update timestamp
111skiphigh[111] LOCK_USER - lock user identifier
112skiphigh[112] LOCK_TIME - lock timestamp
113skiphigh[113] PROCESSING - processing status flag
114skiphigh[114] IS_CURRENT - current status flag
115skiphigh[115] DOC_VERSION - document version number
116skiphigh[116] CD_STAT_ENTY - CD status entry code
117skiphigh[117] DISAPPROVED - disapproval status flag
118skiphigh[118] VOTING_HIST1 - voting history flag 1
119skiphigh[119] VOTING_HIST2 - voting history flag 2
120skiphigh[120] OP_CODE - operation code
121skiphigh[121] OP_DATE - operation date (non-DOB timestamp)

Notes: Philippine Commission on Elections (COMELEC) 2016 voter registration database. This is a 122-column voter registration export containing full voter records with personal identifiers (names, DOB, addresses, email), parental information (father/mother names), and overseas voter data. Columns 0, 1-5, 18-22, 25, 28-31, 36, 38, 42-50, 65-121 are non-PII or system metadata. Multiple address fields (residential, mailing, representative) are included and all mapped. Biometric fields (fingerprints, height, weight, physical marks) excluded. Family relation names (father, mother, spouse, representative) included as they constitute searchable personal identifiers in the context of voter records.

webvs_primary_7therb.txt
13 columns75,302,279 rows

File structure

Source columnMapped fieldConfidenceLLM assessment
4lastNamehigh[4] header 'LASTNAME', values are surnames (DEL ROSARIO, PERLAS, VALEROS, etc.)
5firstNamehigh[5] header 'FIRSTNAME', values are given names (CORAZON, JOANNA ROSE, JOHN LENON, etc.)
6middleNamehigh[6] header 'MATERNALNAME', values are middle names (CAGALPIN, PRINCENA, BRINGAS, etc.)
7genderhigh[7] header 'SEX', values are M/F (F, F, M, F, etc.)
9address1high[9] header 'RESSTREET', values are street addresses (BUENDIA BIGNAY I SARIAYA, _, DALNETAN, etc.)
10address2high[10] header 'VILLAGE', values are village/barangay names (BIGNAY 1, AGTANGAO, etc.)
11cityhigh[11] header 'CITY', values are city names (SARIAYA, BANGUED, etc.)
12statehigh[12] header 'PROVINCE', values are province names (QUEZON, ABRA, etc.)
18dobhigh[18] header 'DOBYEAR', year component of DOB (1971, 1992, 1993, etc.)
19dobhigh[19] header 'DOBMONTH', month component of DOB (09, 12, 03, etc.)
20dobhigh[20] header 'DOBDAY', day component of DOB (19, 14, 10, etc.)
21citymedium[21] header 'BIRTHCITY', values are city names (SARIAYA, BANGUED, etc.) — birth location city
22statemedium[22] header 'BIRTHPROVINCE', values are province names (QUEZON, ABRA, etc.) — birth location province

Notes: This is a voter registration database from the Philippine COMELEC 2016 breach. The file contains 39 columns total; 13 map to PII fields (names, DOB components, address components, city, state, gender). Columns not listed (FORM_ID, APP_TYPE, REGISTRATION, ABSENTIA, MARITALSTATUS, RESPRECINCT, RESPRECINCTCODE, RESBARANGAY, RESCITY, RESPROVINCE, DISABLED, VINP1, VINP2, VINP3, VINCONTROLCODE, REG_DATE, UPDATE_TIME, DISAPPROVED, LOCAL_ID, APPLICATION_ID, PAGES_DESCR, ID, N_ID) are skipped as they are internal IDs, registration metadata, VIN data, timestamps, or system identifiers.

webvs_primary_data_errors.txt
0 rows

File structure

Notes: This file contains only internal administrative data with no PII fields. All 8 columns are non-PII: [0] ID is an internal voter ID number (skip), [1] REGISTRATION is a registration status code (skip), [2] REG_DATE is a timestamp (skip), [3] CREATE_TIME is a timestamp (skip), [4] RESPROVINCE is a geographic code (skip), [5] RESCITY is a geographic code (skip), [6] REMARKS contains only data quality error descriptions and flags (skip), [7] ID_NEW is a derived internal reference ID (skip). No personal identifiable information is present in this extract.