Skip to main content
Skip table of contents

Legacy profile set classifiers

This section describes all the path classifiers used by the profile sets HIPAA - Legacy and Financial - Legacy included with the product, as well as several data classifiers included with the product but not in any of the pre-constructed Profile Sets.

Account numbers

An account number is the primary identifier for ownership of an account, whether a vendor account, a checking or brokerage account, or a loan account. An account number is used whether or not the identifier uses letters or numbers. Below are the classifiers Delphix uses to identify account numbers:

Classifier Name

Domain

Type

Regular Expression

Account number

ACCOUNT_NO

Path

(?>(acc(oun\|n)?t)_?(num(ber)?\|nbrjno)?)(?!\w\*(ID\|type))

Physical addresses

Below are the classifiers Delphix uses to identify physical addresses:

Classifier Name

Domain

Type

Regular Expression

Address

ADDRESS

Path

^(?:(?!postalcode\|city\|state\|country\|email\|(l\|ln\|lin\|line)?_?2{1}\|ID).)*addre?s?s?_?(?:(?!city\|state\|country\|email|(l\|ln\|lin\|line)?_?2{1}\|ID).)*$

Street Address

ADDRESS

Path

(?>(str(eet)?_?addre?s?s?\|street))(?!\w*(ID\|type))

Data - Address

ADDRESS

Regex

(.[\s]+b(ou)?*l(e)?v(ar)?d[\s].)|(.[\s]+st[.]?(reet)?[\s].)|(.[\s]+ave[.]?(nue)?[\s].)|(.[\s]+r(oa)?d[\s].)|(.[\s]+l(a)?n(e)?[\s].)|(.[\s]+cir(cle)?[\s].*)

Address Line2 - before

ADDRESS_LINE2

Path

^(?:(?!email\|ID).)*(l\|ln\|lin\|line)?2{1}_?addre?s?s?(?:(?!email\|ID).)*$

Address Line2 - after

ADDRESS_LINE2

Path

^(?:(?!email\|ID).)*addre?s?s?_?(l\|ln\|lin\|line)?_?2{1}(?:(?!email\|ID).)*$

Data - Address Line 2

ADDRESS_LINE2

Regex

(.*[\s]*ap(ar)?t(ment)?[\s]+.*)|(.*[\s]*s(ui)?te[\s]+.*)\|(c(are)?[\s]*[\\\\]?[/]?o(f)?[\s]+.*)

Beneficiary ID

Below are the classifiers Delphix uses to identify beneficiary IDs:

Classifier Name

Domain

Type

Regular Expression

Beneficiary number

BENEFICIARY_NO

Path

(?>(bene(ficiary)?)_?(num(ber)?|nbr\|no))(?!\w*ID)1

Beneficiary ID

BENEFICIARY_NO

Path

(?>(bene(ficiary)?)_?id)

Biometrics

Below are the classifiers Delphix uses to identify biometric data:

Classifier Name

Domain

Type

Regular Expression

Biometric

BIOMETRIC

Path

biometric

Certificate ID

Below are the classifiers Delphix uses to identify certificate IDs:

Classifier Name

Domain

Type

Regular Expression

Certificate number

CERTIFICATE_NO

Path

(?>cert(ificate)?_?(num(ber)?\|nbr\|no\|id))

Certificate ID

CERTIFICATE_NO

Path

(?>cert(ificate)?_?id)

City

Below are the classifiers Delphix uses to identify cities:

Classifier Name

Domain

Type

Regular Expression

City

CITY

Path

ci?ty(?!\w*ID)

Country

Below are the classifiers Delphix uses to identify countries:

Classifier Name

Domain

Type

Regular Expression

Country

COUNTRY

Path

c(ou)?nty(?!\w*ID)

Credit card

Below are the classifiers Delphix uses to identify credit cards:

Classifier Name

Domain

Type

Regular Expression

Card number

CREDIT CARD

Path

(?>ca?rd_?(num(ber)?\|nbr\|no)?)(?!\w*ID)

Credit Card number

CREDIT CARD

Path

(?>cre?di?t_?(ca?rd)?_?(num(ber)?\|nbr\|no)?)(?!\w*ID)

Data - Credit Card

CREDIT CARD

Regex

^(?:3[47][0-9]{13}|4[0-9]{12}(?:[0-9]{3})?(?:[0-9]{3})?\|(?:5[1-5][0-9]{2}\|222[1-9]\|22[3-9][0-9]\|2[3-6][0-9]{2}\|27[01][0-9]\|2720)[0-9]{12}\|6(?:(011\|5[0-9][0-9])[0-9]{2}\|4[4-9][0-9]{3}\|2212[6-9]\|221[3-9][0-9]\|22[2-8][0-9]{2}\|229[0-1][0-9]|2292[0-5])[0-9]{10}?(?:[0-9]{3})?\|3(?:0[0-5,9]\|6[0-9])[0-9]{11}\|3[89][0-9]{14}?(?:[0-9]{1,3})?)$

Customer number

Below are the classifiers Delphix uses to identify customer IDs:

Classifier Name

Domain

Type

Regular Expression

Customer number

CUSTOMER_NUM

Path

(?>(cu?st(omer\|mr)?)_?(num(ber)?\|nbr|no)?)(?!\w*ID)

Date of birth

Below are the classifiers Delphix uses to identify dates of birth:

Classifier Name

Domain

Type

Regular Expression

Birth Date

DOB

Path

(?>(bi?rth)_?(date?\|day\|dt))(?!\w*ID)

Birth Date1

DOB

Path

(?>dob\|dtofb\|(day\|date?\|dt)_?(of)?_?(bi?rth))(?!\w*ID)

Birth Date2

DOB

Path

(?>b_?(date?\|day))(?!\w*ID)

Admission Date

DOB

Path

(?>(adm(it\|ission)?)_?(date?\|day\|dt))(?!\w*ID)

Treatment Date

DOB

Path

(?>(tr(ea)?t(ment)?)_?(date?\|day|dt))(?!\w*ID)

Discharge Date

DOB

Path

(?>(ds\|disc(h\|harge)?)_?(date?\|day\|dt))(?!\w*ID)

Driver license number

Below are the classifiers Delphix uses to identify driver license numbers:

Classifier Name

Domain

Type

Regular Expression

Drivers License number

DRIVING_LC

Path

(?>(dri?v(e?rs?e?)?)_?(license|li?c)?_?(num(ber)?\|nbr|no)?)(?!\w*ID)

Drivers License number1

DRIVING_LC

Path

(^license$\|(license\|li?c)_?(num(ber)?\|nbr\|no))(?!\w*ID)

Email

Below are the classifiers Delphix uses to identify emails:

Classifier Name

Domain

Type

Regular Expression

Email

EMAIL

Path

^(?:(?!invalid).)*email(?!\w*ID)

Data - Email

EMAIL

Regex

\b[A-Z0-9._%+-]+@[A-Z0-9.-]+\.[A-Z]{2,6}\b

First name

Below are the classifiers Delphix uses to identify first names:

Classifier Name

Domain

Type

Regular Expression

First Name

FIRST_NAME

Path

(?>(fi?rst)_?(na?me?)\|f_?name)(?!\w*ID)

Middle Name

FIRST_NAME

Path

(?>(mid(dle)?)_?(na?me?)\|m_?name)(?!\w*ID)

IP address

Below are the classifiers Delphix uses to IP addresses:

Classifier Name

Domain

Type

Regular Expression

IP Address

IP ADDRESS

Path

(?>(ip_?addre?s?s?))(?!\w*(ID\|type))

Data - IP Address

IP ADDRESS

Regex

\b(?:(?:25[0-5]\|2[0-4][0-9]\|1[0-9][0-9]\|[1-9]?[0-9])\.){3}(?:25[0-5]\|2[0-4][0-9]\|1[0-9][0-9]\|[1-9]?[0-9])\b

Last name

Below are the classifiers Delphix uses to identify last names:

Classifier Name

Domain

Type

Regular Expression

Last Name

LAST_NAME

Path

^(?:(?!portal\|ID).)*((la?st)_?(na?me?)\|l_?name)(?:(?!portalname\|ID).)*$

Plate number

Below are the classifiers Delphix uses to identify plate numbers:

Classifier Name

Domain

Type

Regular Expression

License Plate

PLATE_NO

Path

^(?:(?!template|ID|type).)*(license\|li?c)?_?plate_?(num(ber)?\|nbr\|no)?(?:(?!template\|ID\|type).)*$

PO Box numbers

Below are the classifiers Delphix uses to identify PO box numbers:

Classifier Name

Domain

Type

Regular Expression

PO Box

PO_BOX

Path

po_?box

Data - PO Box

PO_BOX

Regex

po box\|p\.o\

Precinct

Below are the classifiers Delphix uses to identify precincts:

Classifier Name

Domain

Type

Regular Expression

Precinct

PRECINCT

Path

(>?precinct\|prcnct)(?!\w*ID)

Record number

Below are the classifiers Delphix uses to identify record numbers:

Classifier Name

Domain

Type

Regular Expression

Record number

RECORD_NO

Path

(?>rec(ord)?_?(num(ber)?\|nbr\|no))(?!\w*(ID\|type))

School name

Below are the classifiers Delphix uses to identify school names:

Classifier Name

Domain

Type

Regular Expression

School Name

SCHOOL_NM

Path

(?>school_?na?me?)(?!\w*ID)

Security code

Below are the classifiers Delphix uses to identify security codes:

Classifier Name

Domain

Type

Regular Expression

Security Code

SECURITY_CODE

Path

(?>se?cu?r(i?ty?)?_?co?de?)(?!\w*ID)

Serial number

Below are the classifiers Delphix uses to identify serial numbers:

Classifier Name

Domain

Type

Regular Expression

Serial number

SERIAL_NM

Path

(?>(ser(ial)?)_?(num(ber)?\|nbr|no))(?!\w*ID)

Signature

Below are the classifiers Delphix uses to identify signatures:

Classifier Name

Domain

Type

Regular Expression

Signature

SIGNATURE

Path

signature(?!\w*(ID\|type))

Social security number

Below are the classifiers Delphix uses to social security numbers:

Classifier Name

Domain

Type

Regular Expression

Social Security number

SSN

Path

ssn(?!\w*ID)

Data - SSN

SSN

Regex

\b(?!000)(?!666)[0-8]\d{2}[- ](?!00)\d{2}[- ](?!0000)\d{4}\b

Tax ID

Below are the classifiers Delphix uses to identify tax IDs:

Classifier Name

Domain

Type

Regular Expression

Tax ID number

TAX_ID

Path

tin$\|^tin\|_tin\|tin_

Tax ID Code or number

TAX_ID

Path

(ta?x)_?(id(ent)?)?_?((co?de?)\|(num(ber)?\|nbr\|no))?

Telephone number

Below are the classifiers Delphix uses to identify telephone numbers:

Classifier Name

Domain

Type

Regular Expression

Telphone or Contact number

TELEPHONE_NO

Path

(?>((tele?)?phone)\|(co?nta?ct\|tel)_?(num(ber)?\|nbr\|no))(?!\w*(ID\|type))

Data - Phone number

TELEPHONE_NO

Regex

\(?\b[0-9]{3}\)?[-. ]?[0-9]{3}[-. ]?[0-9]{4}\b

Fax number

TELEPHONE_NO

Regex

(?>fax_?(num(ber)?\|nbr\|no)?)(?!\w*(ID\|type))

Vin number

Below are the classifiers Delphix uses to identify vin numbers:

Classifier Name

Domain

Type

Regular Expression

Vehicle

VIN_NO

Path

vehicle

VIN

VIN_NO

Path

vin$\|^vin\|_vin\|vin_

Web address

Below are the classifiers Delphix uses to identify web addresses:

Classifier Name

Domain

Type

Regular Expression

Web or URL Address

WEB

Path

(?>(url\|web_?addre?s?s?))(?!\w*(ID\|type))

Data - Web Address

WEB

Regex

\b(?:(?:https?\|ftp\|file)://\|www\.\|ftp\.)[-A-Z0-9+&-@#/%=~_\|$?!:,.]*[A-Z0-9+&-@#/%=~_\|$]

ZIP code

Below are the classifiers Delphix uses to identify zip codes:

Classifier Name

Domain

Type

Regular Expression

zip or Postal Code

ZIP

Path

(?>(zip\|post(al)?)_?((co?de?)?4?))(?!\w*ID)

Data - Zip Code

ZIP

Regex

1\b([0-9]{5})-([0-9]{4})\b

JavaScript errors detected

Please note, these errors can depend on your browser setup.

If this problem persists, please contact our support.