Home Blog PII vs PHI Guide

PII vs PHI: Complete Data Classification Guide for Compliance

Master the critical differences between Personally Identifiable Information (PII) and Protected Health Information (PHI) — with examples, regulatory frameworks, and practical classification strategies for your organization.

In the world of data privacy and security, two acronyms come up constantly: PII (Personally Identifiable Information) and PHI (Protected Health Information). While related, they refer to different categories of sensitive data — each governed by distinct regulations and requiring specific protection measures.

Misclassifying data can lead to compliance violations, hefty fines, and damaged customer trust. Whether you're a startup founder, IT professional, or compliance officer, understanding these distinctions is fundamental to building a robust data protection strategy.

Quick Summary

PII = Any data that can identify an individual (name, email, SSN). PHI = PII + health information held by HIPAA-covered entities. All PHI is PII, but not all PII is PHI.

What is PII?

Personally Identifiable Information (PII) is any data that can be used — alone or combined with other information — to identify, contact, or locate a specific individual.

The US National Institute of Standards and Technology (NIST) defines PII as information that can be linked to a specific person through "direct" or "indirect" identification.

Types of PII

Direct Identifiers (Sensitive PII)

Indirect Identifiers (Non-Sensitive PII)

Important

Studies show that 87% of US population can be uniquely identified using just ZIP code + birth date + gender. Indirect identifiers combined become sensitive PII.

What is PHI?

Protected Health Information (PHI) is a specific subset of PII that includes any individually identifiable health information created, received, maintained, or transmitted by a HIPAA-covered entity or business associate.

The key distinction: PHI relates to a person's past, present, or future physical or mental health, healthcare services received, or payment for healthcare services.

What Qualifies as PHI?

PHI includes health information combined with any of the 18 HIPAA identifiers:

PII & PHI by the Numbers

$4.45MAverage Data Breach Cost
$10.93MHealthcare Breach Avg
18HIPAA Identifiers

PII vs PHI: Key Differences

AspectPIIPHI
ScopeAny identifying personal dataHealth-related PII only
Regulatory BodyFTC, State AGs, GDPR, CCPAHHS Office for Civil Rights
Primary LawGDPR, CCPA, State lawsHIPAA + HITECH Act
Covered EntitiesAny organizationHealthcare providers, plans, clearinghouses
PenaltiesVaries by jurisdictionUp to $1.9M/year per category
Breach NotificationVaries by state/law60 days (HIPAA mandate)

Real-World Examples

PII Only (Not PHI)

PHI (Always Also PII)

The Critical Test

Ask yourself: "Is this health information held by a HIPAA-covered entity or business associate?" If YES → it's PHI. If NO → it's likely PII subject to other privacy laws.

When PHI Becomes Just PII

Health information loses its PHI status (and HIPAA protection) when:

Protection Strategies

For PII

For PHI (Additional Requirements)

Need Help with Data Classification?

Our experts can help you classify, inventory, and protect PII and PHI across your organization to meet all regulatory requirements.

Get Data Classification Audit

Other Important Data Classifications

Beyond PII and PHI, you should be aware of:

Best Practices for Data Classification

Step 1: Data Discovery

Inventory all data your organization collects, processes, and stores. Use automated discovery tools for accuracy.

Step 2: Classification Framework

Create tiers like Public, Internal, Confidential, Restricted — and map data types to each tier.

Step 3: Labeling

Apply visible labels to documents, emails, and systems indicating sensitivity level.

Step 4: Handling Procedures

Define how each classification tier must be stored, transmitted, shared, and destroyed.

Step 5: Training

Train all employees to recognize and properly handle different data classifications.

Conclusion

Understanding the difference between PII and PHI is the foundation of effective data privacy compliance. PII casts a wide net covering any identifying information, while PHI is the specialized subset governed by HIPAA in healthcare contexts.

The most important takeaway: classify your data accurately, apply the appropriate regulatory framework, and implement robust protection measures. When in doubt, treat data as more sensitive rather than less — the cost of over-protection is far less than the cost of a breach.

TS

Trouble Shooters Team

Our data privacy and compliance experts help organizations classify, inventory, and protect sensitive data across all major regulatory frameworks including HIPAA, GDPR, CCPA, and more.