PhishNet — Phishing Detection Infrastructure

Built on peer-reviewed research

NSF Funded

This infrastructure was developed under a National Science Foundation grant awarded to Alabama A&M University's Cybersecurity Laboratory, supporting rigorous academic research into email-based threat detection at scale.

Springer Publication

The detection methodology and experimental results were peer-reviewed and published through Springer, one of the world's leading academic publishers, validating our multi-signal classification approach.

SAM'25 Las Vegas

Presented at the International Conference on Security & Management (SAM'25) in Las Vegas, where the system was reviewed by security researchers and practitioners from academia and industry worldwide.

Every signal, working in parallel

Google Safe Browsing

Real-time lookup against Google's phishing and malware URL database, updated continuously with newly discovered threats.

OpenPhish Feed

Community-verified phishing URL intelligence, cross-referenced against our scan requests for immediate threat identification.

DNSBL Queries

Parallel lookups against Spamhaus, SURBL, and URIBL blocklists to catch known spam infrastructure and malicious domains instantly.
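As a rough illustration of what a DNSBL lookup involves, the sketch below builds the DNS names such a query resolves. The zone names are the public zones those projects publish; which zones PhishNet actually queries, and the helper names used here, are assumptions.

```python
import ipaddress
import socket

# Public zones; the exact set PhishNet uses is an assumption.
IP_ZONE = "zen.spamhaus.org"
DOMAIN_ZONES = ("multi.surbl.org", "multi.uribl.com")


def ip_query_name(ip: str, zone: str = IP_ZONE) -> str:
    """Reverse the IPv4 octets and append the zone: 1.2.3.4 -> 4.3.2.1.<zone>."""
    octets = str(ipaddress.IPv4Address(ip)).split(".")
    return ".".join(reversed(octets)) + "." + zone


def domain_query_names(domain: str) -> list[str]:
    """Domain blocklists are queried as <domain>.<zone>."""
    domain = domain.lower().rstrip(".")
    return [f"{domain}.{zone}" for zone in DOMAIN_ZONES]


def is_listed(query_name: str) -> bool:
    """Treat a 127.x.x.x answer as a listing and NXDOMAIN as not listed."""
    try:
        answer = socket.gethostbyname(query_name)
    except socket.gaierror:
        return False
    # Each list documents its own return codes, including codes that mean
    # "query refused"; Spamhaus uses 127.255.255.x for such errors.
    return answer.startswith("127.") and not answer.startswith("127.255.255.")
```

Running the lookups in parallel could be done by mapping `is_listed` over the query names with a thread pool; that part is omitted here.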

ML Classifier

A TF-IDF ensemble classifier trained on 80,000 emails, achieving a 95% F1 score on held-out test sets across multiple phishing categories.
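For readers unfamiliar with TF-IDF, here is a minimal standard-library sketch of the weighting step only. It assumes whitespace tokenization and a smoothed IDF of ln((1+N)/(1+df)) + 1; the tokenizer, the weighting variant, and the ensemble models PhishNet actually uses are not described on this page, and `tfidf_vectors` is a hypothetical name.

```python
import math
from collections import Counter


def tfidf_vectors(documents: list[str]) -> list[dict[str, float]]:
    """Return one {term: weight} mapping per document."""
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            # Term frequency times smoothed inverse document frequency.
            term: (count / len(tokens))
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


vectors = tfidf_vectors(["verify your account now", "meeting notes for your team"])
# "verify" appears in one document, "your" in both, so "verify" weighs more.
print(vectors[0]["verify"] > vectors[0]["your"])
```

Terms that are common across the corpus are down-weighted, which is why urgency words concentrated in phishing emails tend to stand out as features.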

Header Analysis

Deep inspection of SPF, DKIM, and DMARC authentication results, plus Reply-To spoofing and display name impersonation detection.
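A minimal sketch of this kind of header inspection, using only Python's standard `email` package. It assumes the verdicts appear in an `Authentication-Results` header as `spf=...; dkim=...; dmarc=...`, and it compares only the From and Reply-To domains; `analyze_headers` is a hypothetical helper, not PhishNet's implementation.

```python
import re
from email import message_from_string
from email.utils import parseaddr


def analyze_headers(raw_headers: str) -> dict:
    """Extract SPF/DKIM/DMARC verdicts and check for a Reply-To domain mismatch."""
    msg = message_from_string(raw_headers)
    auth = msg.get("Authentication-Results", "")
    results = {"spf": None, "dkim": None, "dmarc": None}
    for mechanism, verdict in re.findall(r"\b(spf|dkim|dmarc)=(\w+)", auth, flags=re.I):
        results[mechanism.lower()] = verdict.lower()

    # parseaddr splits 'Display Name <user@host>' into (name, address).
    _, from_addr = parseaddr(msg.get("From", ""))
    _, reply_addr = parseaddr(msg.get("Reply-To", ""))
    from_domain = from_addr.rpartition("@")[2].lower()
    reply_domain = reply_addr.rpartition("@")[2].lower()
    results["reply_to_mismatch"] = bool(reply_addr) and reply_domain != from_domain
    return results


sample = (
    "Authentication-Results: spf=fail; dkim=fail; dmarc=fail (p=REJECT)\r\n"
    "From: PayPal Support <support@paypa1.com>\r\n"
    "Reply-To: attacker@gmail.com"
)
print(analyze_headers(sample))
```

A Reply-To on a different domain is not malicious by itself (mailing lists do it routinely), so a signal like this is best combined with the authentication verdicts rather than used alone.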

URL Intelligence

Shannon entropy scoring, homoglyph character substitution detection, redirect chain traversal, and typosquatting pattern matching.
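Shannon entropy measures how evenly characters are distributed in a string, so machine-generated hostnames tend to score higher than dictionary-like ones. A minimal sketch follows; any threshold applied on top of it would be an assumption, since this page does not state one.

```python
import math
from collections import Counter


def shannon_entropy(text: str) -> float:
    """Entropy in bits per character: -sum(p * log2(p)) over character frequencies."""
    if not text:
        return 0.0
    total = len(text)
    return -sum(
        (count / total) * math.log2(count / total)
        for count in Counter(text).values()
    )


# A repetitive string has zero entropy; four distinct characters give 2 bits.
print(shannon_entropy("aaaa"), shannon_entropy("abcd"))
```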

Evasion Detection

Identifies Base64-encoded payloads, CSS-hidden content, misleading HTML comments, and other obfuscation techniques used to bypass filters.
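Two of these checks can be approximated with regular expressions. The sketch below is a heuristic illustration under stated assumptions (a 24-character minimum run for Base64, three common CSS hiding patterns); the helper names are hypothetical and PhishNet's real rules are not published here.

```python
import base64
import binascii
import re

# Long runs of Base64 alphabet characters, optionally padded.
BASE64_RUN = re.compile(r"[A-Za-z0-9+/]{24,}={0,2}")
# Common ways to hide text from the reader while leaving it in the markup.
HIDDEN_CSS = re.compile(
    r"display\s*:\s*none|visibility\s*:\s*hidden|font-size\s*:\s*0(?![.\d])",
    re.IGNORECASE,
)


def find_base64_payloads(text: str) -> list[str]:
    """Return Base64 runs that decode cleanly to UTF-8 text."""
    decoded = []
    for match in BASE64_RUN.finditer(text):
        candidate = match.group(0)
        if len(candidate) % 4:
            continue  # valid padded Base64 has a length divisible by 4
        try:
            decoded.append(base64.b64decode(candidate, validate=True).decode("utf-8"))
        except (binascii.Error, UnicodeDecodeError):
            continue
    return decoded


def has_css_hidden_content(html: str) -> bool:
    """True if the markup uses one of the hiding patterns above."""
    return bool(HIDDEN_CSS.search(html))
```

Decoded payloads can then be passed through the same URL and content checks as the visible body, which is the point of detecting them.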

Domain Age

WHOIS-based domain registration age lookup. Domains registered fewer than 30 days ago are flagged as high risk, because very recent registration is a consistent indicator of phishing infrastructure.
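The 30-day rule itself is simple once a creation date is available. The sketch below assumes the WHOIS lookup has already produced a timezone-aware creation timestamp; the lookup step and the function names are hypothetical.

```python
from datetime import datetime, timezone
from typing import Optional

HIGH_RISK_AGE_DAYS = 30  # threshold stated above


def domain_age_days(created: datetime, now: Optional[datetime] = None) -> int:
    """Whole days elapsed since the domain's WHOIS creation date."""
    now = now or datetime.now(timezone.utc)
    return (now - created).days


def is_high_risk_age(created: datetime, now: Optional[datetime] = None) -> bool:
    """Flag domains registered fewer than 30 days ago."""
    return domain_age_days(created, now) < HIGH_RISK_AGE_DAYS


registered = datetime(2025, 6, 20, tzinfo=timezone.utc)
checked_at = datetime(2025, 6, 30, tzinfo=timezone.utc)
print(domain_age_days(registered, checked_at), is_high_risk_age(registered, checked_at))
```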

Simple, predictable API

Base URL: https://scanner-api-st8w.onrender.com

Auth: X-API-Key header · per-key rate limits (10/min on /api/scan, 20/min on layer endpoints)
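Clients that scan in bulk may want to stay under the per-key limit on their side. The following is a minimal sliding-window throttle sketch; `MinuteThrottle` is a hypothetical helper, and how the API responds when the limit is exceeded is not documented here, so this only paces outgoing calls.

```python
import time
from collections import deque


class MinuteThrottle:
    """Allow at most max_calls within any rolling period (seconds)."""

    def __init__(self, max_calls, period=60.0, clock=time.monotonic, sleep=time.sleep):
        self.max_calls = max_calls
        self.period = period
        self.clock = clock
        self.sleep = sleep
        self.calls = deque()  # timestamps of recent calls, oldest first

    def wait_time(self) -> float:
        """Seconds to wait before the next call is allowed (0.0 if allowed now)."""
        now = self.clock()
        while self.calls and now - self.calls[0] >= self.period:
            self.calls.popleft()  # forget calls that left the window
        if len(self.calls) < self.max_calls:
            return 0.0
        return self.period - (now - self.calls[0])

    def acquire(self) -> None:
        """Block until a call is allowed, then record it."""
        while True:
            delay = self.wait_time()
            if delay <= 0:
                break
            self.sleep(delay)
        self.calls.append(self.clock())


scan_throttle = MinuteThrottle(max_calls=10)  # 10/min on /api/scan
# Call scan_throttle.acquire() immediately before each POST to /api/scan.
```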

Request Body

JSON

{
  "email_address": "support@paypa1.com",
  "email_text": "Verify your PayPal account immediately or it will be suspended",
  "email_headers": "Authentication-Results: spf=fail; dkim=fail; dmarc=fail (p=REJECT)\r\nFrom: PayPal Support <support@paypa1.com>\r\nReply-To: attacker@gmail.com"
}

Response

JSON

{
  "scan_id": "1a49b796-d242-448d-929a-dbec00106dcf",
  "scam_score": 83.38,
  "risk_level": "CRITICAL",
  "labels": ["Homoglyph Domain", "DMARC Fail", "Phishing Content"],
  "recommendations": [
    "Do not click any links in this email",
    "This email failed DMARC authentication"
  ],
  "content_analysis": {
    "prediction": "Phishing Email",
    "confidence": 0.986,
    "is_phishing": true
  },
  "email_verification": {
    "homoglyph_detected": true,
    "risk_score": 100.0
  }
}

Code Examples

bash

curl -X POST https://scanner-api-st8w.onrender.com/api/scan \
  -H "Content-Type: application/json" \
  -H "X-API-Key: your_api_key_here" \
  -d '{"email_address": "test@example.com", "email_text": "Your message here"}'

python

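import requests

# Submit one email for scanning; the API key goes in the X-API-Key header.
response = requests.post(
    "https://scanner-api-st8w.onrender.com/api/scan",
    headers={"X-API-Key": "your_api_key_here"},
    json={"email_address": "test@example.com", "email_text": "Your message here"},
    timeout=30,  # avoid hanging indefinitely if the service is slow to respond
)
response.raise_for_status()  # raise on HTTP error statuses instead of parsing an error body
result = response.json()
print(f"Risk Level: {result['risk_level']}, Score: {result['scam_score']}")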

javascript

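// Top-level await requires an ES module; otherwise wrap this in an async function.
const response = await fetch('https://scanner-api-st8w.onrender.com/api/scan', {
  method: 'POST',
  headers: {'Content-Type': 'application/json', 'X-API-Key': 'your_api_key_here'},
  body: JSON.stringify({email_address: 'test@example.com', email_text: 'Your message here'})
});
// fetch does not reject on HTTP error statuses, so check explicitly.
if (!response.ok) throw new Error(`Scan request failed: HTTP ${response.status}`);
const result = await response.json();
console.log(`Risk Level: ${result.risk_level}, Score: ${result.scam_score}`);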

Request Body

JSON

{
  "email_text": "Click here: https://paypa1.com/verify"
}

Request Body

JSON

{
  "email_text": "Urgent: Your account has been compromised. Click here to verify now."
}

Response

JSON

{
  "status": "healthy",
  "ml_model_loaded": true
}

Transparent by design

Every field in the response has a purpose. Nothing is a black box.

Response Structure

{
  "scan_id": "1a49b796-...",        // unique per request
  "scam_score": 83.38,            // 0-100 composite
  "risk_level": "CRITICAL",        // LOW/MEDIUM/HIGH/CRITICAL
  "labels": [...],               // human-readable signals
  "recommendations": [...],      // actionable guidance
  "content_analysis": {          // ML model output
    "prediction": "Phishing Email",
    "confidence": 0.986,         // 0.0-1.0
    "is_phishing": true
  },
  "header_analysis": {           // SPF/DKIM/DMARC
    "dmarc": "fail",
    "reply_to_mismatch": true,
    "spoofed_brand": "PayPal",
    "flags": [...]
  },
  "email_verification": {        // domain checks
    "homoglyph_detected": true,  // paypa1 != paypal
    "risk_score": 100.0
  }
}

Composite Score (scam_score)

A weighted average across all active detection layers, normalized to 0-100. Weights are calibrated on the 80k training corpus to minimize false positives on legitimate transactional emails.
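To make the idea of a weighted average over active layers concrete, here is a small sketch. The layer names and weights are invented for illustration; the calibrated weights are not published on this page.

```python
# Hypothetical weights; the real calibrated values are not published here.
EXAMPLE_WEIGHTS = {"content": 0.5, "headers": 0.3, "domain": 0.2}


def composite_score(layer_scores: dict, weights: dict = EXAMPLE_WEIGHTS) -> float:
    """Weighted average of 0-100 layer scores, skipping layers that did not run (None)."""
    active = {
        name: score
        for name, score in layer_scores.items()
        if score is not None and name in weights
    }
    total_weight = sum(weights[name] for name in active)
    if total_weight == 0:
        return 0.0
    score = sum(weights[name] * value for name, value in active.items()) / total_weight
    # Clamp to the documented 0-100 range.
    return round(min(100.0, max(0.0, score)), 2)


# The domain layer is inactive, so its weight is excluded from the average.
print(composite_score({"content": 80.0, "headers": 100.0, "domain": None}))
```

Dividing by the sum of active weights keeps the result on the 0-100 scale even when a layer is skipped, for example when no headers are submitted.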

Risk Level Tiers

LOW (<25), MEDIUM (25-49), HIGH (50-74), CRITICAL (75+). Tier boundaries match industry-standard SOC triage thresholds for email security workflows.
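The tier mapping can be written as a short function. Because scam_score is fractional (for example 83.38) while the ranges above are given as integers, this sketch assumes each tier starts at its lower bound and runs up to the next one, so 49.5 counts as MEDIUM.

```python
def risk_level(scam_score: float) -> str:
    """Map a 0-100 composite score to the documented tier names."""
    if scam_score >= 75:
        return "CRITICAL"
    if scam_score >= 50:
        return "HIGH"
    if scam_score >= 25:
        return "MEDIUM"
    return "LOW"


print(risk_level(83.38))
```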

Machine-Readable Flags

The flags array in each sub-analysis contains uppercase string constants ideal for programmatic routing and SIEM integration.
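One way such flags might drive routing is sketched below. The flag names (`DMARC_FAIL`, `REPLY_TO_MISMATCH`) and destinations are hypothetical, since this page does not list the actual constants; only the shape of the response (sub-analyses with a `flags` array) comes from the documentation above.

```python
# Hypothetical flag names and destinations, listed in priority order.
ROUTES = {
    "DMARC_FAIL": "quarantine",
    "REPLY_TO_MISMATCH": "analyst_review",
}


def collect_flags(scan_result: dict) -> set:
    """Gather the flags arrays from every sub-analysis in a scan response."""
    flags = set()
    for section in scan_result.values():
        if isinstance(section, dict):
            flags.update(section.get("flags", []))
    return flags


def route(scan_result: dict, default: str = "deliver") -> str:
    """Return the destination of the first matching flag, else the default."""
    flags = collect_flags(scan_result)
    for flag, destination in ROUTES.items():
        if flag in flags:
            return destination
    return default
```

Matching on exact string constants keeps routing rules stable even if the human-readable labels change wording.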

Homoglyph Detection

The email_verification layer compares the sender domain character-by-character against a curated list of major brands, detecting substitutions like 1->l, 0->o, and Unicode lookalikes.
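A simplified sketch of this comparison: normalize known lookalike characters, then check whether the result matches a brand that the original label did not. The brand list and substitution table here are tiny illustrative stand-ins for the curated lists mentioned above, and the function names are hypothetical.

```python
import unicodedata
from typing import Optional

BRANDS = ("paypal", "google", "microsoft")  # illustrative subset only

# Lookalike -> intended character (digits plus a few Cyrillic letters).
SUBSTITUTIONS = str.maketrans({
    "1": "l",
    "0": "o",
    "\u0430": "a",  # Cyrillic a
    "\u0435": "e",  # Cyrillic e
    "\u043e": "o",  # Cyrillic o
})


def normalize_label(label: str) -> str:
    """Fold compatibility forms and case, then undo known substitutions."""
    return unicodedata.normalize("NFKC", label).lower().translate(SUBSTITUTIONS)


def spoofed_brand(domain: str) -> Optional[str]:
    """Return the brand a domain imitates, or None if it does not imitate one."""
    labels = domain.lower().rstrip(".").split(".")
    # Use the label left of the TLD; multi-part suffixes like co.uk are not handled.
    label = labels[-2] if len(labels) >= 2 else labels[0]
    normalized = normalize_label(label)
    if normalized in BRANDS and label not in BRANDS:
        return normalized
    return None


print(spoofed_brand("paypa1.com"))
```

The genuine domain returns None because its label already equals the brand; only labels that become a brand after normalization are reported.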

Human-Readable Labels

The top-level labels array condenses the most significant findings into short strings suitable for display in email clients, browser extensions, or end-user dashboards.

Request API Access

Designed to support researchers, defenders, and builders fighting phishing at every level.

Researchers & Students

Free tier with generous rate limits for academic use, coursework, and thesis research. No credit card required. Cite us in your paper.

Free · Academic

Security Teams

Custom rate limits, production SLA guarantees, and dedicated support for security operations centers and enterprise email security integrations.

Custom · Production

Open Source Projects

Free access for qualifying open source security tools, email clients, and community projects that help protect users from phishing.

Free · OSS

Contact

Developed at AAMU Cybersecurity Lab · NSF Funded

Stop phishing emails before they land.
