mirror of
https://github.com/mukul975/Anthropic-Cybersecurity-Skills.git
synced 2026-06-14 06:54:57 +03:00
c47eed6a64
- Fix 25 shell=True subprocess calls with list-based commands - Fix 49 verify=False in defensive skills (env-var override) - Add timeout to 231 HTTP/subprocess/socket calls - Fix 6 SQL injection patterns with whitelist validation - Replace 8 __import__() with standard imports - Remove 701 unused imports across 442 files - Add authorized-testing disclaimers to all offensive skills - Complete 11 incomplete skill directories - Expand 10 stub SKILL.md files with full content - Fix 2 YAML parse errors in frontmatter - Fix 5 pre-existing syntax errors - Convert 22 hardcoded paths/ports to environment variables - Back up 21 redundant skill pairs to .bak - Fix 2 global declaration errors - 724/724 skills with full folder anatomy (SKILL.md + agent.py + api-reference.md + LICENSE) - 0 compile errors across all 724 agent.py files
4.9 KiB
4.9 KiB
API Reference: Performing AI-Driven OSINT Correlation
CLI Usage
# Correlate Sherlock + theHarvester results
python agent.py --target "targetdomain.com" \
--sherlock sherlock-results.csv \
--harvester harvester-results.json \
-o correlation_report.json
# Full multi-source correlation
python agent.py --target "john.doe" \
--sherlock sherlock.csv \
--harvester harvester.json \
--spiderfoot spiderfoot.json \
--breach breach-results.json \
-o report.json \
--markdown intelligence-profile.md
# Normalize only (no correlation)
python agent.py --sherlock sherlock.csv --harvester harvester.json \
--normalize-only -o normalized.json
# Load pre-normalized generic findings
python agent.py --generic normalized_findings.json -o report.json
Supported Data Sources
| Source | Flag | Input Format | Data Extracted |
|---|---|---|---|
| Sherlock | --sherlock |
CSV or text | Usernames, social profile URLs, platforms |
| theHarvester | --harvester |
JSON | Emails, hostnames, IP addresses |
| SpiderFoot | --spiderfoot |
JSON | Mixed OSINT findings (200+ module types) |
| Breach/HIBP | --breach |
JSON | Breach names, dates, data classes |
| Generic | --generic |
JSON array | Any pre-normalized findings |
Input File Formats
Sherlock CSV Format
username,name,url_user,exists,http_status
johndoe,GitHub,https://github.com/johndoe,Claimed,200
johndoe,Twitter,https://twitter.com/johndoe,Claimed,200
theHarvester JSON Format
{
"emails": ["john@targetdomain.com", "admin@targetdomain.com"],
"hosts": ["mail.targetdomain.com", "vpn.targetdomain.com"],
"ips": ["203.0.113.10", "203.0.113.11"]
}
SpiderFoot JSON Format
[
{"type": "EMAILADDR", "data": "john@targetdomain.com", "module": "sfp_hunter"},
{"type": "IP_ADDRESS", "data": "203.0.113.10", "module": "sfp_dnsresolve"},
{"type": "SOCIAL_MEDIA", "data": "https://github.com/johndoe", "module": "sfp_github"}
]
Breach/HIBP JSON Format
[
{
"Name": "ExampleBreach",
"BreachDate": "2023-06-15",
"DataClasses": ["Email addresses", "Passwords", "Usernames"]
}
]
Correlation Confidence Scoring
| Factor | Weight | Description |
|---|---|---|
| Exact email match | 0.95 | Same email found across multiple sources |
| Breach email match | 0.90 | Email found in breach database |
| Exact username match | 0.85 | Same username across multiple platforms |
| Same IP infrastructure | 0.70 | Shared IP address or hosting |
| Domain match | 0.60 | Shared domain registration or hosting |
| Similar username | 0.45 | Partial username overlap with shared metadata |
| Temporal co-registration | 0.40 | Accounts created within similar timeframe |
Cross-source corroboration increases confidence: +0.15 per additional source, capped at 0.95.
Report Output Schema
{
"meta": {
"target": "targetdomain.com",
"generated_at": "2026-03-19T12:00:00+00:00",
"sources_used": ["sherlock", "theHarvester", "spiderfoot", "breach_database"],
"total_findings": 247,
"total_entities": 12
},
"identifiers": {
"usernames": ["johndoe", "jdoe"],
"emails": ["john@targetdomain.com"],
"domains": ["targetdomain.com"],
"ip_addresses": ["203.0.113.10"],
"urls": ["https://github.com/johndoe"]
},
"entities": [
{
"identifier": "johndoe",
"identifier_type": "user",
"confidence": 0.92,
"sources": ["sherlock", "theHarvester", "breach_database"],
"source_count": 3,
"linked_accounts": [
{"source": "sherlock", "platform": "GitHub", "url": "https://github.com/johndoe"}
],
"flags": ["Exposed in 2 breach(es)"],
"risk_level": "high"
}
],
"risk_summary": {
"high_risk": 2,
"medium_risk": 5,
"low_risk": 5
}
}
Markdown Report Output
The --markdown flag generates an intelligence profile in Markdown containing:
- Target metadata and source summary
- Risk summary table
- Entity profiles with linked accounts, confidence scores, and risk flags
OSINT Tool Commands (Data Collection)
# Sherlock: enumerate username across platforms
sherlock "targetuser" --output sherlock.csv --csv
# theHarvester: harvest emails and subdomains
theHarvester -d targetdomain.com -b all -f harvester.json
# SpiderFoot: passive scan via REST API
curl -s http://localhost:5001/api/scan/start \
-d "scanname=recon&scantarget=targetdomain.com&usecase=passive"
# HIBP: check email breach exposure
curl -s -H "hibp-api-key: ${HIBP_KEY}" -H "User-Agent: OSINT-Agent" \
"https://haveibeenpwned.com/api/v3/breachedaccount/target@example.com" \
-o breach.json
References
- Sherlock Project: https://github.com/sherlock-project/sherlock
- theHarvester: https://github.com/laramies/theHarvester
- SpiderFoot: https://github.com/smicallef/spiderfoot
- HIBP API: https://haveibeenpwned.com/API/v3
- Maltego: https://www.maltego.com/
- LOLBAS for graph visualization: https://lolbas-project.github.io/