Add folder anatomy (scripts/agent.py + references/api-reference.md) for 648 cybersecurity skills

Complete skill folder anatomy across all cybersecurity skills:
- scripts/agent.py: 80-150 line Python agents using real libraries (impacket,
  boto3, azure-mgmt-*, kubernetes, pefile, yara, scapy, shodan, stix2, etc.)
- references/api-reference.md: real API documentation with method signatures
- LICENSE: MIT license for all skill folders
This commit is contained in:
mukul975
2026-03-10 21:02:12 +01:00
parent c74d52fa30
commit 27c6414ca5
1390 changed files with 106806 additions and 0 deletions
@@ -0,0 +1,21 @@
MIT License
Copyright (c) 2025 Anthropic Agent Skills Contributors
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.
@@ -0,0 +1,37 @@
---
name: analyzing-powershell-script-block-logging
description: >-
Parse Windows PowerShell Script Block Logs (Event ID 4104) from EVTX files to detect obfuscated
commands, encoded payloads, and living-off-the-land techniques. Uses python-evtx to extract and
reconstruct multi-block scripts, applies entropy analysis and pattern matching for Base64-encoded
commands, Invoke-Expression abuse, download cradles, and AMSI bypass attempts.
---
## Instructions
1. Install dependencies: `pip install python-evtx lxml`
2. Collect PowerShell Operational logs: `Microsoft-Windows-PowerShell%4Operational.evtx`
3. Parse Event ID 4104 entries using python-evtx to extract ScriptBlockText, ScriptBlockId, and MessageNumber/MessageTotal for multi-part script reconstruction.
4. Apply detection heuristics:
- Base64-encoded commands (`-EncodedCommand`, `FromBase64String`)
- Download cradles (`DownloadString`, `DownloadFile`, `Invoke-WebRequest`, `Net.WebClient`)
- AMSI bypass patterns (`AmsiUtils`, `amsiInitFailed`)
- Obfuscation indicators (high entropy, tick-mark insertion, string concatenation)
5. Generate a report with reconstructed scripts, risk scores, and MITRE ATT&CK mappings.
```bash
python scripts/agent.py --evtx-file /path/to/PowerShell-Operational.evtx --output ps_analysis.json
```
## Examples
### Detect Encoded Command Execution
```python
import base64
if "-encodedcommand" in script_text.lower():
encoded = script_text.split()[-1]
decoded = base64.b64decode(encoded).decode("utf-16-le")
```
### Reconstruct Multi-Block Script
Scripts split across multiple 4104 events share a `ScriptBlockId`. Concatenate blocks ordered by `MessageNumber` to recover the full script.
@@ -0,0 +1,59 @@
# API Reference: PowerShell Script Block Logging Analysis
## python-evtx Library
### FileHeader
```python
from Evtx.Evtx import FileHeader
with open(evtx_path, "rb") as f:
fh = FileHeader(f)
for record in fh.records():
xml_string = record.xml() # Returns XML string of the event
```
### Event XML Structure (Event ID 4104)
```xml
<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
<System>
<EventID>4104</EventID>
<TimeCreated SystemTime="2024-01-15T10:30:00.000Z"/>
</System>
<EventData>
<Data Name="MessageNumber">1</Data>
<Data Name="MessageTotal">3</Data>
<Data Name="ScriptBlockText">...powershell code...</Data>
<Data Name="ScriptBlockId">guid-string</Data>
<Data Name="Path">C:\script.ps1</Data>
</EventData>
</Event>
```
## lxml etree Parsing
```python
from lxml import etree
NS = {"evt": "http://schemas.microsoft.com/win/2004/08/events/event"}
root = etree.fromstring(xml_bytes)
event_id = root.find(".//evt:System/evt:EventID", NS).text
data_elems = root.findall(".//evt:EventData/evt:Data", NS)
for elem in data_elems:
name = elem.get("Name")
value = elem.text
```
## Script Block Reconstruction
Large PowerShell scripts are split across multiple Event 4104 entries:
- `ScriptBlockId`: Unique GUID shared across all parts
- `MessageNumber`: Part index (1-based)
- `MessageTotal`: Total number of parts
- Reconstruct: concatenate parts ordered by MessageNumber
## Key Detection Patterns
| Pattern | MITRE | Risk |
|---------|-------|------|
| `-EncodedCommand` | T1059.001 | High |
| `FromBase64String` | T1140 | High |
| `Invoke-Expression` / `iex` | T1059.001 | High |
| `DownloadString` / `Net.WebClient` | T1105 | Critical |
| `AmsiUtils` / `amsiInitFailed` | T1562.001 | Critical |
| `Invoke-Mimikatz` | T1003 | Critical |
| High entropy (>5.5) | T1027 | Medium |
@@ -0,0 +1,195 @@
#!/usr/bin/env python3
"""PowerShell Script Block Logging Analyzer - Parses Event 4104 from EVTX for obfuscated commands."""
import json
import math
import re
import base64
import logging
import argparse
from collections import defaultdict
from datetime import datetime
from Evtx.Evtx import FileHeader
from lxml import etree
logging.basicConfig(level=logging.INFO, format="%(asctime)s [%(levelname)s] %(message)s")
logger = logging.getLogger(__name__)
NS = {"evt": "http://schemas.microsoft.com/win/2004/08/events/event"}
SUSPICIOUS_PATTERNS = [
(r"(?i)\-[Ee]ncoded[Cc]ommand", "Encoded command parameter", "T1059.001", "high"),
(r"(?i)FromBase64String", "Base64 decoding", "T1140", "high"),
(r"(?i)(Invoke-Expression|iex)\s*\(", "Invoke-Expression execution", "T1059.001", "high"),
(r"(?i)(DownloadString|DownloadFile|Invoke-WebRequest|wget|curl)", "Download cradle", "T1105", "critical"),
(r"(?i)(Net\.WebClient|WebRequest\.Create)", "Network client instantiation", "T1071.001", "high"),
(r"(?i)(AmsiUtils|amsiInitFailed|AmsiScanBuffer)", "AMSI bypass attempt", "T1562.001", "critical"),
(r"(?i)(Invoke-Mimikatz|Invoke-Kerberoast|Invoke-TokenManipulation)", "Offensive PowerShell tool", "T1003", "critical"),
(r"(?i)(Add-MpPreference\s*-ExclusionPath)", "Defender exclusion", "T1562.001", "high"),
(r"(?i)(Set-MpPreference\s*-DisableRealtimeMonitoring)", "Defender disable", "T1562.001", "critical"),
(r"(?i)(New-Object\s+System\.Net\.Sockets\.TCPClient)", "Reverse shell pattern", "T1059.001", "critical"),
(r"(?i)(Get-Process\s+lsass|MiniDump)", "LSASS dump attempt", "T1003.001", "critical"),
(r"(?i)(ConvertTo-SecureString|PSCredential)", "Credential handling", "T1078", "medium"),
]
def calculate_entropy(text):
"""Calculate Shannon entropy of a string to detect obfuscation."""
if not text:
return 0.0
freq = defaultdict(int)
for char in text:
freq[char] += 1
length = len(text)
return -sum((count / length) * math.log2(count / length) for count in freq.values())
def parse_evtx_4104(evtx_path):
"""Parse Event ID 4104 entries from a PowerShell Operational EVTX file."""
script_blocks = defaultdict(dict)
with open(evtx_path, "rb") as f:
fh = FileHeader(f)
for record in fh.records():
try:
xml = record.xml()
root = etree.fromstring(xml.encode("utf-8"))
event_id_elem = root.find(".//evt:System/evt:EventID", NS)
if event_id_elem is None or event_id_elem.text != "4104":
continue
event_data = {}
for data_elem in root.findall(".//evt:EventData/evt:Data", NS):
name = data_elem.get("Name", "")
event_data[name] = data_elem.text or ""
script_block_id = event_data.get("ScriptBlockId", "")
message_number = int(event_data.get("MessageNumber", "1"))
message_total = int(event_data.get("MessageTotal", "1"))
script_text = event_data.get("ScriptBlockText", "")
time_elem = root.find(".//evt:System/evt:TimeCreated", NS)
timestamp = time_elem.get("SystemTime", "") if time_elem is not None else ""
if script_block_id not in script_blocks:
script_blocks[script_block_id] = {
"parts": {},
"total": message_total,
"timestamp": timestamp,
"path": event_data.get("Path", ""),
}
script_blocks[script_block_id]["parts"][message_number] = script_text
except Exception:
continue
logger.info("Parsed %d unique script blocks from %s", len(script_blocks), evtx_path)
return script_blocks
def reconstruct_scripts(script_blocks):
"""Reconstruct full scripts from multi-part script block entries."""
reconstructed = []
for block_id, block_data in script_blocks.items():
parts = block_data["parts"]
total = block_data["total"]
ordered = [parts.get(i, "") for i in range(1, total + 1)]
full_script = "".join(ordered)
reconstructed.append({
"script_block_id": block_id,
"timestamp": block_data["timestamp"],
"path": block_data["path"],
"part_count": total,
"script_text": full_script,
"length": len(full_script),
})
logger.info("Reconstructed %d complete scripts", len(reconstructed))
return reconstructed
def decode_base64_commands(script_text):
"""Attempt to decode Base64-encoded command strings found in scripts."""
decoded_commands = []
b64_pattern = re.compile(r"[A-Za-z0-9+/=]{40,}")
for match in b64_pattern.finditer(script_text):
try:
decoded = base64.b64decode(match.group()).decode("utf-16-le", errors="ignore")
if any(c.isalpha() for c in decoded[:20]):
decoded_commands.append({"encoded": match.group()[:60], "decoded": decoded[:500]})
except Exception:
continue
return decoded_commands
def analyze_script(script_entry):
"""Analyze a single reconstructed script for suspicious patterns."""
text = script_entry["script_text"]
findings = []
for pattern, description, mitre, severity in SUSPICIOUS_PATTERNS:
matches = re.findall(pattern, text)
if matches:
findings.append({
"pattern": description,
"mitre_technique": mitre,
"severity": severity,
"match_count": len(matches),
"sample": matches[0][:100] if matches else "",
})
entropy = calculate_entropy(text)
if entropy > 5.5 and len(text) > 200:
findings.append({
"pattern": "High entropy (possible obfuscation)",
"mitre_technique": "T1027",
"severity": "medium",
"entropy": round(entropy, 2),
})
decoded = decode_base64_commands(text)
if decoded:
findings.append({
"pattern": "Base64-encoded content decoded",
"mitre_technique": "T1140",
"severity": "high",
"decoded_count": len(decoded),
"samples": decoded[:3],
})
return findings
def generate_report(scripts, all_findings):
"""Generate PowerShell script block analysis report."""
critical = sum(1 for f in all_findings if any(ff["severity"] == "critical" for ff in f["findings"]))
report = {
"timestamp": datetime.utcnow().isoformat(),
"total_scripts": len(scripts),
"suspicious_scripts": len([f for f in all_findings if f["findings"]]),
"critical_scripts": critical,
"findings": all_findings[:50],
}
print(f"PS SCRIPT BLOCK REPORT: {len(scripts)} scripts, {critical} critical")
return report
def main():
parser = argparse.ArgumentParser(description="PowerShell Script Block Logging Analyzer")
parser.add_argument("--evtx-file", required=True, help="Path to PowerShell Operational EVTX")
parser.add_argument("--output", default="ps_analysis.json")
args = parser.parse_args()
blocks = parse_evtx_4104(args.evtx_file)
scripts = reconstruct_scripts(blocks)
all_findings = []
for script in scripts:
findings = analyze_script(script)
if findings:
all_findings.append({
"script_block_id": script["script_block_id"],
"timestamp": script["timestamp"],
"path": script["path"],
"length": script["length"],
"findings": findings,
"script_preview": script["script_text"][:300],
})
report = generate_report(scripts, all_findings)
with open(args.output, "w") as f:
json.dump(report, f, indent=2)
logger.info("Report saved to %s", args.output)
if __name__ == "__main__":
main()