Malicious PDF — malware analysis report

Static analysis result for SHA-256 470a2d27e395bfdd…

MALICIOUS

PDF

16.9 KB
MD5: 9d72883195c5840abbb72d9a5dc2f8f3
SHA-1: 7eeffcb98fae34272f562deadddbbe7ec279d2a0
SHA-256: 470a2d27e395bfddd232f5961c57e90a877abefdafaedda023438b6e3370e722
Risk Score: 234
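
To confirm that a local copy is the same file this report describes, the three digests can be recomputed in a single pass. The following is a minimal sketch using Python's standard hashlib module; the names `file_digests` and `matches_report` are illustrative and not part of the report or of any scanner.

```python
import hashlib

# SHA-256 of the sample, copied from the report header.
EXPECTED_SHA256 = "470a2d27e395bfddd232f5961c57e90a877abefdafaedda023438b6e3370e722"


def file_digests(path, chunk_size=65536):
    """Return MD5, SHA-1 and SHA-256 hex digests of a file, read in chunks."""
    hashers = {name: hashlib.new(name) for name in ("md5", "sha1", "sha256")}
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files are never loaded whole.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            for hasher in hashers.values():
                hasher.update(chunk)
    return {name: hasher.hexdigest() for name, hasher in hashers.items()}


def matches_report(path):
    """True if the file's SHA-256 equals the digest listed in the report."""
    return file_digests(path)["sha256"] == EXPECTED_SHA256
```

Comparing only the SHA-256 is a deliberate choice here: MD5 and SHA-1 are useful for cross-referencing older feeds but are not collision resistant.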

Malware Insights

MITRE ATT&CK
T1059.007 JavaScript/JScript
T1204.001 Malicious Link/URL
T1140 Deobfuscate/Decode Files or Information

The PDF contains heavily obfuscated JavaScript, flagged by several heuristics including 'PDF_JS_OBFUSCATED_DROPPER' and 'PDF_ANNOT_SUBJECT_CALLEE_HEX_STAGER'. The script uses String.fromCharCode and hex decoding to evade static analysis. After deobfuscation, the final stage calls 'Collab.collectEmailInfo', the method associated with CVE-2007-5659, a buffer overflow in Adobe Reader. The staged structure suggests a multi-stage attack designed to download and execute further malicious content.
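
The two decoding primitives named in this summary can often be undone without running the script. The sketch below assumes the simplest forms only: `String.fromCharCode` calls whose arguments are literal decimal or `0x` hex integers, and a payload stored as plain two-digit hex pairs. The function names are hypothetical, and the sample's real layers (for example the callee-derived key) are not reproduced here.

```python
import re

# Matches String.fromCharCode(...) whose arguments look like integer literals.
FROM_CHAR_CODE = re.compile(r"String\.fromCharCode\(\s*([0-9xXa-fA-F,\s]+?)\s*\)")


def _parse_code(token):
    """Parse a decimal or 0x-prefixed integer literal (legacy octal is not handled)."""
    token = token.strip()
    return int(token, 16) if token.lower().startswith("0x") else int(token, 10)


def fold_from_char_code(js_source):
    """Replace String.fromCharCode(<literal ints>) with a quoted string literal."""
    def replace(match):
        try:
            codes = [_parse_code(tok) for tok in match.group(1).split(",")]
        except ValueError:
            return match.group(0)  # not pure literals: leave the call untouched
        # JavaScript truncates each code unit to 16 bits.
        text = "".join(chr(code & 0xFFFF) for code in codes)
        return '"' + text.replace("\\", "\\\\").replace('"', '\\"') + '"'
    return FROM_CHAR_CODE.sub(replace, js_source)


def decode_hex_pairs(hex_text):
    """Decode a string of two-digit hex pairs such as '6576616c' into text."""
    digits = "".join(hex_text.split())  # drop whitespace only
    return bytes.fromhex(digits).decode("latin-1")
```

For example, `fold_from_char_code('var a = String.fromCharCode(101, 118, 97, 108);')` yields `var a = "eval";`, which is the kind of indirectly built method name the heuristics describe.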

Heuristics (9)

  • Collab.collectEmailInfo: CVE-2007-5659 [critical, CVE, exact] CVE_2007_5659
    PDF JavaScript calls Collab.collectEmailInfo. CVE-2007-5659 is a buffer overflow in Adobe Reader triggered by a long argument or heap-sprayed message field passed to Collab.collectEmailInfo(). Part of a series of Acrobat JS API exploits. (identified after JavaScript deobfuscation)
  • ClamAV: Pdf.Exploit.Agent-35901 [critical] CLAMAV_DETECTION
    ClamAV detected this file as malware: Pdf.Exploit.Agent-35901
  • Annotation subject callee-key hex JavaScript stager [high] PDF_ANNOT_SUBJECT_CALLEE_HEX_STAGER
    PDF JavaScript uses syncAnnotScan()/getAnnots() to read an indirect annotation /Subject stream, percent-decodes it through marker replacement, then uses a callee.toString()-derived key to decode and eval the final exploit stage.
  • Obfuscated multi-stage PDF JavaScript dropper [high] PDF_JS_OBFUSCATED_DROPPER
    PDF JavaScript shows 4 independent signals of exploit-kit-style multi-stage obfuscation: annot_subject_stage, hex_codec_loop, incremental_eval_build, repeated_pluginschk. This is strongly consistent with pre-2011 Adobe Reader PDF droppers: OpenAction JS reads encoded data from annotation subjects, decodes it through one or more hex / base-N loops, and invokes eval indirectly (method name built one character at a time). The actual CVE is hidden in the final decoded layer and is not visible in the raw stream without deobfuscation.
  • JavaScript action [low] PDF_JAVASCRIPT
    PDF contains a /JavaScript action. Generic JavaScript is common in benign forms; specific dangerous APIs are scored by separate rules.
  • Embedded JS stream [low] PDF_JS
    PDF references a /JS stream. Generic JavaScript is common in benign forms; specific dangerous APIs are scored by separate rules.
  • String.fromCharCode [low] PDF_FROMCHARCODE
    String.fromCharCode found, used to construct payload strings dynamically. Common in benign JavaScript libraries for codepoint manipulation, so this alone is informational; weaponised use is also caught by the dedicated fromCharCode-stage and exploit-shape rules. (matched inside decoded stream)
  • syncAnnotScan annotation-staging primitive [low] PDF_FOXIT_SYNCANNOTSCAN
    PDF JavaScript calls syncAnnotScan(), a no-op annotation-enumeration primitive used by exploit-kit JavaScript to stage payload reads from annotation /Subject fields before eval(). Not a vulnerable sink itself; rarely seen in legitimate PDFs. (matched in decompressed stream)
  • Suspicious extracted artifact [info] EXTRACTED_FILE_STATIC_TRIAGE
    One or more files extracted from inside this sample matched static suspicious-content checks such as script obfuscation, encoded payload blobs, packed data, or execution/download terms.
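
The annotation-subject stage described in the list above can be partly reproduced offline. This is a rough sketch of the marker-replacement step only: it assumes the /Subject text uses some stand-in marker in place of `%`, restores it, and then applies an approximation of JavaScript's `unescape()`. The marker value, the function names, and the sink list are assumptions for illustration; the callee.toString()-derived key step is not shown because the report does not describe how the key is derived.

```python
import re

# %uXXXX (UTF-16 code unit) or %XX (single byte), as handled by JS unescape().
_ESCAPE = re.compile(r"%u([0-9a-fA-F]{4})|%([0-9a-fA-F]{2})")

# APIs named in this report as exploit sinks; extend as needed.
KNOWN_SINKS = ("Collab.collectEmailInfo",)


def js_unescape(text):
    """Approximate JavaScript unescape(): decode %uXXXX and %XX sequences."""
    def replace(match):
        return chr(int(match.group(1) or match.group(2), 16))
    return _ESCAPE.sub(replace, text)


def decode_marker_stage(subject, marker):
    """Swap the stand-in marker back to '%' and percent-decode the result."""
    return js_unescape(subject.replace(marker, "%"))


def find_known_sinks(js_source):
    """Return the known sink names that appear verbatim in decoded JavaScript."""
    return [name for name in KNOWN_SINKS if name in js_source]
```

A sink search like `find_known_sinks` only becomes useful after decoding, which is why the CVE rule in this report is marked as identified after JavaScript deobfuscation.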

Extracted artifacts (4)

Files carved from inside the sample during analysis.

javascript_obj0009_000.js
  Hash: 4718a27c2224fc36bf24f8e8e04598f1ad78adce4401c7be2708318738a6983d
  Kind: pdf-javascript-stream
  Source: PDF /JS object 9 at offset 0x40F0
  Size: 469 bytes
  Detection: ClamAV: No threats found
  Obfuscation or payload: likely
  Notes: Carved artifact contains 1 eval/decoder/string-building token(s).

annotation_subject_callee_hex_stage_000.js
  Hash: d5fde943c214e826ca75821b3ab67c55a5dad28273e14aa113bebfff7e494a0f
  Kind: deobfuscated-js
  Source: annotation-subject callee-key decoded JavaScript at offset 0x1A4A
  Size: 5289 bytes
  Detection: ClamAV: No threats found
  Obfuscation or payload: likely
  Notes: Carved artifact contains 5 eval/decoder/string-building token(s).

legacy_pdfkit_stage_000.js
  Hash: 16dfa3819dc73cbae3a7e497c797c79a80fa1e93618cf1db6cf1bdfac13a1fc4
  Kind: deobfuscated-js
  Source: repeated-marker hex decoded JavaScript at offset 0x1A9E
  Size: 11972 bytes
  Detection: ClamAV: No threats found
  Obfuscation or payload: likely
  Notes: Carved artifact contains 1 eval/decoder/string-building token(s). Carved artifact contains 1 long base64-like blob(s).

deobfuscated.js
  Hash: bc90fcd30aabc1bb6285c9d17175659a3bf7c05f4724c5a06b5b176ce8b2a82c
  Kind: deobfuscated-js
  Source: PDF JavaScript deobfuscation pass
  Size: 86215 bytes
  Detection: ClamAV: No threats found
  Obfuscation or payload: likely
  Notes: Carved artifact contains 10 eval/decoder/string-building token(s). Carved artifact contains 3 long base64-like blob(s).
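
The per-artifact notes count eval/decoder/string-building tokens and long base64-like blobs. The report does not state which tokens are counted or what length makes a blob "long", so the sketch below picks its own values: the token list and the 200-character threshold are assumptions, and `triage` is a hypothetical name rather than the scanner's actual routine.

```python
import re

# Assumed token list; the scanner's real list is not given in the report.
TOKEN_PATTERN = re.compile(
    r"\b(eval|unescape|fromCharCode|atob|parseInt|charCodeAt)\b"
)

# Assumed threshold: a run of 200 or more base64-alphabet characters.
BASE64_BLOB = re.compile(r"[A-Za-z0-9+/]{200,}={0,2}")


def triage(js_source):
    """Count decoder-style tokens and long base64-like runs in carved JavaScript."""
    tokens = TOKEN_PATTERN.findall(js_source)
    blobs = BASE64_BLOB.findall(js_source)
    return {
        "token_count": len(tokens),
        "blob_count": len(blobs),
        "likely_obfuscated": bool(tokens or blobs),
    }
```

Counts like these are weak signals on their own, which matches the report: every carved artifact is marked "likely" obfuscated while ClamAV finds no threat in any of them individually.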