Malicious PDF — malware analysis report

Static analysis result for SHA-256 117eeee128f395b2…

MALICIOUS

PDF

Size: 99.3 KB
Created: 2008-04-24 17:46:14 +08:00
Authoring application: Acrobat PDFMaker 8.1 for Word (via Acrobat Distiller 8.1.0 (Windows))
MD5: 8aaecfec24d2a9b2851e43db45a35273
SHA-1: 2e36d506239a4e8c0eed8f9dcebc346f44ea1cfb
SHA-256: 117eeee128f395b200ad5170e3dc44d9556901a64bcb50427d08580acb34530c
Risk Score: 362

Malware Insights

MITRE ATT&CK
  • T1203 Exploitation for Client Execution
  • T1059.007 JavaScript
  • T1566.001 Spearphishing Attachment

This PDF document contains embedded JavaScript that exploits CVE-2007-5659 (Collab.collectEmailInfo) to achieve code execution. The JavaScript is heavily obfuscated and appears to be a stager designed to download and execute a secondary payload. The presence of multiple JavaScript exploit clusters and a secondary embedded PDF further indicates malicious intent.
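
As an illustration of how a call such as Collab.collectEmailInfo can be located when the JavaScript sits inside compressed streams, the sketch below inflates every stream body that zlib accepts and searches the result for the API name. It is a simplified approximation, not the analyzer's implementation: it ignores /Length, filter chains other than Flate, object streams, and encryption, and the offsets it reports are the start of each stream body, which may not be how the report computes its offsets.

```python
import re
import zlib

# Stream bodies sit between the "stream" and "endstream" keywords.
# The lookbehind stops the "stream" inside "endstream" from matching.
STREAM_RE = re.compile(rb"(?<!end)stream\r?\n(.*?)endstream", re.DOTALL)


def inflate_streams(pdf_bytes: bytes):
    """Yield (body_offset, inflated_bytes) for each stream zlib can inflate."""
    for match in STREAM_RE.finditer(pdf_bytes):
        try:
            # decompressobj tolerates the trailing end-of-line bytes that
            # precede "endstream"; they end up in unused_data.
            data = zlib.decompressobj().decompress(match.group(1))
        except zlib.error:
            continue  # not Flate-compressed, or corrupt
        yield match.start(1), data


def find_api_calls(pdf_bytes: bytes, needle: bytes = b"Collab.collectEmailInfo") -> list:
    """Return body offsets of inflated streams that contain `needle`."""
    return [offset for offset, data in inflate_streams(pdf_bytes) if needle in data]
```

A plain substring search like this misses calls that only appear after deobfuscation, which is why the recovered stages listed under the extracted artifacts matter.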

Machine Learning

  • Nyx PDF Classifier malicious score 0.9998

Heuristics 13

  • Collab.collectEmailInfo (CVE-2007-5659) [severity: critical; CVE exact; rule: CVE_2007_5659]
    PDF JavaScript calls Collab.collectEmailInfo — CVE-2007-5659 is a buffer overflow in Adobe Reader triggered by a long argument or heap-sprayed message field passed to Collab.collectEmailInfo(). Part of a series of Acrobat JS API exploits. (matched in decompressed stream)
  • getAnnots heap-spray JavaScript stager [severity: high; CVE related; rule: PDF_JS_GETANNOTS_HEAPSPRAY_STAGER]
    PDF JavaScript calls getAnnots() in the same staged context as classic Adobe Reader heap-spray shellcode and an embedded payload. This is CVE-2009-1492-related evidence, but is not reported as an exact CVE because the getAnnots argument is not the distinctive integer-overflow or long-string trigger shape.
  • PDF JavaScript exploit cluster [severity: critical; rule: PDF_JS_EXPLOIT_CLUSTER]
    PDF combines an executable JavaScript/action surface with exploit staging indicators such as eval/unescape/fromCharCode, XFA script content, or a related CVE pattern. Benign form JavaScript remains low-severity, but this correlated cluster is high-confidence malicious behavior.
  • Secondary embedded PDF body has suspicious static findings [severity: critical; rule: POLYGLOT_CHILD_PDF_STATIC_TRIAGE]
    A valid PDF body was found at a nonzero offset inside another container and its carved contents matched PDF exploit or lure heuristics. This catches polyglots where the top-level magic routes to ZIP/OLE while a PDF reader or downstream parser opens the hidden PDF payload.
  • unescape() call [severity: high; rule: PDF_UNESCAPE]
    unescape() found — often used to decode shellcode in PDF JS exploits (matched inside decoded stream)
  • Generic recovered JavaScript exploit stage [severity: high; rule: PDF_GENERIC_STAGE_RECOVERY]
    Bounded static stage recovery exposed hidden JavaScript through generic transforms such as null-byte collapse, percent decoding, marker replacement, arithmetic character codes, fromCharCode, numeric arrays, numeric-array minus-key decoders, alphabet-index arrays, /Producer half-difference metadata arrays, hex literals, marker-stripped Base64 literals, custom 6-bit XOR table decoders, or repeated-marker hex carriers. This rule is emitted only when the recovered stage contains exploit-like Acrobat JavaScript or shellcode markers.
  • JavaScript action [severity: low; rule: PDF_JAVASCRIPT]
    PDF contains a /JavaScript action. Generic JavaScript is common in benign forms; specific dangerous APIs are scored by separate rules.
  • Embedded JS stream [severity: low; rule: PDF_JS]
    PDF references a /JS stream. Generic JavaScript is common in benign forms; specific dangerous APIs are scored by separate rules.
  • Embedded file [severity: low; rule: PDF_EMBEDDED]
    PDF embeds a file attachment — could carry an executable or another weaponised document as a nested payload
  • PDF paints image(s) but contains no text operators [severity: info; rule: PDF_IMAGE_ONLY_LURE]
    PDF has 1 image XObject and the content stream contains no text-emitting operators (BT/ET, Tj, TJ, ', ") in either raw bytes or decompressed streams. This is the screenshot-as-PDF pattern used to bypass text-based scanners and to deliver instructions purely through rendered pixels. It is informational unless paired with invisible links or risky URI context.
  • Object number defined twice with different bodies [severity: info; rule: PDF_DUPLICATE_OBJ_BODY_INCREMENTAL]
    The same indirect object (N G) is defined more than once with different body bytes. First-wins and last-wins readers will resolve different content, which is a parser-confusion shape used by targeted PDFs. Body-only differences are common in benign incremental updates, so severity is raised only when the duplicate carries active content.
  • Suspicious extracted artifact [severity: info; rule: EXTRACTED_FILE_STATIC_TRIAGE]
    One or more files extracted from inside this sample matched static suspicious-content checks such as script obfuscation, encoded payload blobs, packed data, or execution/download terms.
  • Embedded URL [severity: info; rule: EMBEDDED_URL]
    One or more URLs were extracted from the document. The URL itself is not a detection — see the per-URL labels for which channel (macro, JS, link annotation, document body, ...) reached each URL.
    URLs:
    • http://www.w3.org/1999/02/22-rdf-syntax-ns#
    • http://ns.adobe.com/pdf/1.3/
    • http://ns.adobe.com/pdfx/1.3/
    • http://ns.adobe.com/xap/1.0/
    • http://ns.adobe.com/xap/1.0/mm/
    • http://purl.org/dc/elements/1.1/

Extracted artifacts 7

Files carved from inside the sample during analysis.

  • javascript_obj0014_000.js
    Hash: 97e6c8fb70f6fedab160a41095c99dce3c9d53a0086d3a8d4e6d47cbe03dce61
    Kind: pdf-javascript-stream
    Source: PDF /JS object 14 at offset 0x4B9
    Size: 1946 bytes
  • javascript_obj0015_001.js
    Hash: c578aa723efa749365b2f21d6797c636273b024658087a83458d24a3ca6b50b4
    Kind: pdf-javascript-stream
    Source: PDF /JS object 15 at offset 0xCDD
    Size: 7556 bytes
  • stream_003_off000004b9.js
    Hash: 5c1ab2af46eef55b0d162c3a84464633475df9b138b64aa21a36ffaffbdffa88
    Kind: decompressed-pdf-stream
    Source: PDF FlateDecoded stream at offset 0x4B9
    Size: 1336 bytes
  • stream_004_off00000cdd.js
    Hash: 4a48bc9abab646adf89aeff916c4d8f8a9d218a6c1deae0033d46e9f40aaed86
    Kind: decompressed-pdf-stream
    Source: PDF FlateDecoded stream at offset 0xCDD
    Size: 3811 bytes
    Detection: ClamAV: No threats found
    Obfuscation or payload: likely
    Carved artifact contains 17 eval/decoder/string-building token(s).
  • generic_stage_recovery_000.js
    Hash: 0b9639e7243976d9b4971c2a8c1a7bc52c6a8f5114a39d409578c83d31362520
    Kind: deobfuscated-js
    Source: generic stage recovery null-collapse from combined JavaScript objects at offset 0x4B9
    Size: 4950 bytes
    Detection: ClamAV: No threats found
    Obfuscation or payload: likely
    Carved artifact contains 17 eval/decoder/string-building token(s).
  • generic_stage_recovery_001.js
    Hash: 7ad3a57cbc9aa1bd61ecee65f041e869292a8081f8469b20466fe24f77275b72
    Kind: deobfuscated-js
    Source: generic stage recovery null-collapse from JavaScript object 15 at offset 0xCDD
    Size: 3794 bytes
    Detection: ClamAV: No threats found
    Obfuscation or payload: likely
    Carved artifact contains 17 eval/decoder/string-building token(s).
  • polyglot_child_pdf_off0000b640.pdf
    Hash: 5660d7764d58476d4545d3888adc155c74c3cf592e289c52d002538bb408221c
    Kind: polyglot-child-pdf
    Source: Secondary PDF body inside pdf container at offset 0xB640
    Size: 54986 bytes
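
The secondary PDF body reported at offset 0xB640 can be located with a simple header scan. Note that 0xB640 (46,656) plus the carved size of 54,986 bytes gives 101,642 bytes, roughly the 99.3 KB reported for the whole sample, which suggests the child body runs to the end of the file. The sketch below is a minimal illustration of that idea, not the analyzer's carving logic; `find_child_pdfs` and `carve` are hypothetical names, and real carvers validate far more than the `%PDF-` and `%%EOF` markers.

```python
PDF_HEADER = b"%PDF-"
PDF_EOF = b"%%EOF"


def find_child_pdfs(data: bytes) -> list:
    """Return every offset greater than zero where a '%PDF-' header appears."""
    offsets = []
    pos = data.find(PDF_HEADER, 1)  # start at 1 to skip the top-level header
    while pos != -1:
        offsets.append(pos)
        pos = data.find(PDF_HEADER, pos + 1)
    return offsets


def carve(data: bytes, offset: int) -> bytes:
    """Carve from `offset` through the last '%%EOF' marker.

    Falls back to the end of the data when no marker follows the offset.
    """
    end = data.rfind(PDF_EOF)
    end = len(data) if end < offset else end + len(PDF_EOF)
    return data[offset:end]
```

Each carved body can then be hashed and passed through the same static checks as the parent document.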