Malicious PDF — malware analysis report

Static analysis result for SHA-256 72f742bd5133916c…

MALICIOUS

PDF

Size: 156.8 KB
Authoring application: Python PDF Library - http://pybrary.net/pyPdf/
MD5: b755f1df4f786487f1ecd7e9476eaee5
SHA-1: 737d42d3d4c94b5d5aa110d77dc3f5d39d867642
SHA-256: 72f742bd5133916c57ab2628f4f6bf1e4e908b74dccff42b533df7b388fee0e6
Risk Score: 158
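The digests listed above can be recomputed locally to confirm that a sample on disk is the file this report describes. The following is a minimal sketch using Python's standard hashlib module; the function names `file_digests` and `matches_report` are hypothetical and are not part of the analysis tooling.

```python
import hashlib


def file_digests(data: bytes) -> dict[str, str]:
    """Return the MD5, SHA-1 and SHA-256 hex digests of a byte string."""
    return {
        "md5": hashlib.md5(data).hexdigest(),
        "sha1": hashlib.sha1(data).hexdigest(),
        "sha256": hashlib.sha256(data).hexdigest(),
    }


def matches_report(data: bytes, expected: dict[str, str]) -> bool:
    """True only if every digest named in `expected` matches the data."""
    actual = file_digests(data)
    return all(actual.get(name) == value.lower() for name, value in expected.items())
```

To check a local copy, read it in binary mode (for example `open(path, "rb").read()`, where `path` is wherever the sample is stored) and pass the bytes to `matches_report` together with the three values from the report. Handle the sample only in an isolated analysis environment.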

Malware Insights

MITRE ATT&CK
  • T1059.007 JavaScript
  • T1203 Exploitation for Client Execution
  • T1566.001 Spearphishing Attachment

The PDF contains embedded JavaScript and RichMedia (Flash) content, as indicated by the PDF_JAVASCRIPT, PDF_JS, and PDF_RICHMEDIA heuristics. The Nyx PDF Classifier assigns it a malicious score of 1.0000. A secondary PDF body carved from inside the sample (polyglot_child_pdf_off00025a77.pdf) suggests a multi-stage attack, likely one in which the hidden PDF delivers further malicious code.
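The active-content indicators named above can be roughly reproduced by counting PDF name tokens in the raw bytes. The sketch below is an illustration only and is not the analyzer's implementation: the keyword list and the function name `count_pdf_keywords` are assumptions, and the scan does not look inside compressed object streams or decode names obfuscated with `#xx` hex escapes.

```python
import re

# Name tokens associated with active or embedded content (assumed list).
KEYWORDS = (b"/JavaScript", b"/JS", b"/RichMedia", b"/EmbeddedFile")


def count_pdf_keywords(data: bytes) -> dict[str, int]:
    """Count whole-name occurrences of each keyword in raw PDF bytes."""
    counts = {}
    for keyword in KEYWORDS:
        # The lookahead rejects longer names such as /JSON or /EmbeddedFiles.
        pattern = re.escape(keyword) + rb"(?![A-Za-z0-9])"
        counts[keyword.decode("ascii")] = len(re.findall(pattern, data))
    return counts
```

A nonzero count is only a triage signal. As the heuristic descriptions in this report note, generic JavaScript is also common in benign forms.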

Machine Learning

  • Nyx PDF Classifier malicious score 1.0000

Heuristics 7

  • Secondary embedded PDF body has suspicious static findings (severity: critical; rule: POLYGLOT_CHILD_PDF_STATIC_TRIAGE)
    A valid PDF body was found at a nonzero offset inside another container and its carved contents matched PDF exploit or lure heuristics. This catches polyglots where the top-level magic routes to ZIP/OLE while a PDF reader or downstream parser opens the hidden PDF payload.
  • RichMedia (Flash) (severity: high; rule: PDF_RICHMEDIA)
    PDF contains /RichMedia (Adobe Flash), which is a historic exploit vector.
  • JavaScript action (severity: low; rule: PDF_JAVASCRIPT)
    PDF contains a /JavaScript action. Generic JavaScript is common in benign forms; specific dangerous APIs are scored by separate rules.
  • Embedded JS stream (severity: low; rule: PDF_JS)
    PDF references a /JS stream. Generic JavaScript is common in benign forms; specific dangerous APIs are scored by separate rules.
  • Embedded file (severity: low; rule: PDF_EMBEDDED)
    PDF embeds a file attachment, which could carry an executable or another weaponised document as a nested payload.
  • Object number defined twice with different bodies (severity: info; rule: PDF_DUPLICATE_OBJ_BODY_INCREMENTAL)
    The same indirect object (N G) is defined more than once with different body bytes. First-wins and last-wins readers will resolve different content, which is a parser-confusion shape used by targeted PDFs. Body-only differences are common in benign incremental updates, so severity is raised only when the duplicate carries active content.
  • Embedded URL (severity: info; rule: EMBEDDED_URL)
    One or more URLs were extracted from the document. The URL itself is not a detection; see the per-URL labels for which channel (macro, JS, link annotation, document body, ...) reached each URL.
    URL: http://adobe.com/AS3/2006/builtin
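The duplicate-object finding in the list above can be illustrated with a short scan for indirect objects that share an object and generation number but differ in body bytes. This is a simplified sketch, not the rule's actual logic: the regular expression and the name `duplicate_object_bodies` are assumptions, and a stream that happens to contain the bytes `endobj` would be cut short.

```python
import re
from collections import defaultdict

# Matches "N G obj ... endobj" and captures the numbers and the body.
OBJ_RE = re.compile(rb"(?<!\d)(\d+)\s+(\d+)\s+obj\b(.*?)endobj", re.DOTALL)


def duplicate_object_bodies(data: bytes) -> dict[tuple[int, int], int]:
    """Map (object, generation) to its number of distinct bodies, when > 1."""
    bodies = defaultdict(set)
    for match in OBJ_RE.finditer(data):
        key = (int(match.group(1)), int(match.group(2)))
        bodies[key].add(match.group(3).strip())
    return {key: len(seen) for key, seen in bodies.items() if len(seen) > 1}
```

An object that is redefined with identical bytes is not reported, since only differing bodies cause first-wins and last-wins readers to disagree.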

Extracted artifacts 2

Files carved from inside the sample during analysis.

Filename: pushpro.swf
Hash: 3f8e0576de2e9af38360a78e3ca79d41fb9060cefdb628cb69589ea03d1d3358
Kind: pdf-embedded-file
Source: PDF EmbeddedFile object 7 at offset 0xB4F
Size: 772 bytes

Filename: polyglot_child_pdf_off00025a77.pdf
Hash: e92f0770b5e327040d1f31f0c2dd08684ffd158fc5cdb8a73e54f10e5bbe9f6e
Kind: polyglot-child-pdf
Source: Secondary PDF body inside pdf container at offset 0x25A77
Size: 6297 bytes
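
The secondary PDF body can be located and carved by hand from the offset and size given above. The sketch below makes two assumptions that the report does not confirm: that the recorded size is a contiguous byte span starting at the recorded offset, and that the 64-hex-digit value listed with each artifact is a SHA-256 digest. All function names are hypothetical.

```python
import hashlib

PDF_MAGIC = b"%PDF-"


def nonzero_pdf_offsets(data: bytes) -> list[int]:
    """Return every offset greater than zero where a PDF header begins."""
    offsets = []
    pos = data.find(PDF_MAGIC, 1)  # start at 1 to skip a header at offset 0
    while pos != -1:
        offsets.append(pos)
        pos = data.find(PDF_MAGIC, pos + 1)
    return offsets


def carve(data: bytes, offset: int, size: int) -> bytes:
    """Slice `size` bytes at `offset`, failing if the span is truncated."""
    chunk = data[offset:offset + size]
    if len(chunk) != size:
        raise ValueError("requested span runs past the end of the sample")
    return chunk


def carve_and_verify(data: bytes, offset: int, size: int, expected_sha256: str) -> bool:
    """Carve the span and compare its SHA-256 digest with the expected value."""
    digest = hashlib.sha256(carve(data, offset, size)).hexdigest()
    return digest == expected_sha256.lower()
```

For this sample the call would be `carve_and_verify(data, 0x25A77, 6297, "e92f0770b5e327040d1f31f0c2dd08684ffd158fc5cdb8a73e54f10e5bbe9f6e")`, with `data` holding the bytes of the top-level PDF. The same approach may not reproduce pushpro.swf, because an EmbeddedFile stream is often stored compressed and its listed size may refer to the decoded content.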