Malicious PDF — malware analysis report

Static analysis result for SHA-256 ba7a81d9e40b62dd…

MALICIOUS

PDF

61.4 KB
MD5: c039497c7c1e0ce31367fffe4af5b5a3 SHA-1: e34a1425f939a897ec6e4b1d40eda45addd0f1a3 SHA-256: ba7a81d9e40b62dd567490224a807177faba63784447fe00c29a2cbf79d13f32
160 Risk Score

Malware Insights

MITRE ATT&CK
T1204.002 Malicious File: Malicious File T1566.001 Phishing: Spearphishing Attachment

The PDF file contains XFA (XML Forms Architecture) which is known to be vulnerable to heap spray attacks. The 'PDF_XFA_HEAP_SPRAY' heuristic indicates that exploit code was found within an XFA JavaScript stream. This script is likely designed to download and execute a second-stage payload from the unknown URL 'http://g3453453h12378y213.com'. The ML classifier also strongly flagged this PDF as malicious.

Machine Learning

  • Nyx PDF Classifier malicious score 0.9982

Heuristics 6

  • XFA form contains risky executable script high CVE related PDF_XFA_SCRIPT
    PDF embeds an XFA form whose script block contains exploit, submission/launch, or shell-execution primitives. Ordinary LiveCycle print/update scripts are left as generic XFA/JS signals unless stronger behavior is present.
  • XFA JavaScript heap-spray exploit code critical PDF_XFA_HEAP_SPRAY
    PDF contains XFA script content with heap-spray or shellcode-like JavaScript markers such as large encoded word sequences, util.pack, large arrays, or spray variable names. This is a weaponised Adobe Reader exploit pattern, not a normal interactive form.
  • Embedded script payload in PDF stream medium PDF_EMBEDDED_SCRIPT_PAYLOAD
    PDF stream bytes contain an HTML/XFA <script> tag without accompanying Windows shell-execution primitives — common in accessible XFA forms but worth surfacing for analyst review.
  • XFA form low PDF_XFA
    PDF uses XML Forms Architecture — can contain script logic
  • PDF differential parser failed info PDF_DIFFERENTIAL_PARSE_FAILED
    The cross-check parser (pdfminer.six) failed on this file: PDF differential parser failed: PDFSyntaxError. Static heuristics still ran and any of their findings above are valid; only the differential cross-check signal is missing.
  • Embedded URL info EMBEDDED_URL
    One or more URLs were extracted from the document. The URL itself is not a detection — see the per-URL labels for which channel (macro, JS, link annotation, document body, ...) reached each URL.
    URL http://g3453453h12378y213.com
    • http://ns.adobe.com/xdp/
    • http://www.xfa.org/schema/xci/1.0/
    • http://ns.adobe.com/xtd/
    • http://www.xfa.org/schema/xfa-data/1.0/
    • http://ns.adobe.com/xfdf/
    • http://www.xfa.org/schema/xfa-form/2.8/

Extracted artifacts 1

Files carved from inside the sample during analysis.

FilenameKindSourceSize
embedded_pdf_script_00000cfc.bin
eb0e72db7047f55ded573ae449e00d10b07684196541e008beff430d81144b1d
pdf-embedded-script PDF raw stream script payload at offset 0xCFC 60197 bytes