peepdf.py

This post is part of a series categorized as “Wiki” that contains basic how-to information. The intent is to build a reference repository for myself, but if anyone else can benefit from it, I’m happy to share the knowledge!

  • OS: Linux/Windows
  • Description: Get stats for a PDF, identify high-risk keywords and dictionary entries, examine the PDF's structure, and inspect its contents
Helpful Options:
 -i  interactive console mode
 -l  loose parsing

Documented commands (type help <topic>):
========================================
bytes
changelog
create
decode
decrypt
embed
encode
encode_strings
encrypt
errors
exit
filters
hash
help
info
js_analyse
js_beautify
js_code
js_eval
js_jjdecode
js_join
js_unescape
js_vars
log
malformed_output
metadata
modify
object
offsets
open
quit
rawobject
rawstream
references
replace
reset
save
save_version
sctest
search
set
show
stream
tree
vtcheck
xor
xor_search
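
For repeated triage, a handful of the commands above can be saved to a text file, one per line, and replayed. The sketch below only builds and writes such a file. The choice and order of commands is my own suggestion, and `render_command_script`, `write_command_script` and `triage.txt` are hypothetical names. Loading the file relies on peepdf's `-s` script option, which is not covered in this post, so check `peepdf.py -h` on your version first.

```python
from pathlib import Path

# A subset of the documented console commands; the order is only a suggestion.
TRIAGE_COMMANDS = ["info", "metadata", "tree", "errors"]


def render_command_script(commands):
    """Return the script text: one console command per line."""
    return "\n".join(commands) + "\n"


def write_command_script(path, commands=TRIAGE_COMMANDS):
    """Write the command script to `path` and return the text written."""
    text = render_command_script(commands)
    Path(path).write_text(text)
    return text
```

Assuming the `-s` option is available, the resulting file could be used as `peepdf.py -s triage.txt sample.pdf`, where both file names are placeholders.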