[task] Pdf transformation

Description

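Pdft is a PDF transformer task: it applies a FreeMarker template to a PDF file or fileset and writes the result to an output file or directory, for example to transform a PDF into XML.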

Pdft task parameters

Attribute       Description             Value             Required
file            input file              .pdf              file or fileset
tofile          output file             .                 tofile or todir
todir           output dir                                no
template        freemarker template     .ftl              yes (or inner template)
overwrite       overwrite output file   true by default   no
ouputproperty   output property                           no

Element         Description             Value                           Required
parameter       custom param value      <parameter name="" value=""/>   no

* specific to pdf task
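
As a rough sketch of how the nested parameter element listed above might be used, the build file below passes one custom value alongside the usual attributes. The target name, the parameter name "author" and its value are hypothetical, and how the template reads the parameter is not described here, so treat this as an illustration rather than the task's documented usage.

```xml
<project name="usages" xmlns:bj="antlib:org.bidji.taskdefs">
  <target name="pdf2xml-with-param" description="transform pdf to xml with a custom parameter">
    <bj:pdft file="datas/file1.pdf" tofile="gen/out.xml" template="templates/pdf2xml.ftl">
      <!-- hypothetical name and value, for illustration only -->
      <parameter name="author" value="bidji"/>
    </bj:pdft>
  </target>
</project>
```

The overwrite attribute is left out here because the table lists it as true by default.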

Usages

Use cases

file2file set2set set2file file2set
  • Transform a PDF to XML; see the file2file example.
$ tree pdft/
pdft/
|-- build.xml
|-- datas
|   |-- file1.pdf
|-- gen
|   |-- out.xml
`-- templates
    |-- pdf2xml.ftl

ant file - build.xml

<project name="usages" xmlns:bj="antlib:org.bidji.taskdefs">
  <target name="pdf2xml" description="transform pdf to xml">
    <bj:pdft file="datas/file1.pdf" tofile="gen/out.xml" template="templates/pdf2xml.ftl" overwrite="true"/>
  </target>
</project>

input files

  • file1.pdf

pdf2xml.ftl

<?xml version="1.0"?>
<document>
[#list pdf?api.pages() as page]
  <page id="${page.id()}">
    <title>${page.title()}</title>
    <content>${page.content()}</content>
  </page>
[/#list]
</document>
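
The attribute table says the input may be a fileset and the output a directory. A minimal sketch of such a set2set call follows, assuming the task accepts a standard nested Ant fileset and chooses the output file names under todir itself. Neither point is confirmed above, and the target name is hypothetical.

```xml
<project name="usages" xmlns:bj="antlib:org.bidji.taskdefs">
  <target name="pdfs2xml" description="transform every pdf under datas">
    <bj:pdft todir="gen" template="templates/pdf2xml.ftl">
      <!-- assumption: input files are given as a nested Ant fileset -->
      <fileset dir="datas" includes="*.pdf"/>
    </bj:pdft>
  </target>
</project>
```

This reuses the same pdf2xml.ftl template, so each input PDF would be rendered through the same page loop.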