Usage: pdf2html [options] -in inputfile -out outputfile

BASIC PARAMETERS:

  -in [ -i ] arg            The input file. The default input folder is the
                            current working folder. When running the
                            application from the console or a BAT file, you may
                            use relative file names. However, if you are
                            calling it from an application or a server, we
                            recommend using absolute file names for robustness.

  -out [ -o ] arg           The output file. You can also output to HTM
                            instead by specifying ".htm" as the file extension.
                            The default output folder is the same folder as
                            the input. When running the application from the
                            console or a BAT file, you may use relative file
                            names. However, if you are calling it from an
                            application or a server, we recommend using
                            absolute file names for robustness.

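The recommendation to use absolute file names can be followed from a shell
script by resolving each path before passing it to -in and -out. The following
is a minimal sketch, not part of the tool: abs_path is a hypothetical helper
name, and it assumes the directory portion of the path already exists.

```shell
# abs_path is a hypothetical helper (not part of pdf2html).
# It prints an absolute version of the path given in $1.
# Assumption: the directory part of the path exists; the file need not.
abs_path() {
  # CDPATH= prevents cd from printing or searching other directories.
  dir=$(CDPATH= cd -- "$(dirname -- "$1")" && pwd) || return 1
  # ${dir%/} avoids a double slash when the directory is "/".
  printf '%s/%s\n' "${dir%/}" "$(basename -- "$1")"
}

# Usage (file names are placeholders):
#   pdf2html -in "$(abs_path myInput.pdf)" -out "$(abs_path myOutput.html)"
```
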
OPTIONS:

  -pages arg (=all)         Page numbers of pages to be converted. You may
                            specify a single page number, e.g. 2, or a range of
                            pages, e.g. 2-6. If omitted, all pages will be
                            converted.

  -fileTimeout arg (=300)   The maximum amount of time allowed, in seconds,
                            for each document conversion. The default timeout
                            is 300 seconds (5 minutes).

  -password arg             The master password to open the PDF. The password
                            must give unrestricted content extraction
                            permissions.

  -quality arg (=85)        The image compression quality for JPEG, from 5 to
                            100. Quality is ignored for PNG images. The default
                            quality is 85.

  -compress                 The output image type, either JPEG ("jpeg") or
  [ -compression ]          PNG ("png"). The default is "jpeg".
  arg (=jpeg)

  -resolution [ -res ]      The resolution for the output images in dots per
  arg (=96)                 inch, from 8 to 600. The default resolution is
                            96 dpi.

  -embedImages arg          Flag to embed images inside the HTML output using
                            Base64 encoding ("on") or to write them out as
                            external JPEG/PNG files ("off"). Embedding images
                            produces a larger output file, but the HTML file is
                            self-contained. When external image files are used,
                            you must keep them together with the HTML file.
                            External images are always placed in a separate
                            directory. If you move or delete your HTML file,
                            you should also move or delete your images folder
                            with it. Valid parameter values are "on" and "off".
                            By default, this option is turned "on" if specified
                            without an argument. However, if this option is
                            completely omitted, it defaults to "off".

  -title arg                The content that goes inside the HTML output's
                            <TITLE> tag. Please ensure that you use double
                            quotes when specifying your title,
                            e.g. -title "PDF converted to HTML". If this option
                            is not specified, the default title is
                            "Created by Apryse".

  -ocred arg (=image+text)  Handling of special OCRed PDFs that only contain
                            full-page images with hidden selectable text. This
                            option is only applicable to a PDF produced by an
                            OCR engine based on a scanned image. Valid
                            parameter values are "image+text", "text", "image"
                            and "image+hiddenText". The default option,
                            "image+text", converts both images and text, making
                            the hidden text visible in the output. Use "text"
                            to convert only the text, making the hidden text
                            visible in the output, or "image" to convert only
                            the images. If you want to convert the background
                            image and create a hidden selectable text layer
                            around it using complex JavaScript, use
                            "image+hiddenText".

  -simpleLists arg          Flag to use <P> tags ("off") or <LI> tags ("on")
                            for list items. Turning this flag "on" outputs
                            lists using <LI> tags and gives you the richest
                            logical content, but with limited physical
                            formatting capabilities. For best visual accuracy,
                            turn this flag "off". Valid parameter values are
                            "on" and "off". By default, this option is turned
                            "on" if specified without an argument. However,
                            if this option is completely omitted, it defaults
                            to "off".

  -connectHyphens arg       Flag to re-connect basic English dictionary words
                            that are hyphenated at the end of a line. This
                            does not remove hyphens from expressions that
                            require a hyphen, such as "counter-clockwise" or
                            "well-intentioned". Valid parameter values are "on"
                            and "off". By default, this option is turned "on"
                            if specified without an argument. However, if this
                            option is completely omitted, it defaults to "off".

  -symbolToUnicode arg      Flag to translate Symbol font to Times New Roman
                            Unicode. It is not recommended to use Symbol font
                            in HTML files. We strongly recommend that you leave
                            this turned "on" unless your HTML is viewed using
                            Internet Explorer on Windows only. Turning this
                            "off" may produce HTML that looks corrupt on macOS,
                            iOS, Android and Linux. Valid parameter values are
                            "on" and "off". By default, this option is turned
                            "on" if specified without an argument. It also
                            defaults to "on" if this option is completely
                            omitted.

  -advanced [ -a ] arg      Advanced option to ignore angled text
                            (IgnoreAngledText=True) and/or vertical text
                            (IgnoreVerticalText=True). By default, these are
                            turned off.

  -version                  Print the version number. Only available on
                            Windows.

  -silent                   Switches the application to Silent Mode. Warnings
                            and progress messages are not displayed. Only
                            errors that are considered failures are shown.
                            Only available on Windows.

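Because -quality and -resolution accept only the documented ranges (5 to 100
and 8 to 600), a calling script may want to check values before starting a
conversion. A minimal sketch follows; in_range and image_opts are hypothetical
helper names, and the error messages are illustrative rather than anything
pdf2html itself prints.

```shell
# in_range and image_opts are hypothetical helpers (not part of pdf2html).

# Succeeds if $1 is a non-negative integer between $2 and $3 inclusive.
in_range() {
  case $1 in
    ''|*[!0-9]*) return 1 ;;
  esac
  [ "$1" -ge "$2" ] && [ "$1" -le "$3" ]
}

# Prints JPEG image options for pdf2html ($1 = quality, $2 = resolution),
# or fails if a value is outside the documented range.
image_opts() {
  in_range "$1" 5 100 || { echo "quality must be 5 to 100" >&2; return 1; }
  in_range "$2" 8 600 || { echo "resolution must be 8 to 600" >&2; return 1; }
  printf '%s\n' "-compress jpeg -quality $1 -resolution $2"
}

# Usage (file names are placeholders; $opts is deliberately unquoted
# so that it splits into separate arguments):
#   opts=$(image_opts 90 150) && pdf2html $opts -in my.pdf -out my.html
```
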
Examples:
  pdf2html -in myInput.pdf -out myOutput.html
  pdf2html -password MyPDFPassword -in my.pdf -out myHTM.htm
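
To convert several files, the single-file form can be wrapped in a loop. The
sketch below is one possible approach, not an official script: convert_all and
the DRY_RUN variable are hypothetical names, and the failure branch assumes
that pdf2html returns a non-zero exit status on error, which this help text
does not confirm.

```shell
# convert_all is a hypothetical helper (not part of pdf2html).
# It converts every .pdf file in directory $1 to an .html file next to it.
# With DRY_RUN=1 it only prints the commands it would run.
convert_all() {
  for f in "$1"/*.pdf; do
    [ -e "$f" ] || continue   # skip the unexpanded pattern if no PDFs exist
    out="${f%.pdf}.html"
    if [ "${DRY_RUN:-0}" = 1 ]; then
      printf 'pdf2html -in %s -out %s\n' "$f" "$out"
    else
      # Assumption: a non-zero exit status signals a failed conversion.
      pdf2html -in "$f" -out "$out" || echo "failed: $f" >&2
    fi
  done
}

# Usage (the directory is a placeholder):
#   DRY_RUN=1 convert_all /path/to/pdfs
```

Running with DRY_RUN=1 first is a cheap way to confirm the generated file
names before any conversion starts.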