Solid Framework to PDFNet Migration

SolidFramework to PDFNet

The PDFNet Structured Output Module is based on SolidFramework version 10.0.13386. Therefore, if you are running an older version of SolidFramework then you should review the following Changelog for any important changes between your current version and SolidFramework 10.0.13386. Once completed, you can continue with the rest of this guide.

https://solidframework.net/release-notes/

If you are new to PDFNet, please check out our PDF2Office sample application.

API Translation

The following translation tables can be used to map Solid Framework APIs to PDFNet APIs.

Common

Methods

Solid Framework

PDFNet

Convert()
ConvertTo()
AddSourceFile()
AddSourceFiles()
ClearSourceFiles()
GetResultAsStream()

Convert.ToWord()
Convert.ToExcel()
Convert.ToPowerPoint()
Convert.ToHtml()

PDFNet only supports converting one file at a time. No overwrite warning. File conversions only, no in-memory streams or byte arrays. No progress callbacks or cancel mechanism.

PdfToWordConverter

Settings

Solid Framework

PDFNet

DetectSoftHyphens

SetConnectHyphens

FootnotesMode

SetFootnotesSetting

HeaderAndFooterMode

SetHeadersAndFootersSetting

OutputType
- WordML
- Rtf
- Txt
- Doc
- DocX

SetWordOutputFormat
- N/A
- e_wof_rtf
- e_wof_txt
- e_wof_doc
- e_wof_docx

Password

SetPDFPassword

PageRange

SetPages

ReconstructionMode
- Flowing
- Exact
- Continuous
- PlainText
- NoColumns

SetPrioritizeVisualAppearance
- False
- True
- N/A
- N/A
- N/A

TextRecoveryEngine

SetPreferredOCREngine

TextRecoveryLanguage
- "au"
- "ca"
- "da"
- "de"
- "en"
- "es"
- "fi"
- "fr"
- "it"
- "nl"
- "no"
- "pl"
- "pt"
- "ro"
- "ru"
- "sl"
- "sv"
- "tr"
- "zh"
- "zt"
- "ja"
- "ko"

SetCustomOCRLanguage
- "au" (default)
- "ca" or "cat"
- "da" or "dan"
- "de" or "deu"
- "en" or "eng"
- "es" or "spa"
- "fi" or "fin"
- "fr" or "fra"
- "it" or "ita"
- "nl" or "nld"
- "no" or "nor"
- "pl" or "pol"
- "pt" or "por"
- "ro" or "ron"
- "ru" or "rus"
- "sl" or "slv"
- "sv" or "swe"
- "tr" or "tur"
- "zh" or "chi_sim"
- "zt" or "chi_tra"
- "ja" or "jpn"
- "ko" or "kor"

TextRecoveryType
- Automatic
- Always
- Never
- Default

SetSearchableImageSetting
- e_ocr_text
- e_ocr_always
- e_ocr_off
- e_ocr_text

Defaults

sh

1DetectSoftHyphens = False;
2OutputType = DocX;
3Password = None;
4PageRange = All Pages;
5ReconstructionMode = Flowing;
6DetectToc = True;
7DetectLists = True
8DetectTables = True;
9DetectTaggedTables = True;
10DetectStyles = True;
11DetectLanguage = True;
12KeepCharacterSpacing = True;
13AverageCharacterScaling = True;
14SupportRightToLeftWritingDirection = True;
15ImageAnchoringMode = Automatic;
16HeaderAndFooterMode = Detect;
17FootnotesMode = FootnotesMode;
18MarkupAnnotConversionType = TextBox;
19TargetWordFormat = Automatic;
20AutoRotate = True;
21TextRecoveryType = Automatic;
22TextRecoveryLanguage = Automatic;
23TextRecoveryNseType = Automatic;
24TextRecoveryEngine = Automatic;
25TextRecoveryEngineNse = Automatic;
26GraphicsAsImages = False;
27KeepInvisibleText = False;
28KeepBackgroundColorText = False;

PdfToTextConverter

Note: Please switch WordOutputOptions to text mode using SetWordOutputFormat(Convert.WordOutputOptions.WordOutputFormat.e_wof_txt);

Settings

Solid Framework

PDFNet

DetectSoftHyphens

SetConnectHyphens

FootnotesMode

SetFootnotesSetting

HeaderAndFooterMode

SetHeadersAndFootersSetting

KeepLineBreaks

SetPrioritizeVisualAppearance

Password

SetPDFPassword

PageRange

SetPages

TextRecoveryEngine

SetPreferredOCREngine

TextRecoveryLanguage
- "au"
- "ca"
- "da"
- "de"
- "en"
- "es"
- "fi"
- "fr"
- "it"
- "nl"
- "no"
- "pl"
- "pt"
- "ro"
- "ru"
- "sl"
- "sv"
- "tr"
- "zh"
- "zt"
- "ja"
- "ko"

SetCustomOCRLanguage
- "au" (default)
- "ca" or "cat"
- "da" or "dan"
- "de" or "deu"
- "en" or "eng"
- "es" or "spa"
- "fi" or "fin"
- "fr" or "fra"
- "it" or "ita"
- "nl" or "nld"
- "no" or "nor"
- "pl" or "pol"
- "pt" or "por"
- "ro" or "ron"
- "ru" or "rus"
- "sl" or "slv"
- "sv" or "swe"
- "tr" or "tur"
- "zh" or "chi_sim"
- "zt" or "chi_tra"
- "ja" or "jpn"
- "ko" or "kor"

TextRecoveryType
- Automatic
- Always
- Never
- Default

SetSearchableImageSetting
- e_ocr_text
- e_ocr_always
- e_ocr_off
- e_ocr_text

Defaults

sh

1DetectSoftHyphens = False;
2KeepLineBreaks = False;
3Password = None;
4PageRange = All Pages;
5LineLength = 100;
6DetectAndRemoveHeadersAndFooters = True;
7DetectAndRemoveFootnotes = True;
8LineTerminator = Platform;
9KeepLineBreaks = False;
10AutoRotate = True;
11TextRecoveryType = Automatic;
12TextRecoveryLanguage = Automatic;
13TextRecoveryNseType = Automatic;
14TextRecoveryEngine = Automatic;
15TextRecoveryEngineNse = Automatic;
16GraphicsAsImages = False;
17KeepInvisibleText = False;
18KeepBackgroundColorText = False;

PdfToExcelConverter

Settings

Solid Framework

PDFNet

FootnotesMode

SetFootnotesSetting

HeaderAndFooterMode

SetHeadersAndFootersSetting

KeepNonTableContent, TablesFromContent

SetNonTableContent

Password

SetPDFPassword

PageRange

SetPages

SingleTable
- PlaceEachTableOnOwnSheet
- PlaceAllTablesOnSingleSheet
- PlaceTablesForEachPageOnSingleSheet

SetSingleSheet, SetPageSingleSheet
- SetSingleSheet(False)
- SetSingleSheet(True)
- SetPageSingleSheet(True)

TextRecoveryEngine

SetPreferredOCREngine

TextRecoveryLanguage
- "au"
- "ca"
- "da"
- "de"
- "en"
- "es"
- "fi"
- "fr"
- "it"
- "nl"
- "no"
- "pl"
- "pt"
- "ro"
- "ru"
- "sl"
- "sv"
- "tr"
- "zh"
- "zt"
- "ja"
- "ko"

SetCustomOCRLanguage
- "au" (default)
- "ca" or "cat"
- "da" or "dan"
- "de" or "deu"
- "en" or "eng"
- "es" or "spa"
- "fi" or "fin"
- "fr" or "fra"
- "it" or "ita"
- "nl" or "nld"
- "no" or "nor"
- "pl" or "pol"
- "pt" or "por"
- "ro" or "ron"
- "ru" or "rus"
- "sl" or "slv"
- "sv" or "swe"
- "tr" or "tur"
- "zh" or "chi_sim"
- "zt" or "chi_tra"
- "ja" or "jpn"
- "ko" or "kor"

TextRecoveryType
- Automatic
- Always
- Never
- Default

SetSearchableImageSetting
- e_ocr_text
- e_ocr_always
- e_ocr_off
- e_ocr_text

Defaults

sh

1KeepNonTableContent = False;
2Password = None;
3PageRange = All Pages;
4SingleTable = PlaceEachTableOnOwnSheet;
5AutoDetectSeparators = True;
6DetectTiledPages = True;
7OutputType = XlsX;
8HeaderAndFooterMode = Remove;
9FootnotesMode = Ignore;
10TextAnnotationsAsContent = False;
11AutoRotate = True;
12TextRecoveryType = Automatic;
13TextRecoveryLanguage = Automatic;
14TextRecoveryNseType = Automatic;
15TextRecoveryEngine = Automatic;
16TextRecoveryEngineNse = Automatic;
17GraphicsAsImages = False;
18KeepInvisibleText = False;
19KeepBackgroundColorText = False;

PdfToPowerPointConverter

Settings

Solid Framework

PDFNet

FootnotesMode

SetFootnotesSetting

HeaderAndFooterMode

SetHeadersAndFootersSetting

Password

SetPDFPassword

PageRange

SetPages

TextRecoveryEngine

SetPreferredOCREngine

TextRecoveryLanguage
- "au"
- "ca"
- "da"
- "de"
- "en"
- "es"
- "fi"
- "fr"
- "it"
- "nl"
- "no"
- "pt"
- "pl"
- "ro"
- "ru"
- "sl"
- "sv"
- "tr"
- "zh"
- "zt"
- "ja"
- "ko"

SetCustomOCRLanguage
- "au" (default)
- "ca" or "cat"
- "da" or "dan"
- "de" or "deu"
- "en" or "eng"
- "es" or "spa"
- "fi" or "fin"
- "fr" or "fra"
- "it" or "ita"
- "nl" or "nld"
- "no" or "nor"
- "pl" or "pol"
- "pt" or "por"
- "ro" or "ron"
- "ru" or "rus"
- "sl" or "slv"
- "sv" or "swe"
- "tr" or "tur"
- "zh" or "chi_sim"
- "zt" or "chi_tra"
- "ja" or "jpn"
- "ko" or "kor"

TextRecoveryType
- Automatic
- Always
- Never
- Default

SetSearchableImageSetting
- e_ocr_text
- e_ocr_always
- e_ocr_off
- e_ocr_text

Defaults

sh

1Password = None;
2PageRange = All Pages;
3EmbedFontsMode = NoFontEmbedding;
4AutoRotate = True;
5TextRecoveryType = Automatic;
6TextRecoveryLanguage = Automatic;
7TextRecoveryNseType = Automatic;
8TextRecoveryEngine = Automatic;
9TextRecoveryEngineNse = Automatic;
10GraphicsAsImages = False;
11KeepInvisibleText = False;
12KeepBackgroundColorText = False;

PdfToHtmlConverter

Note: The default PDF to HTML engine in PDFNet is not Solid Framework. To enable Solid Framework, please call options.SetContentReflowSetting(Convert.HTMLOutputOptions.e_reflow_full);

Settings

Solid Framework

PDFNet

DetectSoftHyphens

SetConnectHyphens

ExactMode
- False
- True

SetContentReflowSetting
- e_reflow_full
- e_fixed_position

FootnotesMode

SetFootnotesSetting

HeaderAndFooterMode

SetHeadersAndFootersSetting

Images
- Default
- Link
- Ignore
- Embed

SetEmbedImages
- False
- False
- N/A
- True

Password

SetPDFPassword

PageRange

SetPages

TextRecoveryEngine

SetPreferredOCREngine

TextRecoveryLanguage
- "au"
- "ca"
- "da"
- "de"
- "en"
- "es"
- "fi"
- "fr"
- "it"
- "nl"
- "no"
- "pl"
- "pt"
- "ro"
- "ru"
- "sl"
- "sv"
- "tr"

SetLanguage
- e_lang_auto (default)
- e_lang_catalan
- e_lang_danish
- e_lang_german
- e_lang_english
- e_lang_spanish
- e_lang_finnish
- e_lang_french
- e_lang_italian
- e_lang_dutch
- e_lang_norwegian
- e_lang_portuguese
- e_lang_polish
- e_lang_romanian
- e_lang_russian
- e_lang_slovenian
- e_lang_swedish
- e_lang_turkish

TextRecoveryType
- Automatic
- Always
- Never
- Default

SetSearchableImageSetting
- e_ocr_text
- e_ocr_always
- e_ocr_off
- e_ocr_text

Defaults

Note: The default PDF to HTML engine in PDFNet is not Solid Framework.

sh

1DetectSoftHyphens = False;
2Images = Embed;
3Password = None;
4PageRange = All Pages;
5WidthLimit = 0;
6DetectLanguage = True;
7ImageType = Default;
8KeepLineBreaks = False;
9HeaderAndFooterMode = Remove;
10AutoRotate = True;
11TextRecoveryType = Automatic;
12TextRecoveryLanguage = Automatic;
13TextRecoveryNseType = Automatic;
14TextRecoveryEngine = Automatic;
15TextRecoveryEngineNse = Automatic;
16GraphicsAsImages = False
17KeepInvisibleText = False;
18KeepBackgroundColorText = False;

Dependencies

It is expected that whatever instance you were running SolidFramework.dll on will also run the Structured Output Module (StructuredOutput.exe) just fine.

Did you find this helpful?

Trial setup questions?

Ask experts on Discord

Need other help?

Contact Support

Pricing or product questions?

Contact Sales