Split Large Documents
Use the Wordize Splitting for .NET module to split your documents into parts.
Select any document conversion module to work with the required formats.
Wordize provides the Wordize Splitting for .NET module, which allows users to split large documents into multiple smaller documents programmatically. This feature is useful for dividing large content into manageable sections on a page-by-page basis, as well as based on sections, page ranges, and paragraph styles.
Additionally, Wordize allows you to remove blank pages from your documents to improve readability and ensure a professional appearance, as well as reduce file size and prevent printing unnecessary pages.
Let’s look at each of these methods in more detail.
Split Options
To split a document into several parts, use different implementations of the Split and ExtractPages methods.
To split a document using the Split method, you must use one of the properties of the SplitOptions class:
- SplitCriteria – specifies one of the options for splitting a document into parts: by page or by section break
- SplitStyle – specifies the paragraph formatting style, which can also be a criterion for splitting a document into parts
To split a document using the ExtractPages method, you must pass the numbers of the beginning and end of the page range.
For details and examples of the document splitting options, see the following sections.
Split Page by Page
Wordize allows users to split a document page by page. So if you have a document of 5 pages, using this method of splitting the document you will get 5 documents of 1 page each. For this purpose, use the Page value of the SplitCriteria property.
Use one of the Split methods to split a document page by page:
method Split(string, string, SplitOptions)
string doc = "BigDocument.docx";
SplitOptions options = new SplitOptions();
options.SplitCriteria = SplitCriteria.Page;
Splitter.Split(doc, "SplitDocument.1.docx", options);
method (string, string, SaveFormat, SplitOptions)
string doc = "BigDocument.docx";
SplitOptions options = new SplitOptions();
options.SplitCriteria = SplitCriteria.Page;
Splitter.Split(doc, "SplitDocument.2.docx", SaveFormat.Docx, options);
method Split(Stream, SaveFormat, SplitOptions)
using var streamIn = File.OpenRead("BigDocument.docx");
SplitOptions options = new SplitOptions();
options.SplitCriteria = SplitCriteria.Page;
Stream[] stream = Splitter.Split(streamIn, SaveFormat.Docx, options);
Split by Sections
Wordize also allows to use section breaks to split documents into sections. For this purpose, use the SectionBreak value of the SplitCriteria property.
Use one of the Split methods to split a document into sections:
method (string, string, SplitOptions)
string doc = "BigDocument.docx";
SplitOptions options = new SplitOptions();
options.SplitCriteria = SplitCriteria.SectionBreak;
Splitter.Split(doc, "SplitDocument.3.docx", options);
method (string, string, SaveFormat, SplitOptions)
string doc = "BigDocument.docx";
SplitOptions options = new SplitOptions();
options.SplitCriteria = SplitCriteria.SectionBreak;
Splitter.Split(doc, "SplitDocument.4.docx", SaveFormat.Docx, options);
method (Stream, SaveFormat, SplitOptions)
using var streamIn = File.OpenRead("BigDocument.docx");
SplitOptions options = new SplitOptions();
options.SplitCriteria = SplitCriteria.SectionBreak;
Stream[] stream = Splitter.Split(streamIn, SaveFormat.Docx, options);
Split by Styles
Wordize allows you to split a document into parts by paragraph, formatted with a specific style. To split a document by paragraphs, use the Style value of the SplitCriteria property and define styles using the SplitStyle property.
Use one of the Split methods to split a document into parts by paragraph:
method (string, string, SplitOptions)
string doc = "BigDocument.docx";
SplitOptions options = new SplitOptions();
options.SplitCriteria = SplitCriteria.Style;
options.SplitStyle = "Heading 1";
Splitter.Split(doc, "SplitDocument.5.docx", options);
method (string, string, SaveFormat, SplitOptions)
string doc = "BigDocument.docx";
SplitOptions options = new SplitOptions();
options.SplitCriteria = SplitCriteria.Style;
options.SplitStyle = "Heading 1";
Splitter.Split(doc, "SplitDocument.6.docx", SaveFormat.Docx, options);
method (Stream, SaveFormat, SplitOptions)
using var streamIn = File.OpenRead("BigDocument.docx");
SplitOptions options = new SplitOptions();
options.SplitCriteria = SplitCriteria.Style;
Stream[] stream = Splitter.Split(streamIn, SaveFormat.Docx, options);
Split by Page Ranges
Wordize allows you to extract a specific page ranges from a multi-page document. Just select a range of pages and keep only that part of the original document.
Use one of the ExtractPages methods to do this:
method ExtractPages(string, string, int, int)
string doc = "BigDocument.docx";
Splitter.ExtractPages(doc, "ExtractPages.1.docx", 0, 2);
method ExtractPages(string, string, SaveFormat, int, int)
string doc = "BigDocument.docx";
Splitter.ExtractPages(doc, "ExtractPages.2.docx", SaveFormat.Docx, 0, 2);
method ExtractPages(Stream, Stream, SaveFormat, int, int)
using var streamIn = File.OpenRead("BigDocument.docx");
using var streamOut = File.Create("UnsignedDocument.docx");
Splitter.Split(streamIn, streamOut, SaveFormat.Docx, 0, 2);
Remove Blank Pages
You can also remove blank pages from the document.
Use one of the RemoveBlankPages methods to do this:
method RemoveBlankPages(string, string)
string doc = "BlankPages.docx";
Splitter.RemoveBlankPages(doc, "RemoveBlankPages.1.docx");
method RemoveBlankPages(string, string, SaveFormat)
string doc = "BlankPages.docx";
Splitter.RemoveBlankPages(doc, "RemoveBlankPages.2.docx", SaveFormat.Docx);
method RemoveBlankPages(Stream, Stream, SaveFormat)
using var streamIn = File.OpenRead("BigDocument.docx");
using var streamOut = File.Create("UnsignedDocument.docx");
Splitter.Split(streamIn, streamOut, SaveFormat.Docx);