Class TextExtractor

Namespace: Aspose.Pdf.Plugins
Assembly: Aspose.PDF.dll

Represents the TextExtractor plugin.

public class TextExtractor : PdfExtractor, IPlugin, IDisposable

Inheritance

object → PdfExtractor → TextExtractor

Implements

IPlugin, IDisposable

Inherited Members

PdfExtractor.Process(IPluginOptions), PdfExtractor.Dispose(), object.GetType(), object.MemberwiseClone(), object.ToString(), object.Equals(object?), object.Equals(object?, object?), object.ReferenceEquals(object?, object?), object.GetHashCode()

Examples

The example demonstrates how to extract the text content of a PDF document.

// create TextExtractor object to extract text in PDF contents
using (TextExtractor extractor = new TextExtractor())
{
    // create TextExtractorOptions
    TextExtractorOptions textExtractorOptions = new TextExtractorOptions();

    // add input file path to data sources
    textExtractorOptions.AddDataSource(new FileDataSource(inputPath));

    // perform extraction process
    ResultContainer resultContainer = extractor.Process(textExtractorOptions);

    // get the extracted text from the ResultContainer object
    string textExtracted = resultContainer.ResultCollection[0].ToString();
}

Remarks

The Aspose.Pdf.Plugins.TextExtractor object is used to extract text from PDF documents.

Constructors

TextExtractor()

public TextExtractor()