Class TextExtractor

Class TextExtractor

Navne til: Aspose.Pdf.Plugins Sammensætning: Aspose.PDF.dll (25.4.0)

Det er en tekstekstraktor plugin.

public class TextExtractor : PdfExtractor, IPlugin, IDisposable

Inheritance

object PdfExtractor TextExtractor

Implements

IPlugin , IDisposable

De arvede medlemmer

PdfExtractor.Process(IPluginOptions) , PdfExtractor.Dispose() , object.GetType() , object.MemberwiseClone() , object.ToString() , object.Equals(object?) , object.Equals(object?, object?) , object.ReferenceEquals(object?, object?) , object.GetHashCode()

Examples

Eksemplet viser, hvordan man ekstraherer tekstindhold fra PDF-dokumentet.

// create TextExtractor object to extract text in PDF contents
using (TextExtractor extractor = new TextExtractor())
{
    // create TextExtractorOptions
    textExtractorOptions = new TextExtractorOptions();

    // add input file path to data sources
    textExtractorOptions.AddDataSource(new FileDataSource(inputPath));

    // perform extraction process
    ResultContainer resultContainer = extractor.Process(textExtractorOptions);

    // get the extracted text from the ResultContainer object
    string textExtracted = resultContainer.ResultCollection[0].ToString();
}

Remarks

Den Aspose.Pdf.Plugins.TextExtractor objekt bruges til at udveksle tekst i PDF-dokumenter.

Constructors

TextExtractor()

public TextExtractor()
 Dansk