Class TextExtractor
Class TextExtractor
Pôvodný názov: Aspose.Pdf.Plugins Zhromaždenie: Aspose.PDF.dll (25.4.0)
Predstavuje TextExtractor plugin.
public class TextExtractor : PdfExtractor, IPlugin, IDisposable
Inheritance
object ← PdfExtractor ← TextExtractor
Implements
Z dedičných členov
PdfExtractor.Process(IPluginOptions) , PdfExtractor.Dispose() , object.GetType() , object.MemberwiseClone() , object.ToString() , object.Equals(object?) , object.Equals(object?, object?) , object.ReferenceEquals(object?, object?) , object.GetHashCode()
Examples
Príklad ukazuje, ako extrahovať textový obsah z PDF dokumentu.
// create TextExtractor object to extract text in PDF contents
using (TextExtractor extractor = new TextExtractor())
{
// create TextExtractorOptions
textExtractorOptions = new TextExtractorOptions();
// add input file path to data sources
textExtractorOptions.AddDataSource(new FileDataSource(inputPath));
// perform extraction process
ResultContainer resultContainer = extractor.Process(textExtractorOptions);
// get the extracted text from the ResultContainer object
string textExtracted = resultContainer.ResultCollection[0].ToString();
}
Remarks
Objekt Aspose.Pdf.Plugins.TextExtractor sa používa na extrahovanie textu do PDF dokumentov.
Constructors
TextExtractor()
public TextExtractor()