Comments (8)
Did you use the Job class, you need it to make the Adobe IFilter work. See the demo app that is in the project about how to use it.
from ifiltertextreader.
See the comment that I added to the Job class :-)
/// <summary>
/// Use this class to sandbox Adobe IFilter 11 or higher when you want to use this code on Windows 2012 or higher
/// </summary>
/// <summary>
/// Make a job object to sandbox the IFilter code
/// </summary>
private readonly Job _job = new Job();
// Add the current process to the sandbox
_job.AddProcess(Process.GetCurrentProcess().Handle);
from ifiltertextreader.
You are right. I forget to use it. I try it right now, just need to reconstitute my code.
from ifiltertextreader.
Kees, it works just great ! Thank you. Please, take my donation as compliment.
from ifiltertextreader.
You are welcome. Just for my own curiosity ... for what are you using the iFilterTextReader?
from ifiltertextreader.
I am extracting text from documents (they are stored in encrypted form and I decrypt them each time I need to index) and using Lucene.Net for search with referencing criterias of various types (mostly IDs of numerous parameters.
from ifiltertextreader.
If my project has some shortcomings then take a look at Tika (https://tika.apache.org/) . It's a java library but there is also a .NET port that is generated with IKVM. You can find it overhere --> https://github.com/KevM/tikaondotnet
from ifiltertextreader.
I will take a look at that for sure. The way my app works is ok for the moment, but who know, may be I will need more productive tools. Thank you for you work, you made a very useful library.
from ifiltertextreader.
Related Issues (20)
- Cannot read text from .xls file HOT 11
- Text extraction hangs when reading .odt file HOT 4
- Index out of bounds reading a pdf document HOT 1
- Can't get the PDF filter to load the IPersistStream in FileLoader.cs HOT 4
- Question of requirements: does not contain a method named 'new' HOT 5
- TextReader not recognixing line breaks in .docx File HOT 4
- Keep file formatting HOT 1
- Open File Reader with MemoryStream HOT 3
- Document metadata properties HOT 8
- Exception if property with multiple values exists
- Weird text encoding issue with colons and section symbols HOT 1
- Registry DLL issue after upgrading HOT 1
- System.AccessViolationException HOT 19
- Outdated(?) OffFilter.dll on Windows Server 2012 HOT 2
- OffFilt.dll AccessViolationException HOT 11
- ReadToEnd() causes "Destination Array Not Long Enough" for legacy Word files HOT 1
- Missing filter return code? HOT 7
- Version 1.7+ - System.ExecutionEngineException and System.AccessViolationException HOT 16
- Cannot read text from .xls HOT 6
- License question HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ifiltertextreader.