this is my class, I use eclipse and I haven't any errors.Do not understand 
where the problem ....
 
 
import java.io.File;
import java.io.IOException;

import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.index.IndexWriter;
import org.apache.lucene.index.Term;
import org.apache.lucene.search.Hits;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.Query;
import org.apache.lucene.search.TermQuery;
import org.apache.lucene.store.Directory;
import org.apache.lucene.store.RAMDirectory;
import org.pdfbox.searchengine.lucene.LucenePDFDocument;

public final class SimplePdfSearch 
{
private static final String PDF_FILE_PATH = 
"C:\\Users\\Tiziano\\Desktop\\doc_di_prova\\prova.pdf";
private static final String SEARCH_TERM = "prova";

public static final void main(String[] args) throws IOException
{
Directory directory = null;

try
{
File pdfFile = new File(PDF_FILE_PATH);
Document document = LucenePDFDocument.getDocument(pdfFile);

directory = new RAMDirectory();

IndexWriter indexWriter = null;

try
{
Analyzer analyzer = new StandardAnalyzer();
indexWriter = new IndexWriter(directory, analyzer, true);

indexWriter.addDocument(document);
}
finally
{
if (indexWriter != null)
{
try
{
indexWriter.close();
}
catch (IOException ignore)
{
// Ignore
}

indexWriter = null;
}
}

IndexSearcher indexSearcher = null;

try
{
indexSearcher = new IndexSearcher(directory);

Term term = new Term("contents", SEARCH_TERM);
Query query = new TermQuery(term);

Hits hits = indexSearcher.search(query);

System.out.println((hits.length() != 0) ? "Found" : "Not Found");
}
finally
{
if (indexSearcher != null)
{
try
{
indexSearcher.close();
}
catch (IOException ignore)
{
// Ignore
}

indexSearcher = null;
}
}
}
finally
{
if (directory != null)
{
try
{
directory.close();
}
catch (IOException ignore)
{
// Ignore
}

directory = null;
}
}
}
}> From: [EMAIL PROTECTED]> To: java-user@lucene.apache.org> Subject: Re: Pdf 
in Lucene?> Date: Mon, 1 Dec 2008 08:22:58 -0500> > > On Dec 1, 2008, at 8:01 
AM, tiziano bernardi wrote:> > >> > I tried to use pdfbox but gives me an 
error.> > That the version of lucene and the pdfbox are incompatible.> > Lucene 
knows nothing about PDFBox, so I don't see how they could be > incompatible, 
unless your are referring to PDFBox's Lucene Document > creator, in which case, 
you should ask on the PDFBox mailing list. I > think, however, that it's pretty 
straightforward to create a Lucene > document from PDFBox, so you shouldn't 
need to rely on their version.> > Personally, I'd have a look at Tika 
(http://lucene.apache.org/tika), > which wraps PDFBox (and other extraction 
libraries) and gives you back > SAX-like events via a ContentHandler, which you 
can then use to create > Lucene documents. Else, I've been working on SOLR-284, 
which > integrates Tika into Solr, see 
https://issues.apache.org/jira/browse/SOLR-284> > -Grant> > >> > I use pdf box 
0.7.3 and lucene 2.1.0> Date: Mon, 1 Dec 2008 11:43:00 > > +0000> From: [EMAIL 
PROTECTED]> To: java-user@lucene.apache.org> > > Subject: Re: Pdf in Lucene?> > 
Hi> > > Lucene only indexes text so > > you'll have to get the text out of the 
PDF> and feed it to lucene.> > > > Google for lucene pdf, or go straight to 
http://www.pdfbox.org/> > > > > --> Ian.> > > > 2008/12/1 tiziano bernardi 
<[EMAIL PROTECTED]>:> > > >> >> > Hi,> > I want to index PDF files with lucene 
is possible?> > > > What like?> > Thanks Tiziano Bernardi> > > > 
_________________________________________________________________> > > > Fanne 
di tutti i colori, personalizza la tua Hotmail!> > 
http://imagine-windowslive.com/Hotmail/#0 > > > > > > 
--------------------------------------------------------------------- > > > To 
unsubscribe, e-mail: [EMAIL PROTECTED]> > > For additional commands, e-mail: 
[EMAIL PROTECTED]>> > 
_________________________________________________________________> > 50 nuovi 
schemi per giocare su CrossWire! Accetta la sfida!> > 
http://livesearch.games.msn.com/crosswire/play_it/> > 
--------------------------> Grant Ingersoll> > Lucene Helpful Hints:> 
http://wiki.apache.org/lucene-java/BasicsOfPerformance> 
http://wiki.apache.org/lucene-java/LuceneFAQ> > > > > > > > > > > > 
---------------------------------------------------------------------> To 
unsubscribe, e-mail: [EMAIL PROTECTED]> For additional commands, e-mail: [EMAIL 
PROTECTED]> 
_________________________________________________________________
Vai oltre le parole, scarica il nuovo Messenger!
http://download.live.com/?mkt=it-it

Reply via email to