Re: [PHP] I need to Index and to research files PDF

2006-05-03 Thread Jochem Maas

Carlos Augusto Falcão da Silva wrote:

> I'm so sorry, but I didn't notice that it had duplicated the message. :-(


no problem :-) there are worse crimes.


> Thank you for your help


let us know what your solution ends up being - it might help others.
I'd be especially interested in what your experiences are with the google box -
assuming you go that route.

--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php



Re: [PHP] I need to Index and to research files PDF

2006-05-02 Thread Richard Lynch
On Tue, May 2, 2006 11:36 am, Carlos Augusto Falcão da Silva wrote:

You may want to re-post in your native language...

The translation suffered quite a bit...

These may also help:
http://www.google.com/search?q=PHP+PDF2text
http://www.google.com/search?q=gocr
http://php.net/libpdf
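
One way to read those suggestions: extract the text from each PDF once, then search an index of that text instead of the files themselves. Below is a minimal sketch of that idea in Python; the same structure would map onto PHP with a MySQL table of word/document pairs. It assumes the text has already been extracted by some external tool (for example a command-line converter such as pdftotext). The function names, file names, and sample text are made up for illustration and are not from any library.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def build_index(documents):
    """Map each word to the set of document names that contain it.

    `documents` is a dict of {file name: extracted text}. The PDF-to-text
    step is assumed to have happened already.
    """
    index = defaultdict(set)
    for name, text in documents.items():
        for word in tokenize(text):
            index[word].add(name)
    return index


def search(index, query):
    """Return the sorted names of documents containing every query word."""
    words = tokenize(query)
    if not words:
        return []
    hits = set(index.get(words[0], set()))
    for word in words[1:]:
        hits &= index.get(word, set())
    return sorted(hits)


# Hypothetical sample data standing in for text extracted from scanned PDFs.
docs = {
    "norm-001.pdf": "Torque requirements for wheel bolts",
    "norm-002.pdf": "Paint thickness requirements",
}
index = build_index(docs)
print(search(index, "requirements"))
print(search(index, "wheel torque"))
```

Building the index once and querying it keeps each search cheap even with several hundred documents; re-reading every PDF on each query would not scale as well.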

> I am developing a content management system on our intranet so that we
> can manage the documents of our quality management system, as well as
> the assemblers' standards, etc. Today I have some 500 standards, about
> 85% of them on paper. I have already been scanning some of them and
> providing controlled access through links on the intranet. That is what
> I want to do with all of them. In other words: I scan to PDF, copy the
> file to a folder on the network, assign access rights, and a page shows
> an index of all the files. So far, so good.
>
> What I want to do now is build a way to index and search those
> standards, with search fields that look for occurrences inside the PDF
> files. I want to do something (if it is possible) similar to what
> Copernic Desktop Search or Google Desktop Search do: when you run a
> search, the software gives you all of the occurrences in all of the
> files included in the search. For me, that would be an enormous help.
> I tried searching the manual, but I could not reach a conclusion.
> I think I did not search properly! :-(
>
> Ah, by the way, I am using PHP+MySQL for that content manager. ;) And
> I generate all my documents in PDF with good-quality character
> recognition (OCR) (in Acrobat I can search them normally!), and I also
> define several metadata fields (tags) in the properties of the
> generated documents.
>
>
> Best regards, and I hope I can count on your help.
>
>
> P.S.: My English is very poor, so I apologize! I used an electronic
> translator as an aid. :(
>
> Thanks...
>
> --
> Guto. <[EMAIL PROTECTED]>
>
> "Going to print this email? First think about your responsibility for
> preserving the environment and reducing your costs."
>


-- 
Like Music?
http://l-i-e.com/artists.htm




Re: [PHP] I need to Index and to research files PDF

2006-05-02 Thread Jochem Maas

Carlos Augusto Falcão da Silva wrote:

> Good afternoon!!!


please don't post your question more than once - if anyone can help you
you will get an answer.

2 things I would suggest you look at:

1. http://php.net/manual/es/ref.mnogosearch.php - mnogosearch is capable of
indexing PDF content - I have never used it but others on this list have, if you
get stuck with something specific just come back :-)

2. your company might consider the following as a worthwhile investment
(given that implementing a custom solution is not without cost):

http://www.google.com/enterprise/

I have never used the device but a nice blue 1U server with a big Google logo
on it looks pretty cool if nothing else ;-)







> P.S.: My English is very poor, so I apologize! I used an electronic
> translator as an aid. :(


well looks like you made a big effort - that's always welcome around here :-)

you might also try looking for a Spanish-speaking mailing list - but I have no
idea if there even is one :-/
