Hi,
Has anybody done full-text indexing of PDF and Word documents stored in TEXT
columns?
I could find no reference to which document formats are supported by
MySQL.
 
I know that DB2, Oracle and SQL Server have restrictions on supported
document formats, so I assume that there are restrictions in MySQL as well?
 
I want to do something like the example at
http://www.mysql.com/doc/en/Fulltext_Search.html :

mysql> CREATE TABLE articles (
    ->   id INT UNSIGNED AUTO_INCREMENT NOT NULL PRIMARY KEY,
    ->   title VARCHAR(200),
    ->   body TEXT,
    ->   FULLTEXT (title,body)
    -> );
Query OK, 0 rows affected (0.00 sec)
 
mysql> INSERT INTO articles VALUES
    -> (NULL,'MySQL Tutorial', 'DBMS stands for DataBase ...'),
    -> (NULL,'How To Use MySQL Efficiently', 'After you went through a ...'),
    -> (NULL,'Optimising MySQL','In this tutorial we will show ...'),
    -> (NULL,'1001 MySQL Tricks','1. Never run mysqld as root. 2. ...'),
    -> (NULL,'MySQL vs. YourSQL', 'In the following database comparison ...'),
    -> (NULL,'MySQL Security', 'When configured properly, MySQL ...');
Query OK, 6 rows affected (0.00 sec)
Records: 6  Duplicates: 0  Warnings: 0
 
mysql> SELECT * FROM articles
    ->          WHERE MATCH (title,body) AGAINST ('database');

Only I wish to load documents (PDF, Word, XML, HTML etc.) into the TEXT
column using
    LOAD_FILE(file_name)
or similar, and be able to do full-text searches on these documents.
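
To make that concrete, here is a rough sketch of what I have in mind. The
file path and the title are made up for illustration, and I am assuming the
file sits on the server host and that my account has the FILE privilege,
which LOAD_FILE() requires. Whether the index would see anything useful in a
binary format such as PDF or Word is exactly the part I am unsure about.

```sql
-- Hypothetical sketch: '/tmp/example.html' and the title are placeholders.
-- LOAD_FILE() reads a file on the server host and returns NULL if the
-- file cannot be read (missing file, no FILE privilege, file too large).
INSERT INTO articles (title, body)
VALUES ('Some HTML document', LOAD_FILE('/tmp/example.html'));

-- The same kind of search as in the manual example, now run against
-- the loaded document content.
SELECT id, title
FROM articles
WHERE MATCH (title, body) AGAINST ('database');
```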
 
Q1: Is it possible to do full-text searches on PDF, Word, XML, HTML etc
documents stored in TEXT columns?
 
Q2: Is there a list of supported document formats for MySQL full-text
search?
 
Thanks for any help,
Jan-Erik Öhman
 
 
