Agreed.

Consider keeping meta data about the book in your mysql database, but storing 
and serving the actual files from somewhere else.

If I were you, I'd use an external full text search engine like Sphinx or 
Lucene to handle something like searching for content inside the book.

Also, in terms of requirements, 300k books doesn't say a lot.  Looking at 
project Gutenberg, I see that an uncompressed text copy of Sherlock Holmes is 
only 500k, so you're talking about maybe 150G of data -- which is pretty 
moderate.

Sounds like a fun project though, good luck!

Regards,
Gavin Towey


-----Original Message-----
From: Peter Chacko [mailto:peterchack...@gmail.com]
Sent: Thursday, June 10, 2010 9:05 PM
To: SHAWN L.GREEN
Cc: Andy; mysql@lists.mysql.com
Subject: Re: MySQL For Huge Collections

Usually, you better use a NAS for such purpose. Database is designed
to store highly transactional, record oriented storage that needs fast
access... You can look for any Enterprise content management systems
that rest its storage on a scalable NAS, with file virtualization in
the long run.

thanks

On Fri, Jun 11, 2010 at 8:04 AM, SHAWN L.GREEN <shawn.l.gr...@oracle.com> wrote:
> On 6/10/2010 10:16 PM, Andy wrote:
>>
>> Hello all,
>>
>> I am new to MySQL and am exploring the possibility of using it for my
>> work.
>> I have about ~300,000 e-books, each about 100 pages long. I am first going
>> to extract each chapter from each e-book and then basically store an
>> e-book
>> as a collection of chapters. A chapter could of course be arbitrarily long
>> depending on the book.
>>
>> My questions are:
>>
>> (1) Can MySQL handle data of this size?
>> (2) How can I store text (contents) of each chapter? What data type will
>> be
>> appropriate? longtext?
>> (3) I only envision running queries to extract a specific chapter from a
>> specific e-book (say extract the chapter titled "ABC" from e-book number
>> XYZ
>> (or e-book titled "XYZ")). Can MySQL handle these types of queries well on
>> data of this size?
>> (4) What are the benefits/drawbacks of using MySQL compared to using XML
>> databases?
>>
>> I look forward to help on this topic. Many thanks in advance.
>> Andy
>>
>
> Always pick the right tool for the job.
>
> MySQL may not be the best tool for serving up eBook contents. However if you
> want to index and locate contents based on various parameters, then it may
> be a good fit for the purpose.
>
> Your simple queries would best be handled by a basic web server or FTP
> server because you seem to want
>
> http://your.site.here/ABC/xyz
>
> where ABC is your book and xyz is your chapter.
>
> Those types of technology are VERY well suited for managing the repetitive
> streaming and distribution of large binary objects (chapter files) like you
> might encounter with an eBook content delivery system.
>
> --
> Shawn Green
> MySQL Principle Technical Support Engineer
> Oracle USA, Inc.
> Office: Blountville, TN
>
> --
> MySQL General Mailing List
> For list archives: http://lists.mysql.com/mysql
> To unsubscribe:
>  http://lists.mysql.com/mysql?unsub=peterchack...@gmail.com
>
>

--
MySQL General Mailing List
For list archives: http://lists.mysql.com/mysql
To unsubscribe:    http://lists.mysql.com/mysql?unsub=gto...@ffn.com


This message contains confidential information and is intended only for the 
individual named.  If you are not the named addressee, you are notified that 
reviewing, disseminating, disclosing, copying or distributing this e-mail is 
strictly prohibited.  Please notify the sender immediately by e-mail if you 
have received this e-mail by mistake and delete this e-mail from your system. 
E-mail transmission cannot be guaranteed to be secure or error-free as 
information could be intercepted, corrupted, lost, destroyed, arrive late or 
incomplete, or contain viruses. The sender therefore does not accept liability 
for any loss or damage caused by viruses or errors or omissions in the contents 
of this message, which arise as a result of e-mail transmission. [FriendFinder 
Networks, Inc., 220 Humbolt court, Sunnyvale, CA 94089, USA, FriendFinder.com

--
MySQL General Mailing List
For list archives: http://lists.mysql.com/mysql
To unsubscribe:    http://lists.mysql.com/mysql?unsub=arch...@jab.org

Reply via email to