Greg, I agree with Tom H. PHP can also do an excellent job of this, even as a standalone script (ie no web server interaction). And you won't have to figure out how to load the DBI module! I'll adapt an example from something else I did: (Warning, not tested code...)
#!/usr/local/bin/php <?php $filecontents = file("filename"); $numsucc = 0; $numtotal = 0; $vars = array(); foreach( $filecontents as $line) { $Qstart = "INSERT INTO dbname SET "; $COLS = ""; while (1) { $line = trim($line); if (! strlen($line)) break; // next story list($tag,$content) = preg_split("/:/", $line ); $vars[$tag] = $content; $COLS .= "$tag=$content, "; } $QUERY = $Qstart . substr($COLS,0,-2); // remove trailing comma and space $numtotal++; $result = mysql_query($QUERY); if ($result) $numsucc++; } echo "All done! $numsucc of $numtotal records successfully inserted\n"; ?> Greg Peretti wrote: > This is my first post on this list, so bear with me. > > I have a text file with 10,000 stories in it (about 130 MB). I would > like to break it up into 10,000 separate files and am trying to figure > out a way to feed it into a MYSQL database. > > I have set up the database with the pertinent fields, except the > individual stories do not necessarily follow the same pattern. > > For instance, almost all of them have: > > Header > Publication > section > edition > date > page number > headline > byline > story > > all separated by two returns, with each overall story separated by four > returns. > > Now, I can load the file using fields terminated by "\n\n" and lines > terminated by "\n\n\n\n", but if, for instance, a story does not have a > byline, I'm in trouble. It will happily put the story data in the byline > field. > > Am I out of luck, or can I specify text to look for in specific fields > as the file is being loaded, i.e. the date field is always preceded by > "Date:" and if no "Date:" is found, skip the field? > > I'm using 3.23.44 > > > -- > > Greg Peretti > web developer > www.abqjournal.com > (505) 823-3888 > > ----------------------------------- > > The web of our life is of a mingled yarn, good and ill together. > - William Shakespeare > > > > --------------------------------------------------------------------- > Before posting, please check: > http://www.mysql.com/manual.php (the manual) > http://lists.mysql.com/ (the list archive) > > To request this thread, e-mail <[EMAIL PROTECTED]> > To unsubscribe, e-mail <[EMAIL PROTECTED]> > Trouble unsubscribing? Try: http://lists.mysql.com/php/unsubscribe.php > > > -- Steve Rapaport World Citizen --------------------------------------------------------------------- Before posting, please check: http://www.mysql.com/manual.php (the manual) http://lists.mysql.com/ (the list archive) To request this thread, e-mail <[EMAIL PROTECTED]> To unsubscribe, e-mail <[EMAIL PROTECTED]> Trouble unsubscribing? Try: http://lists.mysql.com/php/unsubscribe.php