It's probably easier to convert it to a tab-delimited text file encoded in UTF-8. That way you don't have to deal with XML. If I had to do the job myself, I'd use Python rather than awk, but that's just my personal preference :-)
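
To make the tab-delimited idea concrete, here is a rough Python sketch using only the standard library. It is not tested against the actual saved pages: the class names "sentence" and "translation" are placeholders I made up, and the helper names (PairExtractor, extract_pairs, to_tsv, convert) are just for illustration, so you would have to look at the HTML and adjust them. It also ignores the mp3 files entirely.

```python
from html.parser import HTMLParser

# Hypothetical class names: the real saved pages will use something else.
QUESTION_CLASS = "sentence"
ANSWER_CLASS = "translation"

# Tags that never get a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


class PairExtractor(HTMLParser):
    """Collect (question, answer) text pairs from elements with the classes above."""

    def __init__(self):
        super().__init__()
        self.pairs = []        # finished (question, answer) tuples
        self._field = None     # "q" or "a" while inside a matching element
        self._depth = 0        # nesting depth inside the current element
        self._text = []        # text fragments collected for the current element
        self._question = None  # question still waiting for its answer

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self._field is not None:
            self._depth += 1
            return
        classes = (dict(attrs).get("class") or "").split()
        if QUESTION_CLASS in classes:
            self._field = "q"
        elif ANSWER_CLASS in classes:
            self._field = "a"
        if self._field is not None:
            self._depth, self._text = 1, []

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or self._field is None:
            return
        self._depth -= 1
        if self._depth == 0:
            # Collapse whitespace so no tab or newline can end up inside a field.
            text = " ".join("".join(self._text).split())
            if self._field == "q":
                self._question = text
            elif self._question is not None:
                self.pairs.append((self._question, text))
                self._question = None
            self._field = None

    def handle_data(self, data):
        if self._field is not None:
            self._text.append(data)


def extract_pairs(html):
    """Return the (question, answer) pairs found in one HTML document."""
    parser = PairExtractor()
    parser.feed(html)
    parser.close()
    return parser.pairs


def to_tsv(pairs):
    """Format pairs as tab-delimited text, one card per line."""
    return "".join(f"{question}\t{answer}\n" for question, answer in pairs)


def convert(paths, out_path):
    """Read the given HTML files and write all cards to one UTF-8 text file."""
    with open(out_path, "w", encoding="utf-8") as out:
        for path in paths:
            with open(path, encoding="utf-8") as page:
                out.write(to_tsv(extract_pairs(page.read())))
```

Something like convert(["page1.html", "page2.html"], "cards.txt") would then give one question/answer pair per line, separated by a tab. The code assumes the saved pages are themselves UTF-8; if they are not, the encoding passed to open() for reading needs to change.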

Peter

On Oct 12, 8:53 pm, daveoily <[email protected]> wrote:
> Hi all, I'm studying Japanese, if you are too, you might have heard of
> smartfm, they were brilliant, but then decided to make people pay for
> it, it's fair enough I suppose, but I haven't the money. So I spent a
> few hours in the days before it went to being a paysite downloading
> all the stuff I could, example sentences and the pages with
> information about the translations and pronunciation.
>
> It occurs to me that I could make mnemosyne cards from this bunch of
> information, but doing it manually would take me an age, eating into
> precious study time. I'm currently working on a whole stack of other
> cards to be uploaded upon completion anyway.
>
> The way I see it, is if I can strip the pertinent information from the
> html and put it into the right format for a mnemosyne xml file, I
> could automate the process to such an extent that it would take
> seconds to create the cards, IF and it is a big if, I knew how to do
> it!
>
> I've had a brush with AWK before, and I think it might be the right
> tool for such a job, but I'm no expert to put it mildly, and would
> really appreciate some help with this one if some knowledgeable soul
> could see how to do it.
>
> There's hundreds, perhaps thousands of example sentences in mp3
> format, they're the files named JS******.mp3, and also words (I'm not
> sure if I got them all, but they may well be there named JW******.mp3)
>
> All the files I have bundled up and put on mediafire in a file called
> sfm.rar
>
> http://www.mediafire.com/?xdbiu55a71ucjb2
>
> If anyone has any pointers, I'd love to hear. Otherwise, I might be
> quite some time turning what could be a great learning resource into
> something usable.

--
You received this message because you are subscribed to the Google Groups "mnemosyne-proj-users" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to [email protected].
For more options, visit this group at http://groups.google.com/group/mnemosyne-proj-users?hl=en.
