On 2 November 2012 10:40, Sacha Rook <sachar...@gmail.com> wrote: > > I have a problem with a csv file from a supplier, so they export data to csv > however the last column in the record is a description which is marked up > with html. > > trying to automate the processing of this csv to upload elsewhere in a > useable format. If i open the csv with csved it looks like all the records > aren't escaped correctly as after a while i find html tags and text on the > next line/record. > > If I 'openwith' excel the description stays on the correct line/record? > > I want to use python to read these records in and output a valid csv with > the descriptions intact preferably without the html tags so a string of text > formatted with newline/CR where appropriate. > > So far I have this but don't know where to go from here can someone help me? > > import csv > > infile = open('c:\data\input.csv', 'rb') > outfile = open('c:\data\output.csv', 'wb') > > reader = csv.reader(infile) > writer = csv.writer(outfile) > > > for line in reader: > print line > writer.writerow(line) >
You already have a program. Does it work? If not, then what's wrong with the output? If you get an error message can you please show the exact error message? > I have attached the input.csv i hope this is good form here? > > I know I am far from complete but don't know how to proceed :-) It's okay to send attachments when there is a need to. It would be good though to cut the csv file down to a smaller size before posting it here. That's 4 MB wasting space in a lot of inboxes. Better yet, you could copy the first three lines directly into the email so that people can see it without needing to download the attachment. Oscar _______________________________________________ Tutor maillist - Tutor@python.org To unsubscribe or change subscription options: http://mail.python.org/mailman/listinfo/tutor