I have a large text file containing about 6200 records of bibliographic information. I need each record parsed into 26 fields and delivered as a CSV file. (Some of these fields have multiple subfields, so the total number of columns will be larger than 26.) I’ve attached two documents. The first is a detailed description of the fields and the start and end tags that identify them in the text file. The second is a 20-record subset of the data to give you a sense of its structure.
The deliverables are:
1) A CSV file with the parsed data. Each record should occupy one “line” – i.e., if I import it into Microsoft Excel, there should be one (and only one) row for each record.
2) An executable program that I can run on other text files in this format.
3) The well-commented source code for this program.
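
To illustrate the kind of program I have in mind, here is a rough Python sketch of tag-based parsing into one CSV row per record. It is only an outline under stated assumptions: the tags (<TI>, <AU>), the record marker (<REC>), and the function names are made up for this example, and the real tags are the ones in the attached field description. It also assumes that repeated values can be joined with "; " in a single cell, which may not be the right way to handle subfields.

```python
import csv
import io
import re

# HYPOTHETICAL tags for illustration only; the real start/end tags are
# listed in the attached field description.
FIELD_TAGS = {
    "title": ("<TI>", "</TI>"),
    "author": ("<AU>", "</AU>"),
}
# HYPOTHETICAL marker assumed to begin each record.
RECORD_MARKER = "<REC>"


def parse_record(text):
    """Extract every tagged field from one record into a dict."""
    row = {}
    for name, (start, end) in FIELD_TAGS.items():
        pattern = re.escape(start) + r"(.*?)" + re.escape(end)
        values = re.findall(pattern, text, flags=re.DOTALL)
        # Collapse line breaks and runs of whitespace so that each record
        # stays on a single spreadsheet row.
        cleaned = [" ".join(value.split()) for value in values]
        # Assumption: repeated values are joined into one cell.
        row[name] = "; ".join(cleaned)
    return row


def records_to_csv(raw_text, out_file):
    """Write one CSV row per record and return the number of records."""
    writer = csv.DictWriter(out_file, fieldnames=list(FIELD_TAGS))
    writer.writeheader()
    count = 0
    for chunk in raw_text.split(RECORD_MARKER):
        if not chunk.strip():
            continue  # skip the empty piece before the first marker
        writer.writerow(parse_record(chunk))
        count += 1
    return count


# Small made-up sample; a title deliberately spans two lines.
sample = (
    "<REC><TI>A Study of\nTagged Text</TI><AU>Smith, J.</AU><AU>Lee, K.</AU>"
    "<REC><TI>Second Title</TI><AU>Doe, A.</AU>"
)
buffer = io.StringIO()
record_count = records_to_csv(sample, buffer)
print(buffer.getvalue())
```

The csv module quotes any value that contains a comma, so names such as "Smith, J." should stay in one cell when the file is opened in Excel. When writing to a real file, open it with newline="" as the csv documentation recommends.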