The introduction of XML exports for the publication of image datasets has resulted in a range of different approaches to XML data export formats. How can this variety of datasets be used in Libraryhack?
For example, the Tasmanian data is provided in the form of the PictureAustralia schema details. New South Wales data is provided in the form of Excel spreadsheets which will make it easier for developers to access. Victoria has provided data in the form of individual XML files for each image record.
Here is an XML “cleaner” which allows you to test XML data by stripping out the XML codes.
Scott Lewis at the unconference outlined some ways in which XML exports could be processed.