You may have noticed that records from the Directory of Open Access Journals (DOAJ) found in SUNCAT have changed recently. Since the July 2015 update we have received a lot more information on each of the DOAJ titles.
SUNCAT has been downloading records from the Directory of Open Access Journals (DOAJ) since November 2008, when we received data for 3736 titles. The number of DOAJ titles has risen and fallen over the years, but now stands at one of the highest numbers ever â€“ 10551 in September 2015. DOAJ records form an important part of our database, giving our users information and access to high quality, open access, peer-reviewed journals.
The DOAJ journal metadata is downloaded in CSV format. We assign each field (or column heading) to a MARC tag and then â€œmarcifyâ€� the data using a specially written script. Before the July update we were receiving 17 different fields which we transferred into MARC. Now we are receiving 54 fields! As stated on the DOAJ website, â€œthe amount of information in the CSV will increase as each journal has their reapplication accepted to remain indexed in DOAJ. There are currently 510 journals with the new information completed.â€�
It was felt that as we receive all of this valuable data we should make this available to users of SUNCAT. At the moment, we are not using every single field, as some information is very specific to the DOAJ, such as whether the title was accepted after March 2014. This information is important for them to record as all journals that were accepted into DOAJ before March 2014 are now required to reapply.
The new and improved journal metadata we are using includes information on:
- APCs (Article Processing Charges);
- Journal article submission fees;
- Review process;
- Publishing rights.
If you take a look at their catalogue you will see how they present this information. The challenge for us was to convert this information into a MARC format.
Mapping to MARC
It has been a very interesting process to map this extra information into MARC. Here are the steps which we took.
- Looked to see if other SUNCAT Contributing Libraries have incorporated this information into their bibliographic records.
- Looked at the MARC Standards (http://www.loc.gov/marc/)
- Consulted with colleagues.
MARC does not really cater for data which is not strictly bibliographic. This is one issue which needs to be addressed in the age of electronic resources. As a result, much of the metadata is being placed in 500$a tags. This tag does not have a $u, which is normally used to indicate an URL, so we are just including this in the 500$a. We have tried to group together information to put in the same 500 tag where possible.
Example metadata for the journal Current Therapeutic Research:
APC information URL:
USD – US Dollar
500 _aThere are journal Article Processing Charges (APCs). 1200 USD – US Dollar. http://www.elsevier.com/journals/current-therapeutic-research/0011-393X/guide-for-authors
Some of the data obtained from the DOAJ can be directly inputted into a MARC tag, including the full text formats available, full text language and keywords. However, in most cases we need to add some text of explanation to the metadata, especially for URLs, or replace the metadata with text when the only metadata given is â€˜yesâ€™, â€˜noâ€™, â€˜Trueâ€™, â€˜Falseâ€™.
Journal full-text crawl permission
If there is a â€˜yesâ€™ in this field then create 500 ## $aJournal full-text crawl permitted.
URL for journalâ€™s Open Access statement:
If there are contents in this field create a 500 ## $aJournalâ€™s Open Access statement: and add the contents preceded by a space.
The resulting work on marcifying the DOAJ metadata was tricky. but ultimately very rewarding. It was wonderful to see this new metadata in SUNCAT. All the notes fields (5XX tags) can be clearly seen on the full record display of a DOAJ record, as well as in the actual MARC record.
We hope you find this improved metadata really useful. Any comments would be very welcome. Just contact the EDINA helpdesk at firstname.lastname@example.org.