A collection of documentation highlighting quirks around the file formatting for citation and reference managers.
There are many software solutions and tools that allow you to manage your scientific work. Often times, you need to use multiple solutions to cover your whole workflow. This is where things get interesting but also confusing. Some tools just do not work well with others.
In this open repository, we have tried to simplify the process of setting up a scientific workflow for research. We have described the interrelations between popular citation and reference managers, search engines and systematic review software. Additionally, we have gathered relevant datasets to show capabilities of each tool more specifically.
This project is meant to serve as a guide in the field of scientific research, namely dealing with systematic reviews and related software solutions. Or you can check it out for inspiration and learning about said software!
Below, most commonly used file formats in dealing with systematic review are described.
A comma-separated values (CSV) file is a delimited text file that uses a comma to separate values. Each line of the file is a data record. Each record consists of one or more fields, separated by commas. The use of the comma as a field separator is the source of the name for this file format. A CSV file typically stores tabular data (numbers and text) in plain text, in which case each line will have the same number of fields.
The CSV file format is not fully standardized.
Learn more about CSV at Wikipedia.
RIS is a standardized tag format developed by Research Information Systems, Incorporated (the format name refers to the company) to enable citation programs to exchange data. The RIS file format — two letters, two spaces and a hyphen — is a tagged format for expressing bibliographic citations. According to the specifications, the lines must end with the ASCII carriage return and line feed characters. Multiple citation records can be present in a single RIS file. A record ends with an "end record" tag ER - with no additional blank lines between records. We also included a Pubmed RIS file, which is a Pubmed format that has a high resemblance with RIS files.
The RIS file format is standardized but sometimes improperly implemented.
Learn more about RIS at Wikipedia.
A tab-separated values (TSV) file is a simple text format for storing data in a tabular structure, e.g., a database table or spreadsheet data, and a way of exchanging information between databases. Each record in the table is one line of the text file. Each field value of a record is separated from the next by a tab character. The TSV format is thus a type of the more general delimiter-separated values format. TSV is an alternative to the common comma-separated values (CSV) format, which often causes difficulties because of the need to escape commas – literal commas are very common in text data, but literal tab stops are infrequent in running text. The IANA standard for TSV achieves simplicity by simply disallowing tabs within fields.
The TSV file format is not fully standardized.
Learn more about TSV at Wikipedia.
The Office Open XML Workbook (XLSX) file is part of a set of file formats that can be used to represent electronic office documents. While this specific format is dedicated to spreadsheets, there are other formats dedicated for word processing documents, presentations as well as specific formats for material such as mathematical formulae, graphics, bibliographies etc.
The XLSX file format is standardized.
Learn more about XLSX at Wikipedia.
Below, most commonly used citation and reference managers are listed.
Below, most commonly used search engines are listed.
- CINAHL Database
- Cochrane
- Embase
- Google Scholar
- Google Scholar Button for Chrome
- ERIC Ovid®
- MDPI
- OpenAlex
- Psycinfo Ovid®
- Pubmed
- ResearchGate
- Scopus
- Web of Science
Below, most commonly used search engines are listed.
Details are described in the contributing guide.