Data Sharing Statement Extraction and Classification in Webometric Analyst

Webometric Analyst will help you to convert a set of PDF academic papers into plain text format, and extract data sharing statements from them (in text or XML) and then classify the content of the statements by (a) what (all or some of the data) (b) where the data is shared (repository, via the authors) and (c) why the data is shared in some way.

PDF processing instructions

Download Webometric Analyst with the menu link above

Download the command line tools from here: http://www.xpdfreader.com/download.html

Unzip the command line tools file.

Start Webometric Analyst and select menu item Services > Data sharing > Convert PDF to text, extract and classify data sharing statements.

PubMed Central Open Access XML collection instructions

Download the PMC Open Access collection and unzip it into a folder with subfolders for each journal.

Download Webometric Analyst with the menu link above

Start Webometric Analyst and select menu item Citations > PMC Full Text > Data availability statements - extract from PMC XML (may take hours)

Select Webometric Analyst menu item Citations > PMC Full Text > Data availability statement classification

* These features were funded by Jisc.