Resource Name
PDFX (RRID:SCR_006163)
Resource Information

URL: http://pdfx.cs.man.ac.uk

Proper Citation: PDFX (RRID:SCR_006163)

Description: A fully-automated PDF-to-XML converter service for scientific articles. It takes a full-text PDF article as input and outputs the hierarchy of its distinct logical elements in an XML format.

The elements that PDFX can currently extract are:
* Front Matter
** title, abstract, author, author footnote
* Body Matter
** body text, h1, h2, h3, image, table, figure/table caption, figure/table reference, bibliographic item, bibliographic reference (citation)
* Extras
** header, footer, side note, page number, email, URI

Note: This system has been designed for processing scientific articles. While virtually any PDF file is acceptable input, quality of the processing output might be degraded e.g. for entire books, slide presentations or spreadsheet/strictly tabular data.

There are two ways in which you can use PDFX:
* via a web browser
* via any other HTTP client, such as the curl command-line tool
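
The HTTP-client route mentioned in the description could look roughly like the sketch below. It is not taken from the PDFX documentation. It assumes the service accepts a POST to the URL listed in this record with the raw PDF bytes as the request body and a Content-Type of application/pdf, and that it returns the XML in the response body. The function names build_pdfx_request and convert_pdf are hypothetical, and the service may no longer be reachable at this address.

```python
import urllib.request

# Assumption: the URL from this record is also the upload endpoint.
# The exact endpoint and accepted headers are not stated in this entry.
PDFX_URL = "http://pdfx.cs.man.ac.uk"


def build_pdfx_request(pdf_bytes: bytes, url: str = PDFX_URL) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying a PDF as its body."""
    # Every PDF file starts with the "%PDF-" magic bytes; reject anything else early.
    if not pdf_bytes.startswith(b"%PDF-"):
        raise ValueError("input does not look like a PDF file")
    return urllib.request.Request(
        url,
        data=pdf_bytes,
        headers={"Content-Type": "application/pdf"},
        method="POST",
    )


def convert_pdf(path: str, url: str = PDFX_URL, timeout: float = 60.0) -> bytes:
    """Send the PDF at `path` and return the raw response body (assumed to be XML)."""
    with open(path, "rb") as handle:
        request = build_pdfx_request(handle.read(), url)
    # urlopen follows redirects by default and raises on HTTP error statuses.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()
```

Calling convert_pdf("article.pdf") would then return the response bytes, which could be written to a .xml file. Building the request separately from sending it keeps the upload format inspectable without contacting the service.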

Abbreviations: PDFX

Resource Type: text extraction software, software application, production service resource, software resource, service resource

Keywords: semantic mark up, text extraction, pdf, xml, html

Listed by: FORCE11

Parent organization: Utopia Docs

Ratings and Alerts

No rating or validation information has been found for PDFX.

No alerts have been found for PDFX.

Data and Source Information