Proper Citation: PDFX (RRID:SCR_006163)
Description: A fully-automated PDF-to-XML converter service for scientific articles. It takes a full-text PDF article as input and outputs the hierarchy of its distinct logical elements in an XML format. The elements that PDFX can currently extract are: * Front Matter ** title, abstract, author, author footnote * Body Matter ** body text, h1, h2, h3, image, table, figure/table caption, figure/table reference, bibliographic item, bibliographic reference (citation) * Extras ** header, footer, side note, page number, email, URI Note: This system has been designed for processing scientific articles. While virtually any PDF file is acceptable input, quality of the processing output might be degraded e.g. for entire books, slide presentations or spreadsheet/strictly tabular data. There are two ways in which you can use PDFX: * via a web browser * via any other HTTP client, such as the curl command-line tool
Abbreviations: PDFX
Resource Type: text extraction software, software application, production service resource, software resource, service resource
Keywords: semantic mark up, text extraction, pdf, xml, html
Expand Allis listed by |
|
has parent organization |
We found {{ ctrl2.mentions.total_count }} mentions in open access literature.
We have not found any literature mentions for this resource.
We are searching literature mentions for this resource.
Most recent articles:
{{ mention._source.dc.creators[0].familyName }} {{ mention._source.dc.creators[0].initials }}, et al. ({{ mention._source.dc.publicationYear }}) {{ mention._source.dc.title }} {{ mention._source.dc.publishers[0].name }}, {{ mention._source.dc.publishers[0].volume }}({{ mention._source.dc.publishers[0].issue }}), {{ mention._source.dc.publishers[0].pagination }}. (PMID:{{ mention._id.replace('PMID:', '') }})
A list of researchers who have used the resource and an author search tool
A list of researchers who have used the resource and an author search tool. This is available for resources that have literature mentions.
No rating or validation information has been found for PDFX.
No alerts have been found for PDFX.
Source: SciCrunch Registry