MEDLINE is a bibliographic database that contains more than 22 million references to journal articles in life sciences with special emphasis on biomedicine, chemical sciences, and bioengineering. This database was instituted in the 1960, but today contains information back to even 1946.

Contributions to this database are invited and accepted from all over the world, just judging by the degree of highest quality and usefullness. Currently, citations from more than 5,000 journals globally are included, published in some 40 languages.

To a large degree, these are scholarly journals – about 40% of the included citations refer to publications in the US, and more than 90% are published in English.

The content of the database is selected with having the intended audience in mind, persons active or interested in the biomedical field as researchers, students, educators, and practitioners, just to name a few.

In an ever growing number, citations from both MEDLINE and the related database PubMed have links to full text articles which are contained in PubMedCentral (PMC).


Quick comprehension

It is of course impressive to have such a huge amount of data at your hands, however, this sheer amount might overextend the reader. That is where Chemanalyser comes in with an intelligent set of annotation tools that support the reader and prevent cognitive overload:

  • It uses background knowledge, which has been pre-processed and stored in the form of ontologies or taxonomies, to understand, annotate, and extract knowledge rather than just words from text documents.
  • These annotation dictionaries may contain more than 100 million possible terms, mapping each term to a respective concept number for later annotation. This guarantees highest speed  look-up and annotation of texts.
  • Specially programmed readers perform a format abstraction; so the system is able to easily handle a multitude of different file formats, e.g. HTML, XML, PDF, Excel and other file types.
  • Special routines perform detection and resolution of abbreviations and acronyms.
  • Other routines are specialized on recognizing chemical name entities, considering the diverse possible notations of chemical terms.
  • Relevance values are calculated and indicated for the retrieval results, which can also individually be adjusted.
  • A Chemical Formula Annotator recognizes chemical formulas in different notations, and adds the respective SMILES (Simplified Molecular Input Line Entry Specification) string to the annotation for more standardized reference.


In summary

This toolset is able to significantly increase your pleasure and efficiency in reading chemical texts. It assists you in an excellent way to search documents with concepts rather than with just words. This is chemical text mining at its best, and all this with an unrivaled speed.

You have to see it to believe it.


PubMed, an extension of MEDLINE

Since 1996 the database PubMed has been available. It contains 25 million references, whereof 22 million are the records included in MEDLINE. So PubMed basically is a slighty extended Superset of MEDLINE.

The citations which are added in PubMed, mainly include:

  • References to selected additional life sciences journals whose full text is included in PMC® (PubMed Central®).
  • Citations to articles that did not pass the strict acceptance criteria due to their topic being slightly out of focus but that seemed close enough to the overall scope of biomedicine to be included here.
  • Citations to articles of authors whose research has been publicly funded.
  • References to nearly all electronically available books.
  • Citations for nearly all books and documents electronically available from the bookshelf of the National Center for Biotechnology Information (NCBI).