Generate deeper and more agile clinical evidence, thanks to the use of AI

Fill in this form carefully so that we can check if there is a match between you and us:

We will contact you even if we are not interested in your research project.

High-validity Virtual Registries
across 16 Therapeutic Areas in 14 countries

An international oncology study of Artificial Intelligence applied to electronic medical records:

This is a unique collaborative study between the Head and Neck Cancer International Group (HNCIG) and Savana.

The first of its kind for head and neck cancer study, HNC-TACTIC is a multi-language, multi-center, retrospective, real-world evidence study analyzing Electronic Medical Records (EMRs).

The study aims to describe patients with head and neck squamous cell carcinoma (HNSCC) in a real-world setting.


What we do:

  • Sometimes the simplest way can also be the best one.
  • And the best way is not a cut of a database, nor a group of preselected variables, nor a certain number of patients. It’s not that. And neither a very costly registry.
  • The best way we can imagine is to retrieve the actual complete information about what is happening at the points of care.
  • It’s having access to the complete medical records information.

Every single patient. Every single variable.
The most realistic data source possible.

Every single patient. Every single variable.
The most realistic data source possible.

  • In order to get this, you basically need:
    • A combined team of data scientists and clinicians with experience in research (in our case, lead by oncologists).
    • A system able to retrieve information from any healthcare provider (as long as they have electronic medical records -EMR-, paper doesn’t work).
    • Natural Language Processing, because 80% of the variables and outcomes are going to be in the clinical narratives’ free text.

This system is exactly what we created.

This system is exactly what we created.

And how, is in practice, getting the information through this methodology better?

  • The key is in our team helping researchers selecting which fragments of meaningful data in order to satisfy the objectives of the investigation.
  • In fact, because we had to signify how much deeper we get into data compared to anyone else, thanks to AI (true AI, not buzzword AI), we called the result of this methodology Deep Real World Evidence.
  • As you have probably suffered in the past, current databases exist to collect clinical data, but with considerable gaps due to recording limitations in the current methods.
  • Deep Real World Evidence from EMR offers a much (not a bit but a much) greater insight into the routine clinical care of patients throughout all stages of the disease.
  • Combining free text with other data sources (e.g. laboratory data, pathology, genomics, etc.), an insilico registry gets generated to describe the patient population with the defined disease, their associated clinical conditions and treatments, and develop predictive models.

Deep data layers analysis:

Deep data layers analysis:

If we do our job well, there is no need for:

  • Observational studies.
  • Traditional registries.
  • Classical disease databases.

Drug discovery: beyond EMR and into genomics.

Once we have facilitated the most difficult part, which is extracting variables from free text (clinical characteristics, comorbidities, signs and symptoms, adverse events or outcomes), we can also combine all this unstructured information with other structured data layers (genomics, transcriptomics, proteomics and imaging) which can be sourced both from our worldwide network of hospitals and from clinical trial databases.

Savana works with its premium partners in order to offer a combined proposal:

You need to know that we did this before… many times

We invested millions and years in developing a methodology by which we can infer the variables from the EMR, keeping quality and controlling bias. The consequence is a methodology which results are replicable, thus generalizable.

We collaborate with a network of 200 hospitals across Western Europe and the Americas.

And yes, we are absolutely the only ones who do this at multilingual level!

And yes, we are absolutely the only ones who do this at multilingual level!


You don’t have to. You just need to go to our peer-reviewed publications, both clinical and technical, where our methodology has been scrutinized and proven.

In our publications you will also find validations of the AI models we have created.

It depends on what you understand by more complicated. If I only need one pair of shoes, it’s easier to just manufacture it. But if you need thousands of shoes, the only way is to build a factory.

If you want to generate real world evidence about a disease or a drug, you will normally want a) very granular information b) new mathematical models in order to find new associations and hypothesis. Then, this is your method. While if you want to spend millions and years in creating a registry, this is not for you.

It really depends on how deep you want to get into the information. If you want the information in 1 month, then you’d better go for a database cut. But if you want to own a dynamic registry, navigate it, query it in search for new insights,… and you can wait some months to have this, then it’s definitely worth it.

We are just enjoying the result of years of focused investment into being the best at mining medical records for real world evidence generation purposes. There is no magic in it. All we are doing is applying state of the art AI and the scientific method to clinical research.

No. The amount of information you will get will be relatively more cost-efficient than any traditional way of doing things. By far.

Of course not. You are the only one who has the clinical question and you will need to guide our team until we are sure that they understand the exact problem you are trying to solve. Aside from that, agreements with hospitals are tough, and in our experience, what works better is to convince them by approaching them together, so our collaboration will serve to accelerate the Project.

We normalize the clinical concepts according to the SNOMED CT ontology, with variables added by Savana’s internal medical staff in those cases not covered by SNOMED CT. Mapping to OMOP is also part of the process when required.

YesSavana is compatible with other similar platforms. Other types of repositories based on structured text or free text can be complementary to the information processed by Savana.

Thanks to its technology for extracting clinical content from the free text of the electronic medical record, Savana offers doctors the opportunity to carry out research on pathologies and/or patient groups in real time and at any time, which to date has been impossible to perform.

Savana facilitates massive and very fast extraction of clinical variables found in the free text of EMRs, which replaces the current work of manually reviewing chart by chart. Structured data like pharmacy, laboratory or genomics can be extracted and added to the database.

Clinical documents: being the company which has processed the biggest number of documents of this type worldwide; allowing our algorithms to currently be among the most trained for this purpose. Savana has been implemented in +200 sites across 16 countries for years and its use has generated abundant scientific publications, answering questions in multiple therapeutic areas.

YesUsing natural language processing techniques, the clinical variables are extracted from the free text of the EMR. We also integrate and process other databases such as Hospital Pharmacy, Oncology Pharmacy, Laboratory, Radiology text, etc.

Savana is compatible with all EMR systems, regardless of format and source. Our technology is vendor agnostic. The only limitation is that the documents are text and not images. The preferred document formats for the information extracted from EMR are CSV, JSON, XML and DB, being compatible with other data exchange formats that we will assess previously.

There is no extraordinary requirement beyond the usual ones for a healthcare provider IT (internet connection, usual operating systems, etc).

It enables the export of all the data in different formats for its use by other artificial intelligence tools, or statistical tool, such as SPSS or R.

No, the hospital has the processed information at its disposal to make the use it deems appropriate.

No. Every site must opt in or out once we have a new study protocol. That way they always keep control over their data. Of course, every hospital can also suggest a study to the rest of the network.

Generate deeper and more agile clinical evidence, thanks to the use of AI

Fill in this form carefully so that we can check if there is a match between you and us:

We will contact you even if we are not interested in your research project.

Enrol in a project