Sophia Genetics’ new technology for genomic data privacy means secure democratisation of Data-Driven Medicine worldwide

Published: 7-Dec-2016

Sophia Genetics raises the bar in secure high speed genomic data flows worldwide with new privacy technology for the compression, storage, and access to genomic data which paves the way to a new industry standard


Sophia Genetics, global leader in Data-Driven Medicine, has unveiled a new privacy-protecting technology for the storage and access to patients’ genomic information worldwide, paving the way to a new industry standard. The largest clinical genomics platform – 215 hospitals from 35 countries connected - and the designer of SOPHiA, the most advanced artificial intelligence for Data-Driven Medicine, Sophia Genetics decided to leverage its leadership position in genomics to ensure the secure democratisation of Data-Driven Medicine worldwide.

Developed with genomic data privacy and security experts from the Swiss Federal Institute of Technology Lausanne (EPFL), and biomedical researchers from Stanford University, the new technology unveiled today makes sure the privacy of genomic information is not compromised as patients’ data go through various processing steps, from compression to storage and finally access by healthcare institutions. A true revolution both for patients and the healthcare community, the new privacy-preserving technology paves the way to a new industry standard and has been highlighted in Genome Research, an international, peer-reviewed genome sciences journal, in a publication entitled A Privacy-Preserving Solution for Compressed Storage and Selective Retrieval of Genomic Data.

Adam Molyneaux, Chief Information Officer at Sophia Genetics declares “Privacy has become an increasingly serious concern given both the highly personal nature of genomic information, and the increasing number of requests for access to patients’ genomic information generated by healthcare institutions all along patients’ life. Preventing unsolicited use of personal data requires to not only encrypt data but also to define data access privileges, enabling selective retrieval of genomic data.”

Over 100 petabytes of storage are already used by the world’s largest 20 biological research institutions, which corresponds to more than $1 million USD in storage maintenance costs every month

The new genomic data privacy technology presented today also tackles another big challenge of the Data-Driven Medicine revolution: genomic data costs of storage. Adam Molyneaux explains “The data of one human genome can go from 30 GB to 200 GB. And with new high speed DNA sequencing technologies driving costs of generating genomic data down, the growth rate of genomic data has more than doubled each year since 2007, outpacing Moore’s law. Over 100 petabytes of storage are already used by the world’s largest 20 biological research institutions, which corresponds to more than $1 million USD in storage maintenance costs every month.”

Jean-Pierre Hubaux, Professor at EPFL and senior author of the publication, commented “This new technology offers an effective space-saving and privacy-preserving solution for the storage of clinical genomic data. The currently available solutions were developed before the widespread usage of high-throughput technologies and do not consider effective protection when compressing genomic sequences; the current standard saves 34% storage on average in lossless compression. Our no-compromise solution uses 18% less storage while allowing for unprecedented levels of security in genomic data storage as well as selective retrieval.”

Presenting the new solution today, Jurgi Camblong, Sophia Genetics’ CEO and co-founder, declared “As the global leader in Data-Driven Medicine, we are at the forefront of clinical genomics, and committed to developing new standards to tackle the challenges posed by the democratisation of Data-Driven Medicine. I am proud that, in partnership with EPFL, we are raising the bar in genomic data privacy and at the same time lowering the cost of storage of genomic data. This is a necessary contribution to our field for the large-scale application of personal genomics in research and clinical settings.”

Nick-named SECRAM, the new space-saving and privacy-preserving technology for the storage of clinical genomic data has been made accessible to the whole genomics community in open source.

You may also like