Data Democratisation Part 1

Abhinav Balakumar & Sanjanaa Jeevandass

Apr 25, 2022

Share This Post

Image reference: / source

WHY SCIENCE IS PROTESTING FOR DATA DEMOCRACY

In a rapidly transforming world where data is termed the new oil, we are surrounded by conversations about why data is important, the wonders we unearth each day with data and how it is transforming and invading our lives. Perhaps the oil analogy is not quite right – we do transform data as we do oil – but data is not a fixed resource and will not run out – it will continue to grow and develop over time – making it more valuable than oil.

This continuous overpowering creation of information, insights and knowledge is often characterized as uncontrolled, confounding and largely under-utilized. Global industries are now going through a revolution in digitization, digital technologies, automation and AI among other “buzz-word” worthy concepts. However, the foundation of any of or all these technologies is…… Data.

The life sciences industry has often been criticized for playing catch-up with other industries such as FINTECH etc. While other industries are becoming data and analytics-ready, and even data-driven in some cases – the life sciences industry has yet to realise the true potential of the data stockpile they are sitting on.

MORE DATA, MORE PROBLEMS:

Data generation has never been a problem in the life sciences industry. Labs have been producing enormous amounts of data for many years and it is only increasing thanks to recent technological advances in multi-omics and image-based analytical techniques.

Our data requirements have always been intensive and are only getting more complicated thanks to such fields as omics (genomics, transcriptomics, proteomics, metabolomics etc.), biomarker analysis, and clinical statistics to name a few.

The time spent and complexity of managing data is becoming increasingly cumbersome compared to our data generation capacity. Scientists are often faced with insurmountable challenges in accessing, cleaning and managing data, rather than spending their time on analysing and inferring from it.

The most common problems faced by scientists are:

Availability:

The availability and accessibility of data for business and exploratory needs are perhaps the foremost problems faced by scientists working in data-siloed organizations. Data is often stored in native or custom formats specific to each instrument or scientific application niche to a business process. Scientists often hit roadblocks when trying to search for related data from their processes or running behind “data owners” to get access to relevant data.

Lack of Data Standards:

Data exchange is often seen as necessary for driving innovation between organizations and within organizations, but it’s always impeded by the lack of data standards among instruments, labs, business units, and entire organizations. The need for data standards was clearly understood during the Ebola outbreak of 2014 when data sharing helped scientists trace the origins of the virus and control the endemic. The need is reiterated in the current COVID-19 pandemic, with the WHO defining data sharing and reporting protocols. In spite of defining a protocol, the COVID-19 pandemic has clearly highlighted how the lack of data sharing standards in real-world scenarios exposes the shortcomings of any data infrastructure, and in turn the health infrastructure.

Ownership and usage:

Most organizations do not have clearly defined owners for their data, and often the IT teams that manage the applications end up as de facto owners. This leads to scientists and research teams being dependent on IT teams to access and analyse data.

There’s also the unwanted evolution of the IT team into a common data analytics team, performing ETL operations on behalf of the data users.

Lack of Self-service Analytical tools or knowledge leads to dependency on data scientists to perform ETL and create inferences without complete knowledge about the data process. Organizations often end up having to hire generalist data scientists who work across business functions without the scientific focus required for drug discovery research.

WHATS NEXT?

Stay tuned to know more about the questions that can be used to decide if you need to your democratise data in our second blog in this series

To find out more about how Zifo can help with your data management needs please email us directly at info@zifornd.com