Genomic-Scale Cloud Data Analysis Pipelines with Lynn Langit

About Show #714

There is big data, and then there is genomic-scale. Richard talks to Lynn Langit about her work in large scale data analytics around the pandemic. Lynn talks about the terabytes of data involved in doing genetic analysis and the pipeline needed to move and organize that data. It takes a cloud - you need a huge amount of compute in relatively small bursts, as well as massive amounts of storage. And in the end, you get probabilistic results - nothing is ever absolutely certain! But the experiences happening today in genomic-scale research point to amazing data analytics capabilities for everyone in the future.


Lynn Langit is a cloud architect who codes. She is a Microsoft Regional Director, a Google Developer Expert, and an AWS Community Hero. Lynn specializes in big data projects. She has worked with AWS Athena, Aurora, Redshift, Kinesis, and the IoT. She has also done production work with Databricks for Apache Spark and Google Cloud Dataproc, Bigtable, BigQuery, and Cloud Spanner. She is the author of a number of LinkedIn Learning videos on these subjects. Lynn is also the co-founder of Teaching Kids Programming. She has spoken on data and cloud technologies all over the world.

Show Comments

blog comments powered by Disqus