Why Subscribe?✅ Curated by Tommy Tang, a Director of Bioinformatics with 100K+ followers across LinkedIn, X, and YouTube✅ No fluff—just deep insights and working code examples✅ Trusted by grad students, postdocs, and biotech professionals✅ 100% free
Hello Bioinformatics lovers, What's the most undervalued skill in computational biology? It is EDA (Exploratory Data Analysis) What is EDA? According to wiki: In statistics, exploratory data analysis (EDA) is an approach of analyzing data sets to summarize their main characteristics, often using statistical graphics and other data visualization methods In plain words, it is using summary statistics and visualization (figures) to understand your data BEFORE you do any data analysis. Why is it important?
I termed it "data intuition". Through EDA and looking at the data with your eyes (by printing out some rows), you should find the irregularity of your data. This means that you should have some intuition when you feel something is wrong during the data analysis. The reality is: If you feel something is off, there is usually something wrong with your code or your understanding. Instead of blindly following any tutorial or executing any command, make sure it makes sense by taking a second thought of the output. Let me explain more. If you have sequenced the female samples, and you find variant calls (mutations) on the Y chromosome, you know something is wrong. Do not blindly follow any tutorial, make sure you understand the details. That's all for it today. Happy Learning! Tommy aka. Crazyhottommy PS: If you want to learn Bioinformatics, there are four ways that I can help:
Stay awesome! |
Why Subscribe?✅ Curated by Tommy Tang, a Director of Bioinformatics with 100K+ followers across LinkedIn, X, and YouTube✅ No fluff—just deep insights and working code examples✅ Trusted by grad students, postdocs, and biotech professionals✅ 100% free