I am a bioinformatician/computational biologist with six years of wet lab experience and over 12 years of computation experience. I will help you to learn computational skills to tame astronomical data and derive insights. Check out the resources I offer below and sign up for my newsletter!
Hello, Bioinformatics lovers and new subscribers, Tommy here. It is March, can you believe it? Boston had heavy snow last week and we finally got a little warmer. The snow is finally melting! Spring is near, I guess. It is my pleasure to distribute my bioinformatics knowledge through this newsletter. btw, I am adding video tutorials replicating the genomics paper in this playlist. Remember 1% better a day, that's a lot of progress in a year! If you find it helpful, kindly forward it to your friends. Today, we will talk about how to choose Bioinformatics tools and documentation. Why Most Bioinformatics Tools Fail Users Before They Even Start I’ve looked at hundreds of bioinformatics GitHub repos. Here’s the harsh truth: most tools fail not because of poor algorithms, but because of bad documentation and usability issues. And here’s the kicker—many of these are published in high-impact journals. But publication ≠ usability. Let’s talk about how to write better documentation AND choose better tools so you don’t waste time. 1. Can You Even Install It?The first test of any bioinformatics tool: installation. If I need to wrestle with outdated dependencies, cryptic errors, or a broken ✅ Good sign: Works in a clean Conda/Docker setup Your tool might be amazing, but it's dead on arrival if users can’t even install it. 2. Documentation: What Users NeedDevelopers often write: What users want: Good docs solve problems, not just describe features. Example: Instead of saying “outputs a normalized matrix”, say:
Input: Raw count matrix (genes × cells, CSV format) Output: Log2-transformed, batch-corrected counts Use case: Preparing scRNA-seq for clustering 3. Real-World Tool Selection: Do the Docs Pass the Test?I recently found a “revolutionary” scRNA-seq tool. But… I had to abandon it. Great science, but useless in practice. 4. Pro Tip: Include Example Data!Nothing helps users more than: If a tool makes me guess what the input should look like, I move on. Clear example data = instant usability boost. 5. Does It Play Nice With Others?A tool might be well-documented, but if it doesn’t integrate with existing workflows, it’s a problem. ✔️ Does it work with Seurat, Scanpy, or Nextflow? If I need to write 500 lines of glue code just to use it, it’s not worth it. 6. Controversial Take: High-Impact Papers Often Mean High-Maintenance ToolsSome of the worst documented and hardest-to-install tools come from papers in Nature Methods. Why? 🔹 The authors moved on to new projects Meanwhile, well-maintained GitHub projects (often from industry or long-term academic labs) are safer bets. 7. Before Releasing a Tool: Do This One TestAsk a colleague who’s never seen your tool to install and run it. Watch silently. I guarantee you’ll find gaps you never noticed. 8. Key TakeawaysGood tools install easily and integrate well Next time you build a bioinformatics tool—or need to choose one—remember: usability beats novelty. Do you have a favorite (or most frustrating) bioinformatics tool? Reply and let me know! Other posts from the past week you may find helpful
Happy Learning! Tommy PS: If you want to learn Bioinformatics, there are four ways that I can help:
Stay awesome! |
I am a bioinformatician/computational biologist with six years of wet lab experience and over 12 years of computation experience. I will help you to learn computational skills to tame astronomical data and derive insights. Check out the resources I offer below and sign up for my newsletter!