coding posts

posts about coding

Found 4 posts

Making volcano plots in python in Google Colab

posted: June 07, 2022
dataanalysispythonplotlyGoogleColabpandasregex

Learn how to use python in an interactive online Google colab notebook to generate volcano plots to analyze your data sets.

post_preview_img

In their most basic form, experiments will generally involve comparing two conditions: a control condition vs an experimental condition where a treatment or perturbation has been applied to the system of interest. The effect of this treatment can then be read out in numerous different ways depending on what you’re interested in understanding about your system. Sometimes this can be accomplished by looking at a small number of factors such as the expression changes in a single protein or changes ...

Find all possible kmers in python

posted: January 20, 2022
pythondnasequencesrecursion

Learn how the powerful technique of recursion can simply and elegantly generate all possible sequences of a desired length.

post_preview_img

Modern DNA synthesis techniques allow for the construction of vast libraries of different DNA sequences that can be used in many high throughput applications. As an example, say you just found that a particular transcription factor binds to a promoter with a defined sequence that is 8 nucleotides long. The natural binding affinity of the factor for the sequence is good, but it’s possible that it could be even better. How could we find the optimal binding sequence for this protein? Ideally, a rat...

Finding the reverse complement in python

posted: November 16, 2021
pythonsequencingDNAbeginner

Two strands of DNA bound together? Simple right? Well turns out there's more to it especially with the funny 5' 3' prime business. Learn how to convert between the two strands in python.

post_preview_img

It is standard practice that unless otherwise stated, a nucleotide sequence is written left to right beginning with its 5’ (five-prime) phosphate end and then ending with its 3’ (three-prime) hydroxyl end where 5’ and 3’ refer to the position of these groups on the 5-sided sugar ring (ribose or deoxyribose) of the backbone. When working with single stranded oligonucleotides that’s the end of the story, but often double stranded DNA or RNA species are utilized. In those cases, there are two diffe...

Accessing UniProt via its REST API

posted: November 07, 2021
uniprotproteinsrestapipython

UniProt is a great online resource for finding a wealth of information about proteins from nearly all model organisms. Searching it manually works well, but to get to the next level of protein data mining you'll likely need to access its data programmatically, via the hidden REST API.

post_preview_img

The Universal Protein Resource more commonly referred to as just UniProt is an online resource that contains protein sequences and associated data from more than a dozen different organisms including humans. This curated database aims to be a universal resource that ties together disparate data from a wide variety of sources into a single, central location. This data includes structural information, isoform data, subcellular localization, reported variants and associated pathologies, and much mo...