Cryo-EM reaches atomic resolution in protein structure determination

The end of the era of crystallography has been long predicted also on this blog, but has it come now with the recent report of an atomic resolution protein structure solved by cryo–electron microscopy (cryo-EM)?

Continue reading →

A new Graduate students (unexperienced) guide to academic literature.

Given this is my first ever attempt at a blog post, let alone one on such a highly regarded platform I feel it’s proper that I introduce myself. Hi, my name is Maranga, I am a second-year SABS student starting my DPhil project in Small molecules, and honestly, I really don’t like reading. Especially, scientific journals. Now I can appreciate this does not bode well given my chosen career path, however, my aversion for reading is not new (shoutout to Biff, Chip and Kipper) and hopefully not permanent.

Continue reading →

3 Useful UNIX commands you might not know

nohup

The command nohup (stands for “No hang up”) allows your script to run even if you quit the terminal. It can be very useful, especially if your terminal has been opened through ssh and you have a dodgy connection. It can be used as follows:

nohup python my_script.py > log.out &

nohup will automatically append the output from your script to a file named nohup.out. By adding the > log.out part of the command you can save the output to a different file of your choice.

Continue reading →

NeurIPS 2020: Chemistry / Biology papers

Another blog post, another look at accepted papers for a major ML conference. NeurIPS joins the other major machine learning conferences (and others) in moving virtual this year, running from 6th – 12th December 2020. In a continuation of past posts (ICML 2020, NeurIPS 2019), I will highlight several of potential interest to the chem-/bio-informatics communities

The list of accepted papers can be found here, with 1,903 papers accepted out of 9,467 submissions (20% acceptance rate).

In addition to the main conference, there are several workshops highly related to the type of research undertaken in OPIG: Machine Learning in Structural Biology and Machine Learning for Molecules.

The usual caveat: given the large number of papers, these were selected either by “accident” (i.e. I stumbled across them in one way or another) or through a basic search (e.g. Ctrl+f “molecule”). If you find any I have missed, please reach out and I will update accordingly.

Continue reading →

Speaking about Sequence and Structure at a Summit

A couple of weeks ago I was lucky enough to be asked to speak at the 5th Computational Drug Discovery & Development for Biologics Summit. This was my first virtual conference – it was a shame I didn’t get to visit Boston, and presenting to my empty room was slightly bizarre, but it was great to hear what people have been working on, and there’s definitely something to be said for attending a conference in fluffy socks…

A, antibody structure. An antibody is made up of four chains: two light (orange) and two heavy (blue). Each chain is made up of a series of domains—the variable domains of the light and heavy chains together are known as the Fv region (shown on the right; PDB entry 12E8). The Fv features six loops known as complementarity determining regions or CDRs (shown in dark blue); these are mainly responsible for antigen binding. B, example sequences for the VH and VL, highlighting the CDR regions and the genetic composition. It is estimated that the human antibody repertoire contains up to 1013 unique sequences, enabling the immune system to respond to almost any antigen. This is possible through the recombination of V, D and J gene segments, junctional diversification, and somatic hypermutation.

Continue reading →

Improving your Python code quality using git pre-commit hooks

Intro

I recently completed an internship during which I spent a considerable amount of time doing software engineering. One of my main take-aways from this experience was that in industry, a lot more attention is spent on ensuring that code committed to a GitHub repo is clean and bug-free.

This is achieved through several means like code review (get other people to read your code), test-driven development (make sure your code works as you are adding functionality) or paired development (have two people work together on the same piece of code). Here, I will instead focus on a useful tool that is easy to integrate into your existing git workflow: Pre-commit hooks.

Continue reading →

From Jupyter to Slides using RISE

In preparation for remote teaching this year, I’ve spent the last few weeks converting the Doctoral Training Centre’s ‘Introduction to Computer Programming’ course into a series of Jupyter notebooks so that the course can be run entirely using Google Colaboratory.

Continue reading →

Smaller and Smaller Fragments

Fragment-based drug discovery (FBDD) is based on the idea that using small (< 300 Da), highly soluble compounds to screen against a target will give higher hit rates and sample chemical space more efficiently compared to screens using larger, drug-like compounds.

Continue reading →

The right tool for the job – The Joy of Excel

Excel’s pervasiveness has resulted in it being used (correctly or incorrectly) in just about every area of science.

Unfortunately, Excel has some traps for the new player and unless you’ve fallen for them before, they are not entirely obvious. They stem from the fact that Excel will try to help the user by reformatting data into what it thinks you mean.

Continue reading →

Oxford Protein Informatics Group

or "OPIG" to friends

Cryo-EM reaches atomic resolution in protein structure determination

A new Graduate students (unexperienced) guide to academic literature.

3 Useful UNIX commands you might not know

nohup

NeurIPS 2020: Chemistry / Biology papers

Speaking about Sequence and Structure at a Summit

Improving your Python code quality using git pre-commit hooks

Intro

From Jupyter to Slides using RISE

Smaller and Smaller Fragments

The right tool for the job – The Joy of Excel