How to SCP files from a gated server to your personal computer

Jack recently wrote a blog post providing a script that transfers files between your personal computer and a remote machine by temporarily hosting them on file.io (blog post here). If you have sensitive data that you do not want to risk hosting online, you can also fairly easily use SCP to keep business strictly between your local and remote machines.

The approach I am referring to is described here. That post covers the case where you want to SCP from your local host to a remote machine which is only accessible via a gate server (incidentally, this isn’t necessarily the case for the Stats computers, as we can use the VPN to access our remote machine of choice directly). I won’t plagiarise the post I linked to, as its explanation is clear enough in itself, but in short: you forward a port on your local machine through the gate server to the remote machine, then point SCP at that port on localhost!
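As a rough illustration of the idea (not a copy of the linked post), here is a minimal Python sketch that just builds the two commands involved: an `ssh -L` tunnel through the gate server, and an `scp -P` call aimed at the forwarded port on localhost. The host names, user name, file path and local port 2222 are all hypothetical placeholders, and the sketch assumes the remote machine accepts SSH on port 22.

```python
import shlex


def tunnel_command(gate_user, gate_host, remote_host, local_port=2222):
    """Build an ssh command that forwards local_port to port 22 on remote_host via the gate server."""
    # -N: do not run a remote command, only forward; -L: local port forwarding
    return ["ssh", "-N", "-L", f"{local_port}:{remote_host}:22", f"{gate_user}@{gate_host}"]


def scp_download_command(remote_user, remote_path, local_path, local_port=2222):
    """Build an scp command that pulls remote_path through the tunnel listening on localhost."""
    # -P (capital) sets the port for scp; the "host" is localhost because of the tunnel
    return ["scp", "-P", str(local_port), f"{remote_user}@localhost:{remote_path}", local_path]


# Hypothetical example values, for illustration only
print(shlex.join(tunnel_command("eve", "gate.example.org", "remote.example.org")))
print(shlex.join(scp_download_command("eve", "results/data.csv", ".")))
```

Run the first printed command in one terminal and leave it open, then run the second in another terminal. To upload instead, swap the source and destination arguments of the `scp` call.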

Best wishes,

Eve

Curing Dogs With Cancer: The Power of the Antibody

This blog post finally combines the two great passions of my life: antibodies and dogs. Therapeutic antibody development is a huge area and is certainly not limited to humans. In the process of developing antibodies, we often use mouse or rat antibodies, obtained by injecting the animal with the antigen of choice and then collecting the resulting antibodies. The first monoclonal antibodies (mAbs) were produced in this way, by fusing spleen B cells from an immunised mouse or rabbit with immortalised myeloma cells to form antibody-expressing hybridoma cells. However, the use of antibodies to treat disease in animals lags behind their use in humans.

Continue reading

Re-educating myself about the light chain

I have an unconscious habit of personification, and I always see the antibody light chain as lazy for not contributing more residues to binding interfaces (obviously a generalisation – e.g. insertions in CDRL4 in anti-HIV bNAbs [1]). Perhaps this is why I have a personal preference for the more diverse [2] heavy chain with its specificity-determining [3] CDR3. Having written this down, I realised it’s actually pretty weird to consider an antibody chain as a person and I ought to re-educate myself about the role that light chains play.

Continue reading

C is for Cysteines (plus a fun quiz)

At group meeting a few weeks ago I presented this paper, “Landscape of Non-canonical Cysteines in Human VH Repertoire Revealed by Immunogenetic Analysis”, from Prabakaran and Chowdhury. The paper is an investigation of the frequency, location and patterns of cysteines contained in human antibody sequences. Cysteines are important amino acids found in proteins, including antibodies, which can form disulphide bonds with other cysteines due to the presence of their reactive sulfhydryl group in the side chain.

Continue reading

Le Tour de Farce v8.0

Last Tuesday marked two exciting milestones for me in OPIG! Not only had I been looking forward to group socials since the beginning of lockdown, but I’d never met anyone other than Charlotte in person since starting in the group in April. As such, the annual cycling pub trip was an apt introduction to several OPIG members (who are now exempt from the game I play by myself during weekly Zoom group meetings: “Guess how tall this person is in real life!”) and a chance to interact with people other than my housemates! 

Continue reading

Understanding Conformational Entropy in Small Molecules

While entropy is a major driving force in many chemical changes and is a key component of the free energy of a molecule, it can be challenging to calculate with standard quantum thermochemical methods. For flexible molecules, the total entropy can be broken down into different components, including vibrational, translational, rotational and conformational entropy. The calculation of conformational entropy is the most time-consuming, as we have to sample all thermally-accessible conformers. Here, we attempt to understand the components that contribute to the conformational entropy of a molecule, and develop a physically-motivated statistical model to rapidly predict the conformational entropies of small molecules.
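To make the sampling cost concrete, here is a minimal sketch of the textbook Gibbs expression for conformational entropy, S = -R Σ p_i ln p_i, with Boltzmann populations p_i computed from relative conformer energies. This is not the statistical model from the post; the function name and the use of plain relative energies in kJ/mol (rather than conformer free energies) are simplifying assumptions made for illustration.

```python
import math

R = 8.314462618e-3  # gas constant in kJ/(mol*K)


def conformational_entropy(energies_kj_mol, temperature=298.15):
    """Gibbs entropy over conformers, in kJ/(mol*K), from relative energies in kJ/mol."""
    e_min = min(energies_kj_mol)
    # Boltzmann weights relative to the lowest-energy conformer (avoids overflow)
    weights = [math.exp(-(e - e_min) / (R * temperature)) for e in energies_kj_mol]
    z = sum(weights)
    probs = [w / z for w in weights]
    # Skip zero populations, since p*ln(p) tends to 0 as p tends to 0
    return -R * sum(p * math.log(p) for p in probs if p > 0.0)


# Hypothetical energies: three conformers at 0, 2 and 5 kJ/mol above the minimum
print(conformational_entropy([0.0, 2.0, 5.0]))
```

A single conformer gives zero conformational entropy and n degenerate conformers give R ln n, which is why every thermally-accessible conformer has to be found before the sum can be trusted.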

Continue reading

Learning from Biased Datasets

The beauty and the downfall of learning-based methods are one and the same: the data used for training will largely determine the quality of any model or system.

While there have been numerous algorithmic advances in recent years, the most successful applications of machine learning have been in areas where either (i) you can generate your own data in a fully understood environment (e.g. AlphaGo/AlphaZero), or (ii) data is so abundant that you’re essentially training on “everything” (e.g. GPT2/3, CNNs trained on ImageNet).

This covers only a narrow range of applications, with most data not falling into one of these two categories. Unfortunately, when this is true (and even sometimes when you are in one of those rare cases) your data is almost certainly biased – you just may or may not know it.

Continue reading

Prerecording Conference Talks and Posters using OBS Studio

Seemingly every conference due to take place this year has either been cancelled or will be run virtually due to the COVID-19 pandemic. Many organisers have decided that running entirely live virtual programmes causes more trouble than it’s worth (e.g. due to unforeseeable IT and internet issues disrupting the schedule), and so are asking their presenters to prerecord their talks, which are then broadcast “live” on the day.

I recently “presented” two virtual prerecorded talks at the ISMB conference using Open Broadcaster Software Studio (OBS Studio), a free open-source software package most commonly used by live-streamers on Twitch and YouTube. It is super simple to use and achieves a professional output, with video overlaying a presentation slide deck/poster PDF. This blog post is a “how-to” on getting started with OBS for conference talks/poster presentations.

Continue reading

Pigs in the Parks: OPIG Social 28JUL2020

Tuesday afternoon normally heralds Group Meeting, the precious hour of the week where we gather on Zoom to hear about recently published papers, dissect each other’s research and, most importantly, bicker about appropriate usage of the servers. Knowing that Fergus B was on holiday this week and that a Group Meeting devoid of SLURM-inspired ranting would have felt strangely empty, it was instead decided that now was the time for the first in-person group social since the lockdown began in March.

Struggling to adapt to not being able to turn off Mic and Webcam – how on earth did we manage like this all the time before?!

Continue reading

Climate Change @ ISMB

Another special session I listened to at ISMB 2020 was the Green stream. Several talks dealt with climate change and its relation to bioinformatics and computational biology. I found two of them particularly interesting: one calculating the carbon footprint of ISMB itself and the other calculating the footprint of specific bioinformatics tools.

I believe most people have realised how important the issue of human-made climate change is, and I assume that everyone has heard about some aspects of our lives that cause particularly high emissions compared to certain alternatives. For example, short-haul flights vs. train rides, mass production of meat vs. eating the food’s food (veggies), or coal plants vs. renewable energy, just to name some that are rather easy to change. Admittedly, I have also underestimated the urgency of the issue, and I found this plot quite convincing:

(Screenshot from Alex Bateman’s talk)

What can we as computational researchers do about it?

Continue reading