“Ghosts” in scientific images and narratives

copyright 2019 by Russ Hodge

This piece was motivated by my recent correspondence with Jens Wohlmann, a talented young scientist working in Norway. It follows up on two previous pieces concerning phenomena I call “ghosts”, which play crucial (and often disruptive) roles in scientific thinking and communication. They can be read here:

A brief overview in interview format

A more detailed introduction to the problem of ghosts and examples of various types

The letter from Jens:

Dear Russ,

It has been a while since I wrote you but I have been following your blog and I was actually thinking that I should write you about some “observations” concerning your “ghosts” – so I will use your mail as a reason to finally do so:

Listening to some recent talks by scientists of our institute I came across “forces” you are most likely aware of but which are new to me – in the terminology of “ghosts” one could probably describe them as “goblins” or “deranging ghosts”. I think of drawings and models of structures of interest in papers, presentations, schemes and so on. One example is when parts seem to be totally out of scale.

For example, a small GFP tagged Protein will be marked by a small “star” as a marker attached it, although the GFP itself may be double the size of the protein of interest. The same is true for markers on antibodies. In EM we put a dot on a Y-shaped structure, but the whole antibody has a size of 15nm. Since the particle may be 10 or 15nm it should be as big as the antibody. Even the orientation in the illustrations is kept consistent – always with the binding end pointing “outwards”. This confuses students if steric hindrance is important, because in reality the structure is most likely a chaotic, multi-layered coat oriented in all possible directions.

Another nice example can be seen in schemes of transmembrane proteins, pores or receptors. Most of the time the structure of interest is presented as a huge thing standing alone on the cell surface (because it’s so important), and the membrane is represented as a thin line – but in reality the size of the molecule is only slightly larger than the membrane, and it may be entirely inside it. Of course schemes need to be simplified, but this emptiness often gives the impression that cells are empty membrane bubbles, whereas the cytosol has an incredible high protein concentration and is full of fibrers and structures….

I think this “out of scale” representations can result in similar problems as your ghosts and they can be seriously found everywhere.

Best, Jens

My response in four sections:

Hi Jens,

I think you’re absolutely on the mark with your observations about the peculiar ways biological entities are represented in images or schemes. The examples you gave are excellent. At the moment, I’m exploring ways of mapping some of these problems into the conceptual framework of “ghosts”. Below I’ve broken this down into a few related points.

1: Overview

To me what makes this so important is that we use images and schemes to represent complex concepts, but obviously ideas undergo important transformations as they are translated into visual form. Relationships that we know are three-dimensional are pressed into two. Key points are brought into the foreground, while others fade into the background or disappear altogether.

And dynamic processes are broken into static frames. What happens a lot like the difference between a musical phrase and the way it is represented in a score; composers and musicians know that tones aren’t “particles” just lined up in sequences (they are “waves” integrated into longer “waves” – phrases of different lengths in different voices). I’ll be talking about thinking of cellular processes in terms of “phrases” and other musical terms like polyphony, harmony and dissonance in my upcoming talk in Oslo – this is a much larger discussion.

The ways we translate concepts into images, language or mathematical models are highly susceptible to influences by effects of styles and genres, which reflect experience, habits and expectations and make communication possible. Such “styles” guide the way a thinker produces an image (or text) and the way audiences unpack it to map information onto their own conceptual frameworks. It’s interesting that most of the time, the two are combined: when a speaker shows a slide, he will say something about it; figures in texts are accompanied by legends. The idea behind this, I think, is to ensure that the audience decodes the meaning the way the author or speaker intended.

And here the problem of “ghosts” rears its head – inevitably, a lot of meaning is hidden. Some of it lies in the invisible conceptual architecture that lies behind packing and unpacking; some of it lies in the style or code. And an awful lot of it comes from differences in the way the author and audience have their knowledge organized. As an electron microscopist, you have an extremely high-resolution version of what a cell is in your head; you know how “full of stuff” this landscape is. And you work at a scale where the relative sizes of molecular objects are incredibly important.

But you’ve seen enough talks and read enough papers to be familiar with the styles of most schemes that you’ll see, and you know how to translate them into your own conceptual models. Sometimes they won’t fit. You surely find it equally difficult, sometimes, to translate your ideas into schemes that other people will understand. This is true for all kinds of communication; what makes it interesting in science is that it’s often possible to pinpoint where things go wrong and identify the ghosts very clearly.

I think that recognizing this is essential to the scientific process. Hidden architectures are essential to meaning. Individual scientists – even in the same field – have their knowledge organized in different ways. This creates subtle differences in their views of models that are analogous to variation in biological systems. When these differences collide and become exposed, they lead to refinements and revisions in models. This can be a powerful, efficient, creative process if we are aware that they are there. The problem is that most people don’t consider them consciously when they communicate or teach, and don’t actively look for ghosts that can disrupt communication. If the structure of a collection of concepts remains invisible, students will have to assemble it themselves, and a lot of things can go wrong in the process.

So now I’ll try to break down some of the things you’ve mentioned.

 

2: The problem of “translation” between conceptual models, language, and images

Recently a number of scientists have asked me to create drawings for their talks. They describe something, I try to draw it – FAIL! – they tell me what to fix and I try – FAIL! – and so on, until we finally have it right. This happens even when they pre-draw a scheme, because somehow I don’t seeit the way they do.

There are several things going on here. First, if the scheme represents a model of a physical system, such as a molecular structure, a complex of molecules or a process, the scientist is probably thinking of it visually and spatially but simultaneously functionally. Whatever function he is considering at the moment (foreground) plays a big role in the degree of detail that is in his mind and he wishes to be displayed in the image. So if I’m just trying to show what molecule binds to what, it may be enough to represent single components as generically as Lego blocks. A lot of times in these schemes, the pieces are not even placed in the right relationship to each other – which is understandable given the difficulties of crystallizing complexes. I was astounded many years ago to learn that biochemically, it was even hard to determine how many copy numbers of a specific protein there are in a particular complex.

(For the nerds: Even when crystals are made, weird Fourier transformations (math!) have to be applied to turn X-ray diffraction patterns into electron density maps, then sequence information and homologous structures are applied to find alpha helices, beta sheets, and tell what belongs to what.)

Anyway, in diagrams, very simple models may be sufficient until fine details of their surfaces and issues like steric hindrance suddenly become important in understanding something.

When starting out to make a scheme, I think it’s important to understand that our minds are constantly shifting between considering different types of functions, rapidly shuffling concepts between the foreground and background, and doing so at different scales. So there aren’t “one-size-fits-all” models. Different levels of structure ought to be embedded in each other and linked, but as you well know, most methods in biological don’t give us scalable views of things like Google Earth; if we want to study a new level, we have to change methods. That means, inherently, that models are necessary not only to classify, generalize and describe or depict the components of a system, but to link them to higher and lower levels of structure. Conflicts arise all the time because they are connected by a hidden web of assumptions and structures.

In your work with biologists, yeah – it would nice to have (3D) EM pictures of everything everybody is studying, and even real (3D) structural views of the molecules, but it’s not always necessary to make a certain point.

To give you an example, below I’ve inserted three models of the same thing: a nucleosome. Each of these was clearly developed to emphasize a particular relationship between structure and function. But those relationships lie at different scales, which has dictated the level of detail that is included, what lies in the foreground and background, and influenced all sorts of stylistic decisions.

This helps explain why some of the examples you gave don’t work, or are dissatisfying – people are often lazy about making their own images; they borrow them from other people and don’t check that they are really made to fit the point at hand. As a result, images may not convey the information they’re really aiming at.

Please note: I acquired these images from diverse papers; if any author has a problem with their use, contact me and I will replace it.

There are several issues to consider here. First, A researcher has some sort of visual representation of the system in his head, but when he tries to draw it or create an image he may realize that he hasn’t probed that internal visualization in detail. In fact, the most detailed images here required a computer: the scientist doesn’t have all of this in his head – at least not in this form. This means that there are all kinds of gaps in his concepts, which is interesting because he may be completely unaware of them until he actually tries to create the image. This is one way that engaging in communication can generate entirely new scientific questions. (“Oh, I didn’t know that, or I have no idea where this component belongs – how can I figure it out?”)

When trying to describe something in words, language is notoriously bad at capturing a lot of types of visual information. Part of this has to do with the linear nature of language: you can describe a row of dominoes, which are lined up in a sequence, but if they’re scattered randomly across a table we don’t have enough words for complex two-dimensional shapes, let alone 3D. Or try finding a criminal based on a verbal description. Or drawing a face based on one – the “police artist” problem.

Third, the prerequisite to communicating any model well is having it clearly in your mind, and mapping it onto language in the clearest possible way given your expectations about the audience. A lot of scientists don’t understand all the things that can go wrong in this mapping process.

3: Ghosts in visual styles and genres

A biologist would see the image below much differently than a non-scientist. When I show this to groups of scientists, they all recognize that they are looking at something on the molecular scale. They immediately recognize the double-helix structure of a DNA molecule, and notice that its circular, wrap-like structure encloses ribbon diagrams (simplified schemes of proteins). This probably makes the structure a nucleosome. They will probably assume different colors of the ribbons are meant to represent different proteins – here, four of them. If you assume that this object has a front-back symmetry, then you might guess that there are eight histones in the complex.

Very little of this information is contained in the image per se: it’s extra knowledge that the viewer has to have to decode the scheme.

There are lots of other, very basic “ghosts” related to two-dimensional images you need to be aware of to “understand” and explain this object. We’re used to translating 2D into 3D; shading and shadows create an illusion of depth, but some of this is cultural. Is it very thin or thick? And so on.

But there’s another enormousghost in this image, truly invisible in the most literal sense, that no scientist I’ve shown it to has detected so far. What’s all that white space around the thing? It can’t just be empty space. Nucleosomes only exist in a very specific biochemical environment – that of the nucleus, composed of all kinds of other molecules, a specific pH, and so on. So this object and its nature are contingent on a lot of invisible things that aren’t in the image at all. They are, however, somehow encoded in the image.

4: The “fudge factor”

This type of ghost is something my good friend and mentor Jim Hartman came up with over a lunch last year. It’s omnipresent – there in every example we’ve taken so far – and very complex because it mixes lots of types of other ghosts. In some ways it comes really close to what you called “goblins”.

“Fudge factors” arise from the fact that everyone knows that language, concepts, models and images don’t map onto each other very well. So whenever you describe something, you’re packing some thought into language or an image, transmitting it to someone else, and expecting them to unpack it in a very similar way. The representations are usually highly simplified – highly complex processes are reduced to a shorthand. If everyone translates them the same way, this works fine. But hidden within are lots of ghosts that can make things go very wrong. Think how hard it would be to truly adequately describe – in language – an experimental protocol to someone like me, and expect me to do it right the first time. I barely know a pipette from an electron microscope.

Here’s an example of a text loaded with “fudge factors,” concerning a biochemical signaling pathway that I recently deconstructed with my friend Uwe Benary:

 

A Wnt stimulus leads to the inhibition of the destruction complex that normally targets β-catenin. In consequence, less β-catenin is degraded and more β-catenin is able to enter the nucleus. There it regulates the expression of specific target genes.

 

Any experienced molecular biologist recognizes that dozens (hundreds? Thousands? Millions?) of steps are omitted from this description. To list just a tiny fraction of them: to receive a signal, lots of things have to happen to prepare a cell to bind the Wnt ligand. Lots of types of molecules (including its receptor) have to be presents, in the right quantities. They have to be arranged in often huge complexes – many of whose parts are unknown – that are constantly undergoing dynamic rearrangements. For beta-catenin to get involved, specific sites in its binding partners have to be chemically modified; once the complex dissolves, it is somehow transported to the nucleus and through pores, all along the way interacting with other factors and releasing them again. It has to find its way through masses of chromatin to find specific targets, a process which is hardly understood at all, and then participate in assembling the transcription complexes that will read the DNA sequence and build RNAs. It hits a lot of the “wrong” targets.

There are more types of ghosts: a scientist knows that we are not really talking about single molecules, but a generic model of how whole populations of molecules behave. Etc. Etc.

This type of highly oversimplified account is only meaningful within the context of a particular function of focus – just like the nucleosome image – and because people agree on how a model should be packed into language and unpacked again. Students won’t know all the missing pieces when they hear this, and the ghosts will lead to lots of misunderstandings. They may mistake the shorthand for a complete account of the process.

Interestingly, this skeletal shorthand reflects the history of the beta-catenin model. The bits of the story that are mentioned represent major discoveries over the past couple of decades. Digging out the missing steps has been the subject of an amazing amount of research; still, when the story is told, it’s arranged on the foundations of historical ghosts: what should be pulled into the foreground, what can be “safely ignored,” and what is simply unknown. This shorthand is a perfect example of the operation of fudge factors, a process that constantly generates ghosts.

There are always fudge factors – even in the most detailed experimental protocols, which are based on a researcher’s knowledge of tools and procedures and a large corpus of experimental and biological knowledge. I think they are likely a major cause of difficulty in reproducing experiments. And a lot of disagreements between the preeminent scientists in a field are waged over fudge factors – listening to debates can be extremely confusing if they are not exposed. Sometimes for the non-insiders, it’s hard even to tell what they are arguing about.

More on the profound connection between communication and science

Last year I gave a number of talks on a new model of the relationship between communication and research, which I have covered in “Ghosts, models and meaning in science,” and a more detailed text, here. I’ll be adding articles on this theme in the coming weeks. Comments are greatly appreciated – they have already significantly improved the project.

The core point is that scientific messages derive meaning from their relationship to various models and other concepts that often remain “invisible” (ghosts) in a given text or communicative context. This is true of all kinds of communication, of course. But the natural sciences relate meaning to models in specific, highly structured ways that can be recovered. If this invisible architecture is not shared by a writer or speaker, meaning will be lost. A failure to take this into account is one of the most common reasons people misunderstand a message. And in doing science, being unaware of the link between a project and the models that spawned it can become an obstacle to generating new hypotheses or fully understanding what happens in an experiment.

The inherent connection between thinking about, doing and communicating science is crucial to the quality of research and has important implications for science education. Here I present two slides I use in my talks. These “Concept maps” expose some of the patterns that link these ideas.

 

The first slide shows how a very specific scientific question (rose-colored box at the bottom) can be fit into a hierarchy of more general questions and models. There is no single path for creating such a chart: the same question at the bottom could be analyzed upward in different ways. You might diagram it within a more chemical or physical or evolutionary pathway, because specific questions are embedded in all kinds of models.

There are several important implications.

First, at some level, an experiment which seeks an answer to a very specific question also challenges the higher-order models it is embedded in. Basically, an experiment may be shaking a big tree and probing assumptions concerning several levels of the hierarchy and how they are linked. A highly specific experiment can refute a very large model, theory or linked set of assumptions. For example, all kinds of simple experiments might have shaken evolutionary theory, or a study that characterizes tumor samples could overturn a view of how a particular therapy works.

Secondly, an audience may know nothing about the example given below, which involves NF-kB, transcription factors, signaling pathways and so on. When trying to explain something, a scientist needs to make a reasonable guess about the knowledge of an audience and the kinds of things they are interested in, then find the right level of the hierarchy to jump in. Going downward provides a logical path for a dialogue that moves from a general question to a more specific one – and how they fit together.

The second slide links the way this communicative strategy can help scientists think about a problem more clearly, see relationships between models, and widen their understanding of the implications of their work. 

Stay tuned for more soon.

Russ Hodge

 

 

New entries in the Devil’s dictionary

today’s entries: balancers, blubber, chorology, enation

See the complete Devil’s Dictionary of Scientific Words and Phrases here.

3707_001

all entries in the Devil’s Dictionary copyright 2019 by Russ Hodge

balancers  in immature salamanders, antenna-like structures that grow from the head until the legs come along. Their name comes from their function; ideally there are two, weighing the head down so it doesn’t tip to one side, which would cause the animal to lose its balance. In humans this function is performed by the ears; if they are not symmetrical, one can add piercings to either side until you get it right. In weight-lifting, it is accomplished by ensuring that there is a weight on either side of a barbell.

blubber   in aquatic animals such as whales, a layer of fat between skin and underlying muscles that insulates the insides from the outsides. In media science, a layer of rhetoric that lies between news and facts and prevents them from ever mingling.

chorology  the study of the geographical or topological or stratalogical or scatological distribution of plants and animals throughout the universe or any part of it, such as the Earth, so that you can keep track of where you put them.

enation any outgrowth on a surface that was previously smooth, such as warts, pimples, hair, cars on the street, or the wind turbines in Holland. Ultimately, all enations have effects akin to those of wind turbines, generating a propeller-like force that either hastens the motion of an object in the direction it desires, or pushes it back toward its point of origin. This effect is the reason for EU regulations dictating that the number of wind turbine enations pointed to the east must always be kept in balance with the number pointed west, to avoid reversing the Earth’s rotation. This is also the rationale behind laws requiring the alternate parking of automobiles from one side of the street to the other on various days of the week. At least one physicist has attributed Lance Armstrong’s success in the Tour de France to microscopic enations mounted on his bicycling outfit. Not counting the pinwheel mounted on his handlebars. Or the dope, of course.

 

If you liked the Devil’s Dictionary, you’ll probably also enjoy:

Craig Venter and the alien zombies from Mars

Losing your heart in Heidelberg and getting it back again

 

Some new portraits of old guys…

Copyright 2019 by Russ Hodge; not for use without permission (as opposed to most of the material on the site).

Some of these images (or high-quality reproductions) are for sale if anyone is interested. The formats of the originals are all very large: A0 (84 x 119 cm); medium chalk and wax.Johann Sebastian Bach

Claudio MonteverdiAbraham LincolnEdgar Allen PoeHenry David Thoreau

Musicophobia: an appendix to the Devil’s Dictionary

Today’s entry represents a small appendix to the complete Devil’s Dictionary of Scientific Words and Phrases, which can be seen here.

As always, suggestions for new entries are welcome!

Musicophobia

A small glossary of terms for the fears and conditions that commonly impede the performance or appreciation of music, many of which are entirely justified.

3707_001

all entries in the Devil’s Dictionary copyright 2019 by Russ Hodge

 

1812 syndrome a fear that the cannons pointed at the public during a performance of Tchaikovsky’s overture are loaded with real ammunition

aquatonophobia a fear of being unable to control one’s compulsion to sing in the shower, or any time it rains

arcocidophobia a fear of being fatally stabbed when your stand partner’s bow penetrates your ear

batonocidophobia a fear of being stabbed by a baton released by a conductor, at which point you discover the tip has somehow been dabbed with a toxic substance

bizetophobia a fear of falling in love with the lead singer in an opera, which almost always turns out badly for everyone involved

bombasturinophobia a fear that your bladder will burst during a concert

chorophobia a fear of church choirs or the robes they wear

carillonophobia a fear of bell towers, particularly amongst those living near them

chromataphobia a fear of playing a piece which has been written in a key with more than one or two sharps or flats, just out of malice on the part of the composer

chutephobia a fear of falling into an orchestra pit, impaling yourself a music stand, and becoming trapped there until you bleed out or starve to death

cornemusophobia a fear of ancient reed instruments that produce bleating sounds, usually out of tune

cornoviperophobia a fear that a poisonous snake has crept into your French horn and is residing in one of those inaccessible curves

dingalingophobia a fear of missing one’s cue while performing in a bell choir

dongiovannitis a fear that a zombie will appear onstage during an opera

faintophobia a fear of locking your knees and fainting while singing in a choir, particularly when standing on one of the high risers at the back

fermataphobia a fear that a conductor will hold a note so long that a singer or wind player will asphyxiate

fortissimoflatulaphobia a fear of audibly releasing gas during a concert

gamelonophobia a fear of tonal systems in which octaves are not subdivided into 12 equal parts

karaokephobia a fear of being compelled to sing a solo in public

lloydweberphobia a fear of being compelled to attend a musical after being given tickets as a birthday present

mumphphobia a fear of opening your mouth to sing and nothing comes out, possibly because someone has stuffed a sock in it

music stand collapse anxiety a self-explanatory term

nibelungenphobia a fear that you will die during the performance of a particularly long opera

ohrwurmophobia a fear of getting a tune permanently stuck in one’s head, especially an advertising jingle, children’s song, or polka

reveillephobia a fear of being roused from sleep by a loud trumpet blasting in one’s ear

ringtonophobia severe anxiety caused by not being sure whether you have turned off your cell phone during a concert, or that if one goes off people will think it’s yours

saxamorophobia a fear of falling in love with a saxophone player, which almost always turns out badly

shankarophobia a fear of being trapped in a concert of Indian music that lasts for 12 or 14 hours

sourdaphobia a fear of peforming so badly that a listener to goes deaf, or wishes that he would

sousaphobia a fear of marching bands, which is almost always perfectly justified

stockhausen syndrome a fear of learning to like 12-tone music

tremeloseismophobia a fear that the dissonance created by bad tuning in the bass section will cause vibrations that register on the Richter scale

tritenorophobia a fear of any event involving three tenors

trombocularphobia a fear that your eye will be poked out by a trombone player

valkyriphobia a fear of very large altos

victrolaphobia a fear of becoming paralyzed while trapped in a room with a skipping record

violaphobia a fear of an unnatural behavior on the part of viola players that will lower the quality of their performance even below the expected standards, including fearing that they won’t show up on time, that they will show up on time, fearing that they will be sober when they show up, that they will have remembered to bring their bows, that they accidentally play the right piece, read the right clef, etc. etc.