Protein Contact Universe

Documentation

/Overview

?to toggle

Overview

Protein Contact Universe is an interactive dashboard for exploring inter-residue contact networks derived from protein 3D structures. It combines a high-performance 3D molecular viewer with network graphs, data tables, and functional annotation panels, all coordinated in a single workspace.

Every widget on the dashboard is linked: selecting a residue in the 3D viewer highlights the corresponding node in the network graph, scrolls the contact table, and surfaces relevant annotations. This tight integration lets you move between macro-level structure and atomic-level detail without context-switching.

Allosteric Pathway

Allosteric Pathway finds candidate communication routes through the protein's residue contact network, specifically chains of interacting residues that could relay conformational signals between distant sites. It supports two modes: Auto (default) automatically identifies promising start/end pairs, and Manual lets you pick specific residues.

Modes

In Auto mode, the tool evaluates many possible start/end residue pairs and returns ranked candidates. This is the recommended starting point for exploration.

In Manual mode, you choose start and end residues using the sliders in the settings panel (gear icon). You can also adjust confidence thresholds and search parameters.

Reading the results

Results are organised into candidates, each representing a start/end residue pair with one or more paths between them. Expand a candidate to see its paths, then click a path to highlight the residues in the 3D viewer and contact network.

Below each path you will see colour-coded labels showing the secondary structure elements the path traverses: H for helices, E for sheets, and L for loops. Paths that cross multiple distinct structural elements are more likely to represent genuine inter-domain communication rather than trivial backbone walks.

Candidate scoring

Each candidate receives a composite score (0–1) built from three terms:

Path efficiency: how short the best paths are between this pair.
Endpoint strength: how well-connected the start and end residues are in the network.
Path diversity: whether multiple independent routes exist (preferred over a single route).

The score breakdown is shown when a candidate is expanded.

Settings

Open the settings panel (gear icon) to adjust search parameters. All settings take effect immediately.

Min pLDDT: exclude residues below this structural confidence threshold (default 50).
Max PAE (Å): penalise contacts with high predicted alignment error. Leave empty to disable.
Top paths: how many paths to return per candidate (default 3).
Max runs: search budget. Higher values explore more thoroughly but take longer (default 120).
Max path length: the longest path to consider, measured in residues (default 18, range 2-50).
Reject single-helix paths: discard paths that stay entirely within one helix, since these are structurally trivial. Enabled by default.
Helix penalty: how strongly to discourage the search from walking along a helix backbone. Higher values push paths toward cross-element routes. Set to 1.0x to disable (default 3.0x).

Use Reset to defaults at the bottom of the panel to restore all settings.

How it works

Atom-level contacts are collapsed to residue-level connections, keeping the closest atom distance for each residue pair. Residues with low structural confidence (pLDDT below the threshold) are excluded, and if PAE data is available, contacts with high predicted alignment error are penalised or removed.

The search then finds the shortest paths through the filtered residue network, returning multiple alternative routes between each start/end pair. Paths are ranked by total distance and confidence, favouring short, high-confidence connections.

To avoid reporting trivial backbone walks as allosteric pathways, the search assigns a penalty to contacts between residues that are close in sequence and within the same helix. Paths that remain entirely within a single helical segment can be automatically rejected. The widget reports how many paths were rejected this way.

Predicted Aligned Error (PAE)

The PAE widget displays AlphaFold's predicted aligned error matrix as an interactive heatmap. Each cell (i, j) represents how confidently the model predicts the relative position of residue j when the structure is aligned on residue i. Low values indicate high positional confidence, while high values indicate uncertainty. Regions of consistently low error often correspond to rigid structural domains, while high-error off-diagonal blocks suggest flexible interdomain relationships. The panel header lets you switch between several perceptually stronger colormaps.

Selecting a region

Click and drag on the heatmap to draw a rectangular selection. The selected area is highlighted in blue and defines which residue pairs are sent to the 3D viewer for visualisation.

Move: click inside the selection and drag to reposition it.
Resize: drag any edge or corner to adjust the selection boundaries.
Clear: click the × button in the top-right corner of the selection, or click outside it on the heatmap.

Summary statistics (max, p90, p95 error) for the selected region are shown in the top-right corner of the heatmap.

Visualising in the 3D viewer

When a selection is active, the Visualize toggle in the panel header controls whether error lines are drawn in the Mol* viewer. Each line connects a pair of C-alpha atoms and is coloured with the same colormap as the heatmap, so palette changes stay synchronized between the 2D and 3D views. This lets you see spatial patterns of confidence directly on the structure.

The opacity button (half-circle icon, to the right of Visualize) opens controls for the maximum line opacity and the opacity policy. The default Error-weighted mode fades high-error lines more aggressively while keeping low-error lines easy to read, and the Fade strength preset lets you tune how strongly that emphasis is applied. The colormap button beside it switches the shared heatmap and Mol* palette.

Choosing a colormap

The colormap selector changes both the heatmap and the Mol* error lines. Different palettes emphasize different parts of the range:

Greens: the default that is used by the AlphaFold database.
RWB: default diverging map with a neutral middle band. Useful when midrange values should stand apart from low and high values.
Inferno: high-contrast sequential map. Useful when line visibility in the 3D view is the main concern.
Cividis: lower-chroma sequential map with better color-vision robustness.
Viridis: sequential map with good separation across the full range.

Legend and range filter

The gradient bar below the heatmap doubles as an interactive range filter. Two triangular handles underneath the bar let you narrow the visible error range: drag a handle to move one boundary, or drag the bar between the handles to slide the entire range. Filtered-out regions turn white on both the legend and the heatmap. The Low and High labels beside the bar show the current boundary values in angstroms.

The filter operates on the raw matrix value at each cell, so the heatmap pixels that remain coloured are exactly the pairs eligible for 3D visualisation.

Controlling the number of lines

A large selection can contain millions of residue pairs. Drawing all of them would overwhelm both the viewer and your GPU, so the widget samples a representative subset. Several settings in the gear menu work together to determine which lines are drawn and how many.

Edge budget: a hard cap on the number of lines that can be drawn, regardless of other settings (default 75,000). The actual number may be lower depending on the edge usage and the active range filter.
Edge usage (%): controls what fraction of the edge budget is used. At 100% the full budget is consumed. At lower values the budget is reduced quadratically, so small adjustments near 0% have a large effect while adjustments near 100% are more subtle.

Sampling strategy

Rather than picking random pairs, the sampling algorithm divides the selected region into blocks and scores each block by how much error variation it contains and how many low-error pairs it holds. It then selects a representative edge from each block, one whose error is close to the block's median, with a slight bias toward lower errors. This ensures that the drawn lines faithfully represent the overall error landscape even when only a fraction of edges are shown.

Other settings

Low error threshold (Å): pairs below this value are treated as “high confidence” during sampling. The algorithm gives extra weight to blocks containing many such pairs, making them more likely to contribute lines (default 5 Å).
Favor edge: the PAE matrix is asymmetric because aligning on residue i then measuring j can differ from aligning on j and measuring i. This setting controls how the two values are combined into a single edge. Lower takes the smaller of the two (default) and Higher takes the larger.
Live range updates: when enabled, the heatmap and visualisation update continuously while you drag the range filter handles. Disable this on large matrices if dragging feels sluggish.
Rounded corners: toggles rounded corners on the heatmap canvas.
Downsample to 1024: for very large proteins the matrix can exceed typical screen resolution. This option caps the rendered canvas at 1024 × 1024 pixels, reducing GPU and memory usage with no loss of analytical precision (sampling and selection still operate on the full matrix).

Keyboard Shortcuts

Shortcut	Action
`Cmd/Ctrl + A`	Toggle selection panel
`Cmd/Ctrl + B`	Toggle widget palette
`Shift + S`	Switch to Structure Evaluation layout
`Shift + N`	Switch to Contact Network layout
`Shift + M`	Switch to Mechanism Analysis layout
`Shift + E`	Switch to Empty Dashboard layout
`?`	Toggle documentation overlay
`Escape`	Close documentation overlay