跳到主要内容

ABFE Manual

1. ABFE Submission

The ABFE Submission interface manages job submission for FEP calculations, which includes the following functions: uploading protein and protein paration (in the Protein function), uploading ligand and ligand preparation (in the Ligands function) and setting task parameters

1.1 Protein

This interface contains necessary processes for setting up the protein for the FEP calculations, including protein upload, protein structure processing, build membrane protein, and protein force field parameter settings.

WindowWindow DisplayParametersDescriptionNotes
Protein

import_structure

Upload a ProteinUpload a local protein file1)Supports uploading .pbd and .mae format files from local, but does not support uploading membrance-bond protein files.
2)The protein preparation window will pop up automatically after uploading the protein.
3) Support non-standard amino acids: TPO, SEP, SEQ, HYP, ORN, DAB.
Protein Preparation

import_structure

Select Polymers to KeepSelect the amino acid chain to retain
Select Water(s) to KeepSelect water molecules to retain
Select Other Groups to KeepSelect other molecules or functional groups to keepIncludes cofactors and covalent substitution groups
Protonation State SettingSet the parameters for determining amino acid side chainsMaintain protonated state: Retain the protonation state of the input.
Modify protonated state: Adjust the protonation state based on the pH settings below.
Protein Force FieldSelect protein forcefieldSupports AMBER14SB-TIP3P and AMBER99SB-STAR-ILDN-MUT-TIP3P
Protein

import_structure

ProteinsDisplay protein information1)After protein preparation, the system automatically performs a protein validation check, which returns one of three statuses: Valid, Error, or Warning. You can access the Protein Reliability Report by clicking the icon next to the protein status.
2)In the protein list, the Delete and Show/Hide buttons under Op.(Operation) allow you to delete or show/hide proteins.
3)If the protein validation check fails (Error), you can delete the protein and re-upload it.

import_structure

Modify Protonation StatesModify the protonation state of a specific amino acid side chainIf the protein validation check results show a protonation state warning, or if there are important protonatable amino acids near the protein-ligand binding site, you can manually adjust the amino acid protonation states here.
Add Non-standard ComponentAdd Non-standard ComponentIf there are non-standard components (such as co-factors or metal ions) near the protein binding site, you can upload relevant files to ensure the accuracy of the simulation.
Build MembraneBuild membrane structuresIf the target protein is a membrane protein, you can build the membrane-bound protein structure here
Modify Protonation States

import_structure

Protonated TypeProtonation state of the amino acidSupports modification of the side chain protonation states for HIS/ASP/GLU/LYS/CYS.
Click "Submit" to confirm the changes.
FocusFocus on a specific amino acidClick this button to center and highlight the corresponding amino acid in the 3D Viewer window.
Add Non-standard Component

import_structure

Upload Co-factor FilesUpload structure files for co-factors.Supports uploading relevant non-standard component structures or pre-built force field parameter files.
Note: Each uploaded structure file and force field file will be included in the final calculation. Please carefully check that the uploaded components are correct, and ensure that the structure and force field files correspond to desired components.
Upload External Force Field FilesUpload external force field files
Build Membrane

import_structure

Membrane ThicknessMembrance ThicknessNote: Before building the membrane, the protein preparation, modify protonation states, and the add non-standard components steps must be completed. Additionally, the ligand upload and the ligand alignment steps should be finished.
Solvent PaddingSolvent Padding
NPT Equilibration TimeNPT Equilibration Time
Automatically orient the protein in the membraneAutomatically orient the protein in the membrane
Make the membrane squareMake the membrane square

1.2 Ligands

This interface includes necessary steps to set up ligands for the FEP calculation, including ligand upload, force field settings, ligand alignment, and uploading affinity data.

WindowWindow DisplayParametersDescriptionNotes
Ligands

import_structure

Upload Ligand(s)Upload Ligand(s)Upload local ligand files in .sdf format. Experimental IC₅₀ values and ΔG values are supported
LigandsUploaded ligands will enter the ligands listAfter uploading ligands, the system automatically performs a ligand validation check, which returns one of three statuses: Valid, Error, or Warning. You can access the Ligand Reliability Report by clicking the icon next to the ligand status.
Ligand Force FieldLigand forcefieldSupports gaff2
Dock LigandsDocking LigandsGenerate Docking conformations
Align LigandsAlign ligands
Upload Affinity DataUpload Affinity Data
Dock Ligands

import_structure
Docking Pocket Setting

Pocket SettingPocket setting methodSupports setting pockets through ligand selection, residue selection, project selection, and pocket file upload. Cutoff can be used to set the box size, with larger values resulting in larger boxes.

import_structure
Docking Ligands

FocusFocusClick this button to center and highlight the corresponding ligand in the 3D Viewer window.
Align Ligands

import_structure
Select reference ligand for alignment

Min Distance to ProteinMinimal distance between the ligand and the proteinWhen performing ligand alignments, one must ensure that the reference ligand is within the protein pocket. The distance between the ligand and the protein can be used to determine whether the ligand should be selected as the reference ligand.
FocusFocusClick this button to center and highlight the corresponding ligand in the 3D Viewer window.

import_structure
Select Alignment Ligands

Atom Mapping
import_structure
Edit the atom mapping between the ligand to be aligned and the reference ligand.During ligand alignment, the program will use the default MCS constrained docking method to align the ligands. This may result in the same functional groups within the molecule pair not aligning together, especially when macrocycle structures are involved. If you know the common structural regions of the ligands and want to avoid misalignment due to alignment offsets, you can specify the common structural regions through atom mapping before performing the alignment calculation.
Upload Affinity Data

import_structure

Upload Affinity Data File (.csv)Upload affinity data from local filesWhen the ligand molecules have measured experimental activity data (such as IC₅₀ or Ki values), you can batch upload the corresponding experimental values using the Upload Affinity Data File (.csv).

import_structure

Value TypeSupport IC₅₀ / ΔG1)The ligand name must match the name of the ligands uploaded previously.
2)When uploading activity/affinity values via data files, you need to select the value type in the Value Type field. Based on this selection, the values will be converted into ΔG and recorded in the ligand list.
3)The unit of IC₅₀ is nM, the unit of ΔG is kcal/mol

1.3 Job Submission

Click the Configure Calculationbutton to enter the Submit Calculation interface.

ItemsParametersNotes
ProteinInclude the protein name and the protein category (with / without a membrance)
LigandsInclude pairs and corresponding Sim. Score, ΔΔG_Exp, Op. (Mapping/Delete) of each pair
Simulation TimeSimulation TimeIt is recommended to start the calculation with 2ns, and increase simulation time when needed.
ParametersWater PaddingRecommend to use the default value
Ion ConcentrationRecommend to use the default value
TemperatureRecommend to use the default value
ReplicasRecommend to use the default value
CostsThis task will consume N pairs.Job cost is determined by the submitted protein system (with or without membrane), the number of molecule pairs, simulation duration, and the number of replicas.

2. ABFE Jobs

The ABFE Jobs interface provides calculation information for each ligand.

import_structure

2.1 Calculation Result of a Single Ligand

ItemsDescriptionOperation example
LigandLigand name——
ΔG_ExpExperimental binding free energy value of the ligand——
ΔG_FEPFEP calculated binding free energy value of the ligand——
Op.Setting
Select one calculation result of this ligand to perform result analysis

import_structure

Delete
Delete all jobs related to this ligand

import_structure

2.2 Analysis Result of a Single Ligand

ItmesOperations / SettingsDescriptionNotes
Simulation——The index of the simulation
InformationStatus: job status
Start Time: Job start time
End Time: Job end time
Job ID: Job ID
Username: Username
Parameters——The same as parameters used for submitting the job
Process——Job process
Convergence

import_structure

Convergence analysisThe convergence analysis shows the difference between the forward and the reverse free energy calculation results, which reflects the extent of hysteresis. Ideally, the difference between forward and reverse results should be small, indicating that the system has fully converged and sampling deviations are negligible. A large difference usually means insufficient state overlap or inadequate sampling. In addition, a smooth and stable moving average suggests that the free energy value has converged, while a continuously changing moving average indicates that the system is not fully converged and still evolving.
Free Energy by ℷ

import_structure

λ Decomposition of the Free EnergyThe λ decomposition of free energy shows the free energy change across different λ states in the system. If ΔΔG values change smoothly across different λ values, it indicates that the system has a stable transition during the alchemical transformation process. Abrupt jumps at certain λ values may indicate critical transition points, insufficient sampling, or system instability.
Overlap Matrix

import_structure

Phase Space OverlapIn FEP calculations, the ligand A must be perturbed into ligand B through a series of well-overlapping intermediate λ states to obtain accurate results. The phase-space overlap matrix visualizes the conformational sampling overlap between different λ states. Each cell value represents the degree of conformational space shared between two adjacent λ windows, with higher values (typically recommended to be greater than 0.15) indicating smoother transitions between states and more reliable calculations. If certain regions show low overlap, it may suggest significant energy barriers or insufficient sampling between states.
Replica Distribution

import_structure

Replica DistributionThe replica distribution shows the sampling coverage across different λ states. A uniform distribution indicates that all λ states have been sufficiently sampled, and the sampling is stable and even across all intervals. An uneven distribution may suggest sampling bottlenecks or energy barriers in certain λ intervals, which can affect the accuracy of the final free energy.
Op.logLog
ReportView and download the analysis report
...-TrajectoryView the MD trajectoryProvide visualization videos of the two molecules in apo states and complex states
...-StopStop the job
...-ExtendExtend the simulation time

import_structure

...-Re-submitResubmitt the calculation

import_structure

...-DeleteDelete this job

3. ABFE Analysis

The ABFE Analysis interface is composed of three sections: the project overview, the view interface, and the job list.

import_structure

3.1 Project Overview

This section shows the number of ligands in the project, as well as the number of currently displaying ligands. The Export Report button on the right side can export the result as well as the summary of this project.

3.2 View Interface

ViewDescriptionExampleOperations / SettingsDescription
Correlation Plot of ΔGΔG correlation plot:correlation analysis of the calculation result of all ligands.
The dark region indicates an error range of <1 kcal/mol; the shallow region indicates an error range of <2 kcal/mol

import_structure

RMSE

Root mean square error, computed through:

1ni=1n<br/>(ΔGFEPiΔGExpi)2 \sqrt{\frac 1n\sum_{i=1}^n <br />(ΔG_{FEP}^i-ΔG_{Exp}^i)^2}


When the molecule activity values are similar, it is recommended to refer to the RMSE value. The smaller the RMSE, the better the calculation results, with RMSE < 1.4 kcal/mol being an acceptable range.

R2

Correlation coefficient, computed through:

1(i=1n(ΔGFEPiΔGExpi)2i=1n(ΔGFEPΔGExpi)2) 1 - \left( \frac {\sum_{i=1}^n (ΔG_{FEP}^i - ΔG_{Exp}^i)^2} {\sum_{i=1}^n (\overline{ΔG}_{FEP} - ΔG_{Exp}^i)^2} \right)

When the span of the molecule's experimental activity values is large, it is recommended to refer to the value. The closer R² is to 1, the more accurate the calculation results are, with R² > 0.4 being considered 'acceptable'.

Show LabelsShow ligands names of each point in the plot
Retrospective / ProspectiveWhen experimental values are uploaded for a ligand, the correlation plot displays the correlation between FEP computed values and experimental values, represented by blue data points (Retrospective).
When no experimental values are uploaded, the correlation plot only displays the correlation between FEP computed values and the molecule itself, represented by green data points (Prospective).

3.3 Job List

ItemExampleSub-itemsDescription
Ligands

import_structure

Convert to IC(50)Convert to IC(50)
NameLigand Name
ΔG_Exp (kcal/mol)Experimentally measured binding free energy of the ligand
ΔG_FEPFEP calculated binding free energy of the ligand
Op.Show/Hide
show/hide this ligand

4. Files

The file center records the input files and the result files of each FEP calculation.

1_Protein_and_Cofactors: Supports downloading initial protein file([protein name].pdb), the protein protonation form (protein_protonation_states.csv), non-standard factor files. If the protein has been embedded in the membrance, then [protein name]_with_membrane.pdb will also be included.

2_Ligands: Supports downloading ligand files([ligand name].sdf) and uploading ligand binding affinity files(FEP Nash_exp_affinity.csv)

3_ABFE_Simulations: Supports downloading different simulation batches (Simulation_n) of molecular calculation files, including task information ([ligand name]_simulation0_info.csv), log information ([ligand name]_simulation0_log.txt), calculation report ([ligand name]simulation0_report.pdf), trajectory file (Trajectory[ligand name]).

4_Analysis: Supports downloading molecular calculation result files (ABFE_ligands_info.csv).

import_structure