biosim-schema¶
Schema for Biomolecular Simulation Data
URI: https://CCPBioSim.ac.uk/biosim-schema/
Name: biosim-schema
Classes¶
Class |
Description |
|---|---|
Analysis stage of simulation workflow |
|
A quantity representing angles |
|
Settings used for the simulation barostat |
|
A quantity representing bytes |
|
Potential used for carbohydrate molecules |
|
A quantity representing charge |
|
A quantity representing compressibility |
|
Computational environment used to perform simulation |
|
A quantity representing concentration |
|
Connectivity information included in the topology |
|
A quantity representing energy |
|
Simulation ensemble/thermodynamic properties |
|
Equilibration stage of simulation workflow |
|
A quantity representing force |
|
A quantity representing frequency |
|
A quantity representing friction coefficients |
|
Potential used for molecules |
|
Computer hardware used to perform simulation |
|
Settings used for the simulation integrator |
|
Settings used to define particle interactions |
|
A quantity representing length |
|
Potential used for lipid molecules |
|
Machine learned potential used for molecules |
|
A quantity representing mass |
|
A quantity representing compressibility vectors in 3D |
|
A quantity representing pressure vectors in 3D |
|
A quantity representing general vectors in 3D |
|
Minimisation stage of simulation workflow |
|
A quantity representing molar energy |
|
Identifiers of a molecule type in the simulated system |
|
Potential used for nucleic acid molecules |
|
Particle information included in the topology |
|
Compute performance during simulation |
|
Potential used for polymers molecules |
|
Potentials used for various modelled molecules |
|
A quantity representing pressure |
|
Production stage of simulation workflow |
|
Potential used for protein molecules |
|
Setup stage of simulation workflow |
|
Average values of observables outputted from the simulation |
|
Simulation box information |
|
root class for biosim-schema |
|
Simulation observables outputted from simulation |
|
Settings in simulation |
|
Stages of simulation workflow |
|
Computer software used to perform simulation |
|
Molecular composition of simulated system |
|
Entity counts across the simulated system |
|
A quantity representing temperature |
|
Settings used for the simulation thermostat |
|
A quantity representing time |
|
Topology information |
|
Information in trajectory files |
|
Trajectory information |
|
A quantity representing 3D angles |
|
A quantity representing compressibility in 3D |
|
A quantity representing 3D lengths/dimensions |
|
A quantity representing pressure in 3D |
|
Temperature values for multiple coupling groups |
|
Time values for multiple coupling groups |
|
A quantity representing volume as via 3 length dimensions |
|
A quantity representing volume |
|
Potential used for water molecules |
Slots¶
Slot |
Description |
|---|---|
AlphaFold predicted protein structure identifier (AlphaFold DB, EMBL-EBI) |
|
Analysis stage of a simulation |
|
Name of the method used to analyse simulation |
|
Name of software used to analyse simulation |
|
Name of the tool used to analyse simulation |
|
Number of atoms in a molecule |
|
Average enthalpy sampled from the simulation, given as a float |
|
Average kinetic energy sampled from the simulation, given as a float |
|
Average potential energy sampled from the simulation, given as a float |
|
Average pressure sampled from the simulation, given as a float |
|
Average temperature sampled from the simulation, given as a float |
|
Average volume sampled from the simulation, given as a float |
|
Average volume sampled from the simulation, given as a list, (e |
|
Barostat information |
|
List of barostat algorithms used in the simulation |
|
Constraints applied to bonds between two particles in the simulated system |
|
Are bond parameters present in the topology? |
|
The angles of the box used to simulate the system |
|
The lengths/dimensions of the box used to simulate the system |
|
The type of box used to simuluate the system |
|
Force field for carbohydrates |
|
Force field used to describe carbohydrate |
|
Usually required in the Nosé-Hoover thermostat, the chain length (e |
|
Are atoms coarse-grained? |
|
Collision frequency for the integrator, given as a float |
|
Composition of simulated system |
|
Compressibility of the simulated system, given as a float |
|
Compressibility of the simulated system, given as a float |
|
Computational environment information |
|
Particle connectivity information |
|
Container runtime used to execute the simulation environment, if any |
|
Number of physical CPU cores in each socket |
|
A subset of atoms for which temperature is controlled, often needed for simul… |
|
CPU instruction-set architecture of the compute nodes |
|
CPU vendor used in the compute nodes |
|
Are dihedral parameters present in the topology? |
|
Distance in Angstrom at which a electrostatic interaction is turned off and a… |
|
Electron Microscopy Data Bank ID |
|
Are per-frame energies included in the trajectory? |
|
Total energy consumed by the simulation run |
|
Tolerance energy for when minimization stops, this is when the maximum force … |
|
Mapping from engine-specific parameter names to schema slots |
|
Ensemble information |
|
List of ensemble types used in the simulation |
|
Equilibration stage of a simulation |
|
Broad type of system used to run the simulation workload |
|
Are fixed charges on atoms included in the topology? |
|
Are per-frame forces included in the trajectory? |
|
Total number of snapshots that make up a trajectory |
|
Time between saved snapshots/frames in a simulation, given as a float |
|
Usually used in the Langevin thermostat, the friction coefficient determines … |
|
Force field for molecules (in general) |
|
Force field used to describe molecules |
|
GPU vendor used by the compute nodes, if applicable |
|
Number of GPUs present in each compute node |
|
Computer hardware used to perform simulation |
|
External database identifier |
|
The International Chemical Identifier (InChi) is a textual identifier for che… |
|
The condensed, 27 character InChIKey, which is a hashed version of the full I… |
|
Integrator information |
|
List of integrator algorithms used to integrate the simulation |
|
Interaction information |
|
Force field for lipids |
|
Force field used to describe lipids |
|
Method used to describe long-range interactions between particles |
|
ML force field for molecules (in general) |
|
ML force field used to describe molecules |
|
Are masses present in the topology? |
|
Amount of memory installed per compute node |
|
Minimisation stage of a simulation |
|
Name of the method used to minimise the molecular system |
|
The distance the algorithm moves in a single step, controls how large a step … |
|
Has the initial model been modified from the original? |
|
The molecular formula of a molecule\ |
|
The molecular weight of a molecule |
|
Electrostatic charge of a molecule |
|
Number of instances of a given molecule, defined as bonded atoms or beads |
|
Persistent identifier of a molecule being simulated |
|
Number of monomeric units in a polymer(e |
|
MPI implementation used for distributed parallel execution |
|
Number of compute nodes used for the run |
|
Compute node profile used for the simulation run |
|
Force field for nucleic acids |
|
Force field used to describe nucleic acids |
|
One letter sequence for nucleic acid nucleotides |
|
Number of integration steps performed during minimisation of the system, give… |
|
Number of integration steps performed in the simulation, given as an integer |
|
Simulation observables |
|
Operating system installed on the hardware used to perform the simulation |
|
Particles information |
|
Protein Data Bank identifier |
|
Compute performance during simulation |
|
What directions in a simulation cell periodic boundaries are set if they are … |
|
Are per-frame polarizable charges included in the trajectory? |
|
Force field for polymers (excluding proteins and nucleic acids) |
|
Force field used to describe polymers (excluding proteins and nucleic acids) |
|
Are per-frame positions included in the trajectory? |
|
Force field potentials used to describe simulated particles |
|
Are the molecule positions derived from a prediction? |
|
Step frequency to apply the barostat, given as an integer |
|
List of coupling types for adjusting box vectors |
|
Time constant/step (relaxation time of pressure) used for coupling the system… |
|
Production stage of a simulation |
|
Force field for proteins |
|
Force field used to describe proteins |
|
One letter sequence for protein amino acids |
|
PubChem CID of chemical molecules and their activities against biological ass… |
|
Random number used to set (a) distribution of velocities across particles at … |
|
Is this trajectory a replica of another provided trajectory? |
|
Resolution of simulated system |
|
Are any positional restraints applied to molecule dynamics? (Restraints usual… |
|
Concentration of salt in the solution |
|
Workload manager or job scheduler used to launch the simulation |
|
Simulation settings |
|
Setup stage of a simulation |
|
Name of the tool used to setup simulation |
|
Average values of observables outputted from the simulation |
|
Information about the simulation box |
|
Method used to perform simulation |
|
Name of software used to perform simulation step |
|
Version of software used to perform simulation step |
|
Total time molecular dynamics have been sampled in a simulation trajectory, o… |
|
Tool used to perform simulation |
|
Simplified Molecular Input Line Entry System (SMILES) ASCII-based line notati… |
|
Number of physical CPU sockets in each compute node |
|
Computer software used to perform simulation |
|
Simulation stages |
|
Total electrostatic charge of the system given by the elementary charge (e) |
|
Entity counts across the simulated system |
|
Target/reference pressure set to reach in the simulation, given as a float |
|
Target/reference pressure set to reach in the simulation, given as a 3x3 arra… |
|
Target/reference temperature set to reach in the simulation, given as a float |
|
Target/reference temperature set to reach in the simulation, given as a list … |
|
Time constant for coupling the system temperature in seconds units, given as … |
|
thermostat information |
|
List of thermostat algorithms used in the simulation |
|
Number of hardware threads per physical core |
|
Time between integration steps in a simulation, given as a float |
|
Topology information |
|
Total number of atoms in the simulation |
|
Total number of simulated molecules, defined as bonded atoms or beads |
|
Trajectory information |
|
Information included in trajectory files |
|
UniProt accession ID |
|
Number of unique simulated molecules, defined as bonded atoms or beads |
|
value for a given quantity |
|
unit for a given value |
|
Distance in Angstrom at which a Van der Waals interaction is turned off and a… |
|
vector value for a given quantity |
|
Are per-frame velocities included in the trajectory? |
|
Total elapsed runtime of the simulation |
|
Are water molecules included in the trajectory? |
|
Force field for water molecules |
|
Force field used to describe water molecules |
Enumerations¶
Enumeration |
Description |
|---|---|
Methods used to analyse simulation outputs |
|
Software used to analyse simulation outputs |
|
Tools used to analyse simulation outputs |
|
Units for angles |
|
Barostat algorithm used to set the pressure in a simulation |
|
Fix specific molecular degrees of freedom—typically fast bond vibrations invo… |
|
Types of box used to simulate the system |
|
Units for Bytes |
|
Force fields used to parametrize carbohydrates in a simulation |
|
Units for charge quantity |
|
measures how the system volume responds to pressure changes |
|
Units for concentration quantity |
|
Container runtime used to package and run software |
|
CPU architecture used by the compute nodes |
|
Vendor of the CPU used in the compute nodes |
|
Units for energy |
|
Ensemble used in a simulation |
|
Type of platform used to run the simulation |
|
Units for force |
|
Units used to describe frequency |
|
Relationship between applied forces and the resulting resistance to motion |
|
Force fields used to parametrize molecules in a simulation |
|
Vendor of the GPU used in the compute nodes |
|
Algorithm used to integrate the simulation |
|
Units for length quantity |
|
Force fields used to parametrize lipids in a simulation |
|
Method used to implement long-range interactions of point charges in the simu… |
|
Machine learned potentials |
|
Units for mass/molecular weight |
|
Algorithm used to minimise particle interactions |
|
Resolution of simulated molecules |
|
Units for molar energy |
|
MPI implementation used for message passing |
|
Type of compute node used in the system |
|
Force fields used to parametrize nucleic acids in a simulation |
|
List of operating systems installed on the hardware used to perform the simul… |
|
What directions in a simulation cell periodic boundaries are set if they are … |
|
Force fields used to parametrize polymers in a simulation |
|
Coupling type for adjusting box vectors |
|
Units to describe pressure |
|
Force fields used to parametrize amino acids in proteins in a simulation |
|
Job scheduler used for simulation execution |
|
Tool used to setup a system for simulation |
|
Options for simulation methods |
|
Simulation software used to perform molecular dynamics simulations and relate… |
|
Tools/utility used within simulation engines to perform simulations |
|
Units to describe temperature |
|
Thermostat algorithm used to set the temperature in a simulation |
|
Units used to describe time |
|
Units for volume quantity |
|
Force fields used to parametrize water molecules in a simulation |
Types¶
Type |
Description |
|---|---|
A binary (true or false) value |
|
a compact URI |
|
a date (year, month and day) in an idealized calendar |
|
Either a date or a datetime |
|
The combination of a date and time |
|
A real number with arbitrary precision that conforms to the xsd:decimal speci… |
|
A real number that conforms to the xsd:double specification |
|
A real number that conforms to the xsd:float specification |
|
An integer |
|
A string encoding a JSON Path |
|
A string encoding a JSON Pointer |
|
Prefix part of CURIE |
|
A URI, CURIE or BNODE that represents a node in a model |
|
A URI or CURIE that represents an object in the model |
|
A string encoding a SPARQL Property Path |
|
A character string |
|
A time object represents a (local) time of day, independent of any particular… |
|
a complete URI |
|
a URI or a CURIE |
Subsets¶
Subset |
Description |
|---|