Home >> Science >> Chemistry >> Chemical Databases


  Directories of Commercial Chemicals
MSDS
  Organic Chemistry
Software
   


The chemical database occurs as database specifically designed to store chemical information. Virtually all chemical databases store trading tools in stable molecules. Chemical structures come traditionally represented applying lines indicating chemical bonds between atoms and drawn on paper (Second). When which are actually idealistic ocular representations for the chemist, it is unsuitable for computational utilise & especially for lookup & storage.

Big chemical databases come required to cover hundreds to thousands of molecules ingesting terabytes of physical memory.

Representation

There are ii chief techniques for representing chemical structures inside digital databases

When connectiin tables / contiguousness matrices / lists sustaining extra trading tools on attach (edges) & atom attributes (nodes) eg: MDL MOL, PDB, CML When string notation according to depth foremost or even breadth number one traversal eg: SMILES/SMARTS, SLN, WLN, InChI

These approaches use at times been refined to allow representation of stereochemical differences and charges too when favorite sort of bonding like people seen inside organo-metallic compounds. the main benefit of a computer representation is the possibility for increased storage & convenient, flexible look for.

Search

Chemists might seek databases utilizing area of structures, area of their IUPAC names when well as according to constraints in properties. Chemical databases come particularly different from either more general purpose databases in their trend lines for sub-structure research. This kinda look for is achieved by wanting to find Sub-graph Isomorphism (sometimes too known as the monomorphism), a branch of Graph theory. A algorithmic program for shopping come computationally troglodytes troglodytes, typically of O (nThreesome) or even O (nQuaternion) period complexness (inorth which n is the total of atoms taking part). a troglodytes troglodytes component of seek is known as atom-by-atom-looking (ABAS), where a mapping of the seek substructure atoms & bonds by owning the target molecule is sought. ABAS shopping unremarkably makes have of Ullman's algorithm or variations of it. Quickening come achieved by time amortization, that is, some of the instance in seek tasks come saved up by applying precomputed facts. This precomputation generally involves creation of bit-strings representing presence or even absence of molecular fragments. By shopping at a fragments present inside a look for structure these come conceivable to eliminate a want for ABAS comparison sustaining target molecules that don't possess the fragments that are present in the seek structure. This elimination is known as screening. A bit-strings utilized for these applications come as well known as structural-keys. A performance of such keys depends on a guide of a fragments utilized for constructing the keys & the probability of their presence in the database molecules. A second sort of key makes apply of hash-codes according to fragments derived computationally. Which are actually known as 'fingerprints' although a term is another time utilized synonymously using structural-keys. A total of memory required to store these structural-keys & fingerprints may be reduced by 'folding', which is achieved by combining area of a key utilizing bitwise-operations & thereby reducing the overall length.

Similarity

No lone definition of molecular similarity, nonetheless the construct can be defined based on data from a application & is typically described as an opposite of a measure of few feet away. 2 molecules can exist as considered additional similar whenever their difference within molecular weights is moo than while equated by using others. The kind of more measures can be combined to create the multi-variate few feet away measure. Few feet away measures come typically classified into Euclidean measures and non-Euclidean measures.

Databases can be clustered into families of 'similar' molecules according to similarities. Two hierarchic & non-nonhierarchic clustering approaches may be applied to chemical able by having multiple attributes. These attributes or even even molecular properties will either become determined through empirical observation or computationally derived descriptors. One of a virtually all popular clustering approaches is the Jarvis-Patrick (k-nearest neighbours) algorithm.

Registration systems

Databases systems for maintaining unique records in compound come termed when Registration systems. Which are actually typically utilized for chemical indexing, patent systems and industrial databases.

Registration systems commonly enforce singularity of the chemical substance represented in the database through the utilise of unique representations. By using system of precedence for the generation of stringified notations, a single may obtain unique/'canonical' string representations like 'canonical SMILES'. A few registration systems like a CAS rules produce have of algorithmic rule to generate unique hash codes to achieve the equivalent objective.

Registration systems besides preprocess molecules to keep away from looking for trivial differences like differences inside halogen ions in chemical substance.

Tools

A computational representations come normally processed transparent to chemists by graphic display of the information. Information entry is besides simplified through the utilise of chemical structure editors. These editors internally convert a graphic information into computational representations.

There are too many algorithmic program for the interconversion of various formats of representation. An open-source utility for conversion is OpenBabel.

Algorithmic program for the conversion of IUPAC names to structure representations and the other way around come besides utilized for extracting structural information from text. Even so there are difficulties due to the being of multiple accent of IUPAC. Functiin is on to establish the unique IUPAC standard (Watch InChI).

Chemical Abstracts Service (CAS)
World's largest and most comprehensive chemical database with over 21 million document records and 37 million substances listed.

Chemcyclopedia Plus
An annual supplement to Chemical & Engineering News (C&EN). Provides users/purchasers of chemicals with trade names, packaging, special shipping requirements, potential applications, and CAS Registry Numbers. Chemicals are listed in categories.

Agency for Toxic Substances and Disease Registry
Searchable database for publications and resources about the effect of hazardous substances on public health and the environment.

Database Searching at Liverpool
Chemistry Literature Searching (BIDS ISI, ISIS, Camb Struc Data, Beilstein, journals).

SPECS and BioSPECS B.V.
Provider of chemical compound library's for HTS in vials or plates.

JAICI
JAICI is a not-for-profit scientific information service organization featuring, in particular, chemical expertise.

MDLI Information Systems
Provider of chemical structural databases and information management software to the chemical, agrochemical, and pharmaceutical industry.

Hazardous Chemical Database
Contains names, formulas and registry numbers for 1991 hazardous chemicals. This site was last updated in 1996.

NIST Chemistry WebBook
Database of organic chemistry compounds, organized by species. Contains chemical and physical property data on over 30,000 compounds.

The ChemExper Chemicals Directory
A catalog of 60000 fine chemicals searchable for free using a chemist-friendly search engine. Products are listed with physical data including IR spectra and MSDS. Contributions to the database possible with dedicated (and free) program.


Science: Biology: Toxicology
Science: Reference





© 2005 GeneralAnswers.org