A method for creating an amalgamated bioinformatics database from at least
a first database and a second database is presented. Concepts are
identified in a first field from the records of the first database. A
second field from the records of the second database which has data
related to the first field is also identified. A first set of concepts is
identified by traversing a mediating database using terms associated with
the first field and a second set of concepts is also identified by
traversing the mediating database using terms associated with the second
field. Either the first set of concepts or the second set of concepts, or
both, is identified using non-trivial terminological mapping. The set of
related concepts in the first set of concepts and the second set of
concepts is identified and a record is generated in the amalgamated
bioinformatics database.