Programming Languages, Conferences, Hot Topics, Academic Programs & Companies
- Programming
- Perl
- Basic procedural scripting language
- Freely available, runs on any platform
- Does not need a compiler, just a simple text editor
- Forms the core of most Bioinformatics applications
- Relatively easy to learn
- Courses available
- Recommended books
- Recommended websites
- Bio Perl Project
http://bio.perl.org/
- The primary resource for using Perl in biology
- Archive of scripts, modules etc. Look here before re-writing it yourself
- Great source for help. Very friendly people.
- Maintain an active mailing list
- The Open Lab - http://theopenlab.uml.edu/
- A not-for-profit foundation established to sponsor open collaborations in research and software development in the scientific field of computational biology.
- The Perl Institute
http://www.perl.org/
- Comprehensive Perl Archive Network (CPAN)
http://www.cpan.org/
- Java
- Object oriented, cross-platform language
- Freely available
- Requires a compiler, free compiler available with the JDK
- Has a bright future in Bioinformatics
- Relatively easy to get started, can quickly become complex
- Courses available
- Recommended books
- Recommended websites
- C/C++
- procedural (C)/object oriented (C++)
- Requires compiler, some freely available for example at http://www.gnu.org/
- used where speed and direct memory access are important
- learning curve fairly steep
- Courses available
- Recommended Books
- Recommended websites
- SQL
- Not a "stand-alone" programming language
- Procedural
- Very easy to learn initial concepts
- Essential for working with true databases (Oracle, Sybase, MySQL)
- Courses available
- Recommended books
- Hot Topics
- Arrays and Expression Arrays
- NHGRI Microarray Project
http://www.nhgri.nih.gov/DIR/LCG/15K/HTML/
- Build your own arrayer
http://cmgm.stanford.edu/pbrown/mguide/index.html
- Affymetrix
http://www.affymetrix.com/
- Other resources
http://www.nhgri.nih.gov/DIR/LCG/15K/HTML/microlinks.html
- SAGE - Serial Analysis of Gene Expression
- Description of SAGE
http://www.sagenet.org/home/Description.htm
- A short sequence tag (10-14bp) contains sufficient information to uniquely identify a transcript provided that that the tag is obtained from a unique position within each transcript
- Sequence tags can be linked together to from long serial molecules that can be cloned and sequenced
- Quantitation of the number of times a particular tag is observed provides the expression level of the corresponding transcript.
- Resources
- Bioinformatics Portals or ASPs (Application Service Providers)
Basic idea: More efficient to have a central location to house expensive hardware and software with expert service provider on well maintained machines than to buy yourself.
Advantages
User pays for access to software as they use it
Users have access to more different types of software than they could afford on their own
ASPs provide a permanent place for storage of data and analysis results
Disadvantages
May be hard to maintain competence in house, resulting in risky level of dependance on the ASP
Slowdowns or failures at the ASP could delay research
- Fee for Service Model ASPs
- GCG's SeqWeb and Viaken
http://www.oxmol.com
http://www.viaken.com
- Uses GCG's SeqStore data management product which is a central database to store sequence data, primers, plasmids, etc
- Secure internet access
- Good technical support from Viaken
- eBioinformatics
http://www.ebioinformatics.com
- Has it's own suite of web based tools called BioNavigator
- Usage based fees for time used and amount of data stored
- Access to GCG programs through SeqWeb
- Access to proprietary software and databases
- Tools include PCR primer design, restriction mapping, protein secondary structure, database searching, multiple alignments and phylogenetic tree production
- GeneSolutions
http://www.genesolutions.com
- Pay as you go
- Access to Hyseq's proprietary database of 12 million+ sequences (mostly EST's), many rarely expressed
- Free access to store and run searches, pay to see the results
- Larger fees to get the clones corresponding to a search result
- Many tissue specific expression profiles for 50 tissue libraries, which can be added to the search results for another fee
- LabOnWeb
http://www.labonweb.com
- LabOnWeb elongates your query sequence to a full gene.
- Lists tissues in which your gene is probably expresssed based on SAGE data.
- Extensive sequence analysis includes gene function prediction, mapping and much more.
- Based on data from all major public and on proprietary data
- Two analysis per day free, pay for the rest by credit card
- "Free" ASPs
- DoubleTwist
http://www.doubletwist.com
- Automatic research agents to perform simple bioinformatic operations
- Can store your sequences and results on their site
- Many proprietary software and databases are available
- Many partnerships with vendors such as Alphagene, Invitrogen and Clontech such that you can purchase a clone from them if it was found in your result set
- The Biology Workbench
http://workbench.sdsc.edu/
- Single web interface to a wide variety of standard protein and nucleic acid analysis programs
- Can store sequences and results on their site
- Password protected and encrypted
- Many database searches available, don't have to worry about format issues
- GeneScape Portal
http://curatools.curagen.com/
- Store your sequences and results on their site
- Access proprietary CuraTools to perform a wide variety of database searches and analysis for DNA and protein
- Includes BLAST, Clustal W, CAP2, PROSITE, PRODOM, Pfam, etc
- Amplicom
http://www.amplicom.com/
- A true portal site, with much more than just bioinformatics tools
- Integrates genetic sequence data from 11,000 human genes , 6849 mouse genes, 6,307 yeast and 14,190 C elegans genes, 16 bacteria complete genomes and 5 Archaea complete genomes
- Provides links to a wide variety of analysis tools
- Conferences
- Cambridge Health Institute Conference Series
- Bioinformatics
- Integrated Bioinformatics
- In Silico Biology
- Proteomics
- Functional Genomics
- Drug Discovery Technology 2001 Organized by IBC USA Conferences
August 13-17, 2001, Boston, MA
- GCB2001 16th German Conference on Bioinformatics
October 7 - 10, 2001 Braunschweig, Germany
- PSB 2002 - Pacific Symposium on Biocomputing
January 2-8, 2002 Kaua'i Marriott, Kaua'i
- RECOMB 2001
April 21-24, 2001, Montreal, Canada
- ISMB 2002 - Intelligent Systems for Molecular Biology
August 3 - 7, 2002 Edmonton, Canada
- Academic Programs
The primary organization for Bioinformatics is the International Society for Computational Biology (http://www.iscb.org/). They have a large list of courses available.
- Online
- Sequence Analysis with Distributed Resources
http://bibiserv.techfak.uni-bielefeld.de/sadr/
- Bioinformatics Certificate Program at Stanford University
http://scpd.stanford.edu/ce/ndp/certificate/bioinformatics.html
- Universität Bielefeld, VSNS Biocomputing Division
http://merlin.mbcr.bcm.tmc.edu:8001/bcdusa/welcome.html
- The Principles of Protein Structure using the Internet
http://www.cryst.bbk.ac.uk/PPS2/index.html
- Getting the most from your hidden Markov Model
http://www.cse.ucsc.edu/research/compbio/ismb99.tutorial.html
- Regular
- Degree Programs
- Courses
- Job Finding and General Information
- Biospace
http://www.biospace.com
- BioPlanet
http://www.bioplanet.com
- Designed specifically for Bioinformatics
- GenomeJobs
http://www.genomejobs.com
- BioView
http://www.bioview.com
- Bioinformatics Companies
- DoubleTwist.com
http://www.doubletwist.com
- Type of products
- Thin-client/server software
- Main products
- GeneMill - project management software designed for DNA sequencing laboratories
- GeneWorld - comprehensive sequence analysis and annotation solution
- EST Clustering tools - allow rapid clustering and alignment of both ESTs and full-length sequences
- Pathway Tools - computationally predict pathways from genomic data
- Molecular Simulations Inc.
http://www.msi.com
- Product type - 3D simulation software
- Main products
- Cerius2 - Simulation environment for life and materials science
- WebLab - Web browser plug-in that allows molecular simulation and analysis
- Insight II - 3D graphical environment for molecular modeling
- Catalyst - Modeling, hypothesis generation and structure for drug discovery
- FELIX - Processing, display and analysis of NMR data
- QUANTA - Advanced tools for macromolecular X-ray crystallographers
- MDL Information Systems
http://www.mdli.com
- Developer of the CHIME plugin
- Product type - Software packages
- Target audience - Drug companies and cheminformatics
- Too many products to list
- Incyte
http://www.incyte.com
- Product type - Information and software
- Main products
- LifeSeq database collection - LifeSeq Gold currently contains 4.3 million ESTs, 3.3 million of which are Incyte-proprietary. This represents an estimated 90% of expressed human genes
- PathoSeq database collection - provides genetic data for a variety of microorganisms, with a focus on disease-causing bacteria and fungi.
- LifeTools - scalable, enterprise-wide system for processing, storing, analyzing, and managing both proprietary and public-domain genomic data. It gives scientists extensive project management capabilities, including the ability to track research projects, and easily share data with team members
- Microarrays - A wide variety of Human, plant and animal arrays and the software to analyze them
- Neomorphic Software
http://www.neomorphic.com
- Product type - Java-based bioinformatics tools
- Main Products
- Genome SDK - an advanced Java class library that can be tailored to a client's in-house bioinformatics requirements. Put the widgets together and coordinate them to provide dynamic displays of your data
- Labtrak - laboratory information management system. Neomorphic will configure Labtrack for your unique laboratory process.
|
|