Skip to content


Source Code for Biology and Medicine

Open Access

KBWS: an EMBOSS associated package for accessing bioinformatics web services

Source Code for Biology and Medicine20116:8

Received: 12 February 2011

Accepted: 29 April 2011

Published: 29 April 2011


The availability of bioinformatics web-based services is rapidly proliferating, for their interoperability and ease of use. The next challenge is in the integration of these services in the form of workflows, and several projects are already underway, standardizing the syntax, semantics, and user interfaces. In order to deploy the advantages of web services with locally installed tools, here we describe a collection of proxy client tools for 42 major bioinformatics web services in the form of European Molecular Biology Open Software Suite (EMBOSS) UNIX command-line tools. EMBOSS provides sophisticated means for discoverability and interoperability for hundreds of tools, and our package, named the Keio Bioinformatics Web Service (KBWS), adds functionalities of local and multiple alignment of sequences, phylogenetic analyses, and prediction of cellular localization of proteins and RNA secondary structures. This software implemented in C is available under GPL from and GitHub repository Users can utilize the SOAP services implemented in Perl directly via WSDL file at (RPC Encoded) and (Document/literal).


Proxy ServerSequence LogoLocal ToolCommon Gateway InterfaceRest Service


With more than 1700 services listed in the BioCatalogue at the time of this writing [1], a significant number of biological resources and tools is provided as web-based services, in light of their advantages in interoperability, ease of use, and the lack of requirements for the compute infrastructure as well as the effort in installation and maintenance [2]. Several efforts have already utilized hundreds of these resources with the aim to effectively integrate them into a bioinformatics cyberinfrastructure, allowing the creation and management of research workflows through syntactic and semantic integration [3]. BioMoby project [4] maintains a repository of service descriptions with controlled vocabularies so that services can be discovered from their purpose and from their input/output data types, and MOWServ project [5] further classifies and curates the services in order to allow automatic workflow creation [6]. A number of clients with rich graphical user interface such as Seahawk [7] and Taverna [8] are also available as front-end research environment.

While the advent of web-based services is expected to proliferate further especially in light of the introduction of next-generation sequencers and their huge masses of data [9], local tools will continue to be important for numerous purposes, and web-based services need to be integrated with these local tools. Therefore, here we present the Keio Bioinformatics Web Service (KBWS), a collection of proxy client tools for 42 major bioinformatics web-based services in the form of European Molecular Biology Open Software Suite (EMBOSS) [10] associated software (EMBASSY) package of UNIX command-line utilities. EMBOSS is one of the most widely used collection of more than 200 bioinformatics tools, which takes advantage of the UNIX environment for "piping" with any other UNIX commands. With its Ajax Command Definitions (ACD), EMBOSS is well curated with controlled vocabularies and semantics so that the tools are highly documented (with tfm command), discoverable (with wossname command), and interoperable. Advantages of the EMBOSS platform also includes a unified sequence data retrieval system with Universal Sequence Address (USA) as well as the availability of many front-end user interfaces, such as JEMBOSS [11], SoapLab [12], wEMBOSS [13], and EMBOSS Explorer [14].


KBWS is composed of two parts: a proxy web server providing SOAP service wrapper to bioinformatics web services, and UNIX command-line clients in the form of EMBOSS tools that access the proxy web server. By making the clients access the proxy server instead of the original service providers, the clients can be lightweight. Therefore, same client tool can be used without update or maintenance even the original service providers change their formats, and KBWS can utilize bioinformatics web-based services provided in any protocol (such as SOAP, REST - Representational State Transfer, or browser-based CGI - Common Gateway Interface) of any versions [15]. This proxy server is able to deal immediately with specification change of original service by regular automatic monitoring to check whether the proxy server returns correct report. Even if the original server is down, KBWS is able to stably provide same services without update to the client tools, by switching the server to analogous web-based services at the proxy server. Moreover, users can access the proxy server via SOAP, which provides all 42 services under the same programming interface defined in a single Web Service Description Language (WSDL). WSDL files that are provided as RPC Encoded style and Document/literal style allow access to KBWS from web-based services client software or programming languages. The proxy server provides access to 9 SOAP services, 3 REST services, 31 CGI services and 1 service installed in the web server, and is implemented using SOAP::Transport::HTTP Perl module. Three services provided by NCBI, EBI, and DDBJ, respectively, are used for a single tool kblast, and therefore there is a total of 42 tools accessing 44 services. List of supported web services in KBWS is shown in Table 1.
Table 1

List of supported services


Service Name


Tool Name



Altschul et al., 1990



McWilliam et al., 2009



Mackey et al., 2002



McWilliam et al., 2009




Larkin et al., 2007



McWilliam et al., 2009



Katoh et al., 2009




Lassmann et al., 2006



McWilliam et al., 2009



Edgar, 2004




Notredame et al., 2000



McWilliam et al., 2009




Crooks et al., 2004




Lukashin et al., 1998




Delcher et al., 1999




Lowe et al., 1997




Nakai et al., 1991




Nakai et al., 1991




Yu et al., 2010




Horton et al., 2007




Lukas et al., 2004



McWilliam et al., 2009




Labarga et al., 2007

kfetchdata, kfetchbatch


McWilliam et al., 2009



Centroid Fold

Sato et al., 2009




Hofacker et al., 1994




Kono et al., 2009




Lim et al., 1999

kclique, kcontml


kdnacomp, kdnadist


kdnainvar, kdnaml


kdnamlk, kdnapars


kdnapenny, kdollop


kdolpenny, kfitch


kgendist, kkitsch, kmix


kneighbor, kpenny


kprotdist, kprotpars


krestml, kseqboot

The software categories are based on those used in EMBOSS. Complete listings including full references and URLs are available at our website (

Each of the 42 UNIX command-line clients accesses a unique bioinformatics web service through KBWS proxy. These tools are implemented as EMBASSY package in C programming language with gSOAP Toolkit [16], and are available under GNU General Public License from and GitHub repository Detailed documentation for each of the tools are available through the EMBOSS tfm (The Fine Manual) command, and tools can be searched and discovered with wossname utility. As an EMBOSS package, KBWS can be utilized with graphical user interfaces through JEMBOSS and wEMBOSS, and browser-based access with EMBOSS Explorer is also available at our website (

Results and Discussion

The following set of commands comprises a workflow for generating a sequence logo image for a set of amino acid sequences of FOXP2 [17], using BLAST web service (kblast), extracting the list of IDs (sed and uniq), aligning the sequences with MUSCLE (kmuscle), extracting a certain region from the alignment (extractalign), and generating its sequence logo (kweblogo). Output from this workflow is shown in Figure 1. The definition of "swissprot" database used in the following workflow is available at
Figure 1

Sequence Logo created by example workflow. This sequence logo image is created by example workflow for generating a sequence logo image for a set of amino acid sequences of FOXP2. This workflow includes KBWS Tools (kblast, kmuscle, kweblogo), EMBOSS tool (extractalign), and other local tools (sed, uniq).

# search similar sequence in Swiss-Prot database using BLAST

% kblast swissprot:FOXP2_HUMAN -d swissprot -format k1 -eval 1e-50 -outfile kblast.out

# extract ID list from BLAST report

% sed 's/^\(.*\)\.[19]/swissprot:\1/g' kblast.out | uniq > match_list.out

# multiple sequence alignment using MUSCLE

% kmuscle @match_list.out -outfile kmuscle.fasta

# Extract a region from the alignment

% extractalign -regions '420-430' kmuscle.fasta -outseq extractalign.fasta

# Generation of sequence logos using WebLogo

% kweblogo extractalign.fasta

Other example workflows are available at myExperiment ( website as Taverna workflow (

Sample codes written by Perl, Ruby, Python and Java are also available at

KBWS provides 42 web-based services as an EMBOSS package. EMBOSS is already widely used in numerous laboratories, and therefore users can readily integrate web-based services with their EMBOSS tools and UNIX commands using KBWS. Moreover, KBWS adds the advantages of web-based services to the many functionalities of EMBOSS, especially for tools like BLAST search that require large regularly updated databases that are difficult to be installed and maintained with EMBOSS in traditional manner. KBWS also does not require any external software or data for installation except for EMBOSS. As an EMBOSS package, KBWS is well-documented and can take advantage of the documentations and discovery tools, and can be used from rich clients such as JEMBOSS, wEMBOSS, and EMBOSS Explorer.

List of abbreviations


Ajax Command Definitions


Common Gateway Interface


European Molecular Biology Open Software Suite


Keio Bioinformatics Web Service


Representational State Transfer


Universal Sequence Address


Web Service Description Language.



We would like to thank Nobuaki Kono and Keita Ikegami for technical advices. This research was supported by the Grant-in-Aid for Young Scientists No.222681029 from the Japan Society for the Promotion of Science (JSPS), and by funds from the Yamagata Prefectural Government and Tsuruoka City.

Authors’ Affiliations

Institute for Advanced Biosciences, Keio University, Fujisawa, Japan


  1. Bhagat J, Tanoh F, Nzuobontane E, Laurent T, Orlowski J, Roos M, Wolstencroft K, Aleksejevs S, Stevens R, Pettifer S, Lopez R, Goble CA: BioCatalogue: a universal catalogue of web services for the life sciences. Nucleic Acids Res. 2010, 38: W689-W694. 10.1093/nar/gkq394.PubMed CentralView ArticlePubMedGoogle Scholar
  2. Stein LD: Towards a cyberinfrastructure for the biological sciences: progress, visions and challenges. Nat Rev Genet. 2008, 9: 678-688.View ArticlePubMedGoogle Scholar
  3. Stockinger H, Attwood T, Chohan SN, Côté R, Cudré-Mauroux P, Falquet L, Fernandes P, Finn RD, Hupponen T, Korpelainen E, Labarga A, Laugraud A, Lima T, Pafillis E, Pagni M, Pettifer S, Phan I, Rahman N: Experience using web services for biological sequence analysis. Brief Bioinform. 2008, 9: 493-505. 10.1093/bib/bbn029.PubMed CentralView ArticlePubMedGoogle Scholar
  4. Consortium BioMoby: Interoperability with Moby 1.0--it's better than sharing your toothbrush!. Brief Bioinform. 2008, 9: 220-231.View ArticleGoogle Scholar
  5. Ramírez S, Muñoz-Mérida A, Karlsson J, García M, Pérez-Pulido AJ, Claros MG, Trelles O: MOWServ: a web client for integration of bioinformatic resources. Nucleic Acids Res. 2010, 38: W671-W676. 10.1093/nar/gkq497.PubMed CentralView ArticlePubMedGoogle Scholar
  6. Martin-Requena V, Ríos J, Garcia M, Ramirez S, Trelles O: jORCA: easily integrating bioinformatics Web Services. Bioinformatics. 2010, 26: 553-559. 10.1093/bioinformatics/btp709.View ArticlePubMedGoogle Scholar
  7. Gordon PM, Sensen CW: Seahawk: moving beyond HTML in Web-based bioinformatics analysis. BMC Bioinformatics. 2007, 8: 208-10.1186/1471-2105-8-208.PubMed CentralView ArticlePubMedGoogle Scholar
  8. Oinn T, Addis M, Ferris J, Marvin D, Senger M, Greenwood M, Varver T, Glover K, Pocock MR, Wipat A, Li P: Taverna: a tool for the composition and enactment of bioinformatics workflows. Bioinformatics. 2004, 20: 3045-3054. 10.1093/bioinformatics/bth361.View ArticlePubMedGoogle Scholar
  9. Stein LD: The case for cloud computing in genome informatics. Genome Biol. 2010, 11: 207-10.1186/gb-2010-11-5-207.PubMed CentralView ArticlePubMedGoogle Scholar
  10. Rice P, Longden I, Bleasby A: EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000, 16: 276-277. 10.1016/S0168-9525(00)02024-2.View ArticlePubMedGoogle Scholar
  11. Carver T, Bleasby A: The design of Jemboss: a graphical user interface to EMBOSS. Bioinformatics. 2003, 19: 1837-1843. 10.1093/bioinformatics/btg251.View ArticlePubMedGoogle Scholar
  12. Senger M, Rice P, Oinn T: Soaplab - a unified sesame door to analysis tools. Proceedings of the UK e-Science All Hands Meeting: 2-4 September 2003; Nottingham. Edited by: Cox SJ. 2003, EPSRC, 509-513.Google Scholar
  13. Sarachu M, Colet M: wEMBOSS: a web interface for EMBOSS. Bioinformatics. 2005, 21: 540-541.View ArticlePubMedGoogle Scholar
  14. EMBOSS Explorer. []
  15. Neerincx PB, Leunissen JA: Evolution of web services in bioinformatics. Brief Bioinform. 2005, 6: 178-188. 10.1093/bib/6.2.178.View ArticlePubMedGoogle Scholar
  16. van Engelen RA, Gallivan KA: The gSOAP Toolkit for Web Services and Peer-to-Peer Computing Networks. Proceedings of the 2nd IEEE International Symposium on Cluster Computing and the Grid: 21-24 May 2002; Berlin. 2002, IEEE Computer Society, 128-135.Google Scholar
  17. Enard W, Przeworski M, Fisher SE, Lai CS, Wiebe V, Kitano T, Monaco AP, Paabo S: Molecular evolution of FOXP2, a gene involved in speech and language. Nature. 2002, 418: 869-872. 10.1038/nature01025.View ArticlePubMedGoogle Scholar


© Oshita et al; licensee BioMed Central Ltd. 2011

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.