Volume 10 Number 4 (Oct. 2020)
Home > Archive > 2020 > Volume 10 Number 4 (Oct. 2020) >
IJBBB 2020 Vol.10(4): 161-169 ISSN: 2010-3638
DOI: 10.17706/IJBBB.2020.10.4. 161-169

Mining Frequent Patterns in Bioinformatics Workflows

C. R. Wijesinghe, A. R. Weerasinghe
Abstract— The goal of workflow systems is to put away the disadvantages of the state-of-the-art methods of scientific data analysis, mostly in Perl or similar scripting languages. Scientific workflow systems enable the development of analysis pipelines, provenance management, process control, recovery, scheduling and parallelization of individual tasks, understandability and sharing of workflows among the scientific community. There are several workflow systems to design bioinformatics workflows. The objective of this work is to identify the frequent workflow patterns or substructures in a corpus of Galaxy bioinformatics workflows obtained from myExperiment. Frequent sub graph discovery (FSG) algorithm used in analyzing the workflows. Seventy-one reusable workflow patterns identified with a 5% minimum support threshold. As future work planning to annotate the identified frequent patterns and to encode the identified patterns in the workflow systems with the objective of improving the usability by providing a high-level abstract interface to the user.

Index Terms— Frequent subgraph mining, galaxy workflows, substructures in workflows, workflow abstraction.

C. R. Wijesinghe and A. R. Weerasinghe are with University of Colombo School of Computing, Colombo 7, Sri Lanka. C. R. Wijesinghe is also with Faculty of Graduate Studies, University of Colombo, Colombo 7, Sri Lanka.

Cite: C. R. Wijesinghe, A. R. Weerasinghe, " Mining Frequent Patterns in Bioinformatics Workflows," International Journal of Bioscience, Biochemistry and Bioinformatics vol. 10, no. 4, pp. 161-169, 2020.

Copyright © 2020 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

General Information

ISSN: 2010-3638 (Online)
Abbreviated Title: Int. J. Biosci. Biochem. Bioinform.
Frequency: Quarterly 
DOI: 10.17706/IJBBB
Editor-in-Chief: Prof. Ebtisam Heikal 
Abstracting/ Indexing:  Electronic Journals Library, Chemical Abstracts Services (CAS), Engineering & Technology Digital Library, Google Scholar, and ProQuest.
E-mail: ijbbb@iap.org
  • Sep 28, 2020 News!

    IJBBB Vol 10, No 4 has been published online! [Click]

  • Jun 22, 2020 News!

    IJBBB Vol 10, No 3 has been published online! [Click]

  • Apr 02, 2020 News!

    The papers published in Vol 10, No 2 have all received dois from Crossref [Click]

  • Mar 25, 2020 News!

    IJBBB Vol 10, No 2 has been published online!  [Click]

  • Dec 13, 2019 News!

    The papers published in Vol 10, No 1 have all received dois from Crossref [Click]

  • Read more>>