Volume 10 Number 4 (Oct. 2020)
Home > Archive > 2020 > Volume 10 Number 4 (Oct. 2020) >
IJBBB 2020 Vol.10(4): 161-169 ISSN: 2010-3638
DOI: 10.17706/IJBBB.2020.10.4. 161-169

Mining Frequent Patterns in Bioinformatics Workflows

C. R. Wijesinghe, A. R. Weerasinghe
Abstract— The goal of workflow systems is to put away the disadvantages of the state-of-the-art methods of scientific data analysis, mostly in Perl or similar scripting languages. Scientific workflow systems enable the development of analysis pipelines, provenance management, process control, recovery, scheduling and parallelization of individual tasks, understandability and sharing of workflows among the scientific community. There are several workflow systems to design bioinformatics workflows. The objective of this work is to identify the frequent workflow patterns or substructures in a corpus of Galaxy bioinformatics workflows obtained from myExperiment. Frequent sub graph discovery (FSG) algorithm used in analyzing the workflows. Seventy-one reusable workflow patterns identified with a 5% minimum support threshold. As future work planning to annotate the identified frequent patterns and to encode the identified patterns in the workflow systems with the objective of improving the usability by providing a high-level abstract interface to the user.

Index Terms— Frequent subgraph mining, galaxy workflows, substructures in workflows, workflow abstraction.

C. R. Wijesinghe and A. R. Weerasinghe are with University of Colombo School of Computing, Colombo 7, Sri Lanka. C. R. Wijesinghe is also with Faculty of Graduate Studies, University of Colombo, Colombo 7, Sri Lanka.

Cite: C. R. Wijesinghe, A. R. Weerasinghe, " Mining Frequent Patterns in Bioinformatics Workflows," International Journal of Bioscience, Biochemistry and Bioinformatics vol. 10, no. 4, pp. 161-169, 2020.

Copyright © 2020 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

General Information

ISSN: 2010-3638 (Online)
Abbreviated Title: Int. J. Biosci. Biochem. Bioinform.
Frequency: Quarterly 
DOI: 10.17706/IJBBB
Editor-in-Chief: Prof. Ebtisam Heikal 
Abstracting/ Indexing:  Electronic Journals Library, Chemical Abstracts Services (CAS), Engineering & Technology Digital Library, Google Scholar, and ProQuest.
E-mail: ijbbb@iap.org
  • Sep 29, 2022 News!

    IJBBB Vol 12, No 4 has been published online! [Click]

  • Jun 23, 2022 News!

    News | IJBBB Vol 12, No 3 has been published online! [Click]

  • Dec 20, 2021 News!

    IJBBB Vol 12, No 1 has been published online!  [Click]

  • Sep 23, 2021 News!

    IJBBB Vol 11, No 4 has been published online! [Click]

  • Jun 25, 2021 News!

    IJBBB Vol 11, No 3 has been published online! [Click]

  • Read more>>