Skewed Distribution Models: Data Analysis, Identification, and Applications in Biomolecular Systems and R-loop Biology of Cancer
dc.contributor.author | Grageda, Andre | |
dc.date.accessioned | 2023-10-04T13:34:35Z | |
dc.date.available | 2023-10-04T13:34:35Z | |
dc.date.issued | 2023-09 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12648/13032 | |
dc.description.abstract | Modeling and computational analysis can be used to crystallize, integrate, and extract knowledge from large datasets generated by biology, medicine, and next-generation sequencing. The use of probability models, multifactor hypothesis testing, and computational analyses is crucial to studies in systems biology. These studies provide insights into understanding large and diverse molecular biology data sets. It is no longer enough to study individual molecules, their properties, and their interactions with other molecules in cells and organisms. In addition to generating numerous case studies with unique data, such studies provide a limited understanding of the underlying complexity and dynamics of the leading mechanisms determining the states and behaviors of a whole biological system. Sequencing and multi-omics experiments generate big data needed to model processes, organization and behavior of biological systems in a more comprehensive, less biased manner. Analysis of such enormously heterogeneous and complex information requires mathematical models and computational algorithms. It is the motivation and challenge of current systems biology and medicine. Applied to cancer systems biology, we will consider basic probabilistic aspects of big data. We study skewed frequency distributions commonly observed in diverse omics experiments. We focus on modeling and developing computational algorithms to quantify big data's statistical characteristics, aiming for accurate and unbiased characterization of the systems variation. In several applications, we focus on the identification of the skewed distributions for quantification and differentiation of the of R-loop formation patterns in non-cancer, pre-malignant states and cancer genomes. Current studies involving R-loops rely on the S9.6 antibody which generates noisy signals. We show that using R-loop forming sequences for filtering specific S9.6 signals selects biologically meaningful signals. R-loops have been shown to play a role in tumorigenesis. Using our R-loop forming sequence enrichment method, we investigate the roles of R-loops in tumorigenesis across different detection modalities primarily in breast cancer. | en_US |
dc.language.iso | en_US | en_US |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Birth-Death Process Model | en_US |
dc.subject | R-loops | en_US |
dc.subject | Skewed Distributions | en_US |
dc.subject | R-loop Forming Sequence | en_US |
dc.subject | BRCA1-mutation | en_US |
dc.subject | Breast Cancer | en_US |
dc.title | Skewed Distribution Models: Data Analysis, Identification, and Applications in Biomolecular Systems and R-loop Biology of Cancer | en_US |
dc.type | Dissertation | en_US |
dc.description.version | NA | en_US |
refterms.dateFOA | 2023-10-04T13:34:36Z | |
dc.description.institution | Upstate Medical University | en_US |
dc.description.department | Biochemistry & Molecular Biology | en_US |
dc.description.degreelevel | PhD | en_US |
dc.description.advisor | Kuznetsov, Vladimir | |
dc.date.semester | Fall 2023 | en_US |