Statistical methods for detection of non-coding RNAs in eukaryote genomes. Understanding how eukaryotic cells work is a major goal of 21st century biology. A crucial step will be to catalogue the functional components of eukaryotic genomes. Australian researchers must be involved in this process at an early stage, in order to maximise commercial opportunities, attract quality researchers and position ourselves for further advances. This project will make major contributions to international effo ....Statistical methods for detection of non-coding RNAs in eukaryote genomes. Understanding how eukaryotic cells work is a major goal of 21st century biology. A crucial step will be to catalogue the functional components of eukaryotic genomes. Australian researchers must be involved in this process at an early stage, in order to maximise commercial opportunities, attract quality researchers and position ourselves for further advances. This project will make major contributions to international efforts in this area, via the development of statistical methods for segmenting genomes, classification of those segments, and study of the resulting classes. In the long term, enhanced understanding of eukaryotic cells will lead to breakthroughs in biology, and to medical, pharmaceutical, agricultural and scientific advances.Read moreRead less
Classification of Microarray Gene-Expression Data. The broad aim is to provide statistical methodology for the classification of microarray gene-expression data. Microarrays are part of a new biotechnology that allows the monitoring of expression levels for thousands of genes simultaneously. The explosion in microarrays has produced massive quantities of data that require new statistical techniques for analysis in order to exploit their enormous scientific potential. One of the main uses of ....Classification of Microarray Gene-Expression Data. The broad aim is to provide statistical methodology for the classification of microarray gene-expression data. Microarrays are part of a new biotechnology that allows the monitoring of expression levels for thousands of genes simultaneously. The explosion in microarrays has produced massive quantities of data that require new statistical techniques for analysis in order to exploit their enormous scientific potential. One of the main uses of the methodology to be developed is to expedite the discovery of new subclasses of diseases. Another is to provide prediction rules for the diagnosis and treatment of diseases.Read moreRead less
New Directions in Bayesian Statistics: formulation, computation and application to exemplar challenges. Bayesian statistics is a fundamental statistical and machine learning approach for density estimation, data analysis and inference. However, there remain open questions regarding the formulation of the model, the likelihood and priors, and efficient computation. This project proposes new approaches that address these issues, and applies them to two exemplar challenges: the impact of climate ch ....New Directions in Bayesian Statistics: formulation, computation and application to exemplar challenges. Bayesian statistics is a fundamental statistical and machine learning approach for density estimation, data analysis and inference. However, there remain open questions regarding the formulation of the model, the likelihood and priors, and efficient computation. This project proposes new approaches that address these issues, and applies them to two exemplar challenges: the impact of climate change on the Great Barrier Reef and better understanding neurological diseases related aging, in particular Parkinson's Disease. Read moreRead less
Robust inferences for analysis of longitudinal data. This project will develop novel statistical tools. Outcomes of this project will enable more reliable data analysis and more cost effective designs in environmental and biological studies.
Statistical Methods for Discovering Ribonucleic acids (RNAs) contributing to human diseases and phenotypes. Identifying the causative genetic factors involved in quantitative phenotypes and diseases is a major goal of biology in the 21st century and beyond. A crucial step towards this goal is identifying and classifying the functional non-protein-coding Ribonucleic acids (RNAs) encoded in the human genome. This project will make major contributions to international efforts in this area by identi ....Statistical Methods for Discovering Ribonucleic acids (RNAs) contributing to human diseases and phenotypes. Identifying the causative genetic factors involved in quantitative phenotypes and diseases is a major goal of biology in the 21st century and beyond. A crucial step towards this goal is identifying and classifying the functional non-protein-coding Ribonucleic acids (RNAs) encoded in the human genome. This project will make major contributions to international efforts in this area by identifying RNA molecules that contribute to quantitative phenotypes including susceptibility to disease. As such, it will directly benefit fundamental science via the discovery and classification of new molecules. Indirectly, it will lead to breakthroughs in biology, and consequently to major medical and pharmaceutical advances in the diagnosis and treatment of genetic disease.Read moreRead less
Applications of Bayesian methods in Genomics and Comparative Genomics. Bayesian statistics provides a unified and versatile approach to problems of data analysis, inference and hypothesis testing. This project will involve the application of Bayesian methods to four topics of commercial and scientific importance in the fields of Genomics and Comparative Genomics. The four topics are: data analysis for a novel DNA sequencing technology, investigating genomic structure using multiple change-point ....Applications of Bayesian methods in Genomics and Comparative Genomics. Bayesian statistics provides a unified and versatile approach to problems of data analysis, inference and hypothesis testing. This project will involve the application of Bayesian methods to four topics of commercial and scientific importance in the fields of Genomics and Comparative Genomics. The four topics are: data analysis for a novel DNA sequencing technology, investigating genomic structure using multiple change-point analysis, phlogenetic inference with multiple genes and detection of incongruent phylogenies. The overall goal of the project is to advance understanding of the structure, function and evolution of genomes.Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE160100741
Funder
Australian Research Council
Funding Amount
$382,274.00
Summary
Tractable Bayesian algorithms for intractable Bayesian problems. This project seeks to develop computationally efficient and scalable Bayesian algorithms to estimate the parameters of complex models and ensure inferences drawn from the models can be trusted. Bayesian parameter estimation and model validation procedures are currently computationally intractable for many complex models of interest in science and technology. These include biological processes such as the efficacy of heart disease, ....Tractable Bayesian algorithms for intractable Bayesian problems. This project seeks to develop computationally efficient and scalable Bayesian algorithms to estimate the parameters of complex models and ensure inferences drawn from the models can be trusted. Bayesian parameter estimation and model validation procedures are currently computationally intractable for many complex models of interest in science and technology. These include biological processes such as the efficacy of heart disease, wound healing and skin cancer treatments. Potential outcomes of the project include new algorithms to significantly economise computations and improved understanding of the mechanisms of experimental data generation. Improved models of wound healing, skin cancer growth and heart physiology supported by these algorithms could improve population health.Read moreRead less
Advanced Mixture Models for the Analysis of Modern-Day Data. Extracting key information from huge data sets is critical to the scientific successes of the future. This project will develop novel mixture models that can be used directly to analyse complex and high-dimensional data sets that may consist of thousands of variables observed on only a limited number of entities. In order to handle the challenging problems arising in the latter situation. This project develops mixtures of factor models ....Advanced Mixture Models for the Analysis of Modern-Day Data. Extracting key information from huge data sets is critical to the scientific successes of the future. This project will develop novel mixture models that can be used directly to analyse complex and high-dimensional data sets that may consist of thousands of variables observed on only a limited number of entities. In order to handle the challenging problems arising in the latter situation. This project develops mixtures of factor models with options for skew distributions that can be used to effectively analyse such data. Key applications include the domains of bioinformatics, biostatistics, business, data mining, economics, finance, image analysis, marketing, and personalised medicine, among many others.Read moreRead less
Joint clustering and matching of multivariate samples across objects. The project will provide a novel and very effective approach to the clustering of multivariate samples on objects, say patients, that automatically matches the sample clusters across the objects. A key application is the matching of biologically relevant cell subtypes across patients for use in the study and the clinical diagnosis and prognosis of cancer.
Expanding the role of mixture models in statistical analyses of big data. This project aims to develop theoretical procedures to scale inference and learning algorithms to analyse big data sets. It will develop analytic tools and algorithms to analyse big data sets which classical methods of inference cannot analyse directly due to the data’s complexity or size. This will accelerate the progress of scientific discovery and innovation, leading, for example, to new fields of inquiry; to an increas ....Expanding the role of mixture models in statistical analyses of big data. This project aims to develop theoretical procedures to scale inference and learning algorithms to analyse big data sets. It will develop analytic tools and algorithms to analyse big data sets which classical methods of inference cannot analyse directly due to the data’s complexity or size. This will accelerate the progress of scientific discovery and innovation, leading, for example, to new fields of inquiry; to an increase in understanding from studies on human and social processes and interactions; and to the promotion of economic growth and improved health and quality of life. Such applications should lead to breakthrough discoveries and innovation in science, engineering, medicine, commerce, education and national security.Read moreRead less