Statistical and computational methods using a multiscale approach for protein identification and quantification. Proteins are critically important in the onset and ongoing illness associated with disease. Key proteins may serve as markers to diagnose or predict the course of a disease, or even become the target of pharmaceuticals. Accurate, efficient and robust algorithms are a critical component in protein identification. This research provides novel statistical algorithms for protein identific ....Statistical and computational methods using a multiscale approach for protein identification and quantification. Proteins are critically important in the onset and ongoing illness associated with disease. Key proteins may serve as markers to diagnose or predict the course of a disease, or even become the target of pharmaceuticals. Accurate, efficient and robust algorithms are a critical component in protein identification. This research provides novel statistical algorithms for protein identification using multiscale analysis techniques. Their applications in the bio-medical field will enable Australian and international researchers to identify key proteins more accurately, than current methods, leading to improve health, medical, and biological research outcomes.Read moreRead less
Innovations in Bayesian likelihood-free inference. Bayesian inference is a statistical method of choice in applied science. This project will develop innovative tools which permit Bayesian inference in problems considered intractable only a few years ago. These methods will expedite advances in multidisciplinary research across a range of applications. With these foundations, this project will accelerate national research efforts into improving frameworks for projecting trends in water availabil ....Innovations in Bayesian likelihood-free inference. Bayesian inference is a statistical method of choice in applied science. This project will develop innovative tools which permit Bayesian inference in problems considered intractable only a few years ago. These methods will expedite advances in multidisciplinary research across a range of applications. With these foundations, this project will accelerate national research efforts into improving frameworks for projecting trends in water availability and management, the impact of climate extremes, telecommunications engineering, HIV and infectious disease modelling and biostatistics. With many sectors unable to recruit appropriately trained statisticians within Australia, this project will train four PhD students in Bayesian statistics.
Read moreRead less
Statistical methods for analysing multi-source microarray data and building gene regulatory networks. I will devise a statistical learning technique that does not force a gene to be assigned to exactly one category. This technique reflects the biological reality that a gene can belong to two or more functional categories. Therefore, the new technique will improve a model's ability to identify regulatory genes in different types of cancer; these regulatory genes can be targeted by new anti-cancer ....Statistical methods for analysing multi-source microarray data and building gene regulatory networks. I will devise a statistical learning technique that does not force a gene to be assigned to exactly one category. This technique reflects the biological reality that a gene can belong to two or more functional categories. Therefore, the new technique will improve a model's ability to identify regulatory genes in different types of cancer; these regulatory genes can be targeted by new anti-cancer drugs resulting in a more effective treatment. I will model gene regulatory networks using microarray data from multiple sources. These networks will be used to identify regulatory cliques - a group of genes that are vital for a cellular function. This will improve our understanding of debilitating conditions such as asthma.Read moreRead less
Asymptotic Expansions and Large Deviations in Probability and Statistics: Theory and Applications. Statistics is the major enabling science in a number of disciplines. This is fundamental research in probability and statistics but it has wide applications in Biology and Social Sciences which will ultimately be of national benefit. The behaviour of self normalized sums is an exciting new area of fundamental research that has implications for the application of statistics in many areas. U-statist ....Asymptotic Expansions and Large Deviations in Probability and Statistics: Theory and Applications. Statistics is the major enabling science in a number of disciplines. This is fundamental research in probability and statistics but it has wide applications in Biology and Social Sciences which will ultimately be of national benefit. The behaviour of self normalized sums is an exciting new area of fundamental research that has implications for the application of statistics in many areas. U-statistics for dependent situations has direct application to understanding financial time series and the analysis of sample survey data. Saddlepoint methods provide extremely accurate approximations in a number of important applications.
Read moreRead less
Empirical saddlepoint approximations and self-normalized limit theorems. Finite population sampling and resampling methods such as the bootstrap and randomization methods are central in a number of areas of application and M-estimates are the major method used to give robust methods under mild conditions; in both these areas statistics are used which are Studentized or self-normalized. We will develop asymptotic approaches for such statistics. Saddlepoint and empirical saddlepoint methods will ....Empirical saddlepoint approximations and self-normalized limit theorems. Finite population sampling and resampling methods such as the bootstrap and randomization methods are central in a number of areas of application and M-estimates are the major method used to give robust methods under mild conditions; in both these areas statistics are used which are Studentized or self-normalized. We will develop asymptotic approaches for such statistics. Saddlepoint and empirical saddlepoint methods will be used to give methods which have second order relative accuracy in large deviation regions and we will obtain limit results and Edgeworth approximations. Emphasis will be on obtaining results under weak conditions necessary for applications.Read moreRead less
Inference for Hawkes processes with challenging data. The Hawkes processes are statistical models for the analysis of high-impact event sequences, such as bushfires, earthquakes, infectious diseases, and cyber attacks. When the times and/or marks are missing for some events or when the data is otherwise incomplete, it is challenging to fit these models and perform diagnostic checks on the fitted models. This project aims to develop novel statistical methods to fit these models in the presence of ....Inference for Hawkes processes with challenging data. The Hawkes processes are statistical models for the analysis of high-impact event sequences, such as bushfires, earthquakes, infectious diseases, and cyber attacks. When the times and/or marks are missing for some events or when the data is otherwise incomplete, it is challenging to fit these models and perform diagnostic checks on the fitted models. This project aims to develop novel statistical methods to fit these models in the presence of incomplete data and to check the goodness-of-fit of the fitted models. The expected outcomes include publications documenting these methods and software packages implementing them. The primary benefits include the advancement of statistical methodology and the training of junior research personnel. Read moreRead less
Semiparametric Regression for Streaming Data. Semiparametric regression converts large and complex data-sets into interpretable summaries from which sound decisions can be made. This project tackles semiparametric regression analysis of streaming data - where the data are so voluminous that they may not be storable in standard computer memory and therefore need to be processed rapidly on arrival and then discarded. Effective solutions necessitate a rethinking of semi-parametric regression and ne ....Semiparametric Regression for Streaming Data. Semiparametric regression converts large and complex data-sets into interpretable summaries from which sound decisions can be made. This project tackles semiparametric regression analysis of streaming data - where the data are so voluminous that they may not be storable in standard computer memory and therefore need to be processed rapidly on arrival and then discarded. Effective solutions necessitate a rethinking of semi-parametric regression and new approaches will be developed. The project will also develop novel theory and methodology for robotics applications. It will allow analysis of streaming and massive data sets that would not be possible using currently available methods, opening up new applications.Read moreRead less
Fast approximate inference methods: new algorithms, applications and theory. This project aims to develop new algorithms and theory for fast approximate inference and lay down infrastructure to aid future extensions. Fast approximate inference methods are a principled and extensible means of fitting large and complex statistical models to big data sets. They come into their own in applications where speed is paramount and traditional approaches are not feasible. The project aims to lead to prac ....Fast approximate inference methods: new algorithms, applications and theory. This project aims to develop new algorithms and theory for fast approximate inference and lay down infrastructure to aid future extensions. Fast approximate inference methods are a principled and extensible means of fitting large and complex statistical models to big data sets. They come into their own in applications where speed is paramount and traditional approaches are not feasible. The project aims to lead to practical outcomes from better business decision-making for insurance data warehouses, to improved medical imaging technology.Read moreRead less
New Bayesian methodology for understanding complex systems using hidden Markov models and expert opinion, environmental, robotics and genomics applications. This project aims to merge four areas of intense international interest in describing complex systems: hidden Markov models and mixtures, semi-parametric and nonparametric approaches, true combination of expert opinion with data, and new Bayesian computational methods based on perfect sampling and particle sampling. The project will signific ....New Bayesian methodology for understanding complex systems using hidden Markov models and expert opinion, environmental, robotics and genomics applications. This project aims to merge four areas of intense international interest in describing complex systems: hidden Markov models and mixtures, semi-parametric and nonparametric approaches, true combination of expert opinion with data, and new Bayesian computational methods based on perfect sampling and particle sampling. The project will significantly contribute to statistical methodology and its ability to inform about real-world problems. A strong focus on applications to genomics, robotics and environmental modelling will bring immediate research and monetary benefit for industry. Expected outcomes include enhanced cross-disciplinary and international linkages, publications, industry-funded projects and highly trained graduates.Read moreRead less
Statistical methods and tools for integrative microarray analysis. Tools used for biological and medical research have been evolving and there has been an increase in high-throughput technologies such as genome sequencing and DNA microarray. The growing number of entries and the increasing availability of public microarray repositories and other sequence databases have generated the new challenge of developing tools to efficiently integrate data by different research groups. This research provi ....Statistical methods and tools for integrative microarray analysis. Tools used for biological and medical research have been evolving and there has been an increase in high-throughput technologies such as genome sequencing and DNA microarray. The growing number of entries and the increasing availability of public microarray repositories and other sequence databases have generated the new challenge of developing tools to efficiently integrate data by different research groups. This research provides new statistical methods to integrate different data sets. Its application in the biomedical field will allow researchers to effectively interpret the myriad of data generated within the community.Read moreRead less