Automated assessment of data quality in biological knowledge resources. This project aims to develop methods for identifying poor quality data in biological databases. Research in biomedicine is underpinned by massive databases of biological data. Data quality is largely managed through manual curation, but automated methods to assess quality are critically needed. This project expects to develop a suite of computational tools for assessing biological data quality, utilising an innovative approa ....Automated assessment of data quality in biological knowledge resources. This project aims to develop methods for identifying poor quality data in biological databases. Research in biomedicine is underpinned by massive databases of biological data. Data quality is largely managed through manual curation, but automated methods to assess quality are critically needed. This project expects to develop a suite of computational tools for assessing biological data quality, utilising an innovative approach based on network analysis of database record connectivity. These tools will enable quantifying data quality at scale. Researchers, evidence-based decision-makers in biomedicine, and the analytical or predictive tools that use this data will make more reliable inferences and decisions.Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE190101118
Funder
Australian Research Council
Funding Amount
$339,000.00
Summary
High performance density-based clustering in parallel environments. This project aims to conduct a comprehensive study on density-based clustering to improve data management in parallel computing environments. Clustering, a fundamental task in data management, is to group a set of objects such that objects in the same group (called a cluster) are more similar to each other than those in other groups in order to simplify retrieval of similar information. Clustering is widely used in many fields i ....High performance density-based clustering in parallel environments. This project aims to conduct a comprehensive study on density-based clustering to improve data management in parallel computing environments. Clustering, a fundamental task in data management, is to group a set of objects such that objects in the same group (called a cluster) are more similar to each other than those in other groups in order to simplify retrieval of similar information. Clustering is widely used in many fields including machine learning, pattern recognition, information retrieval, bioinformatics and image analysis. It is expected that the developed clustering techniques will provide significant performance improvements in industry sectors where decisions are made based on clustering data analytics, such as the sectors of finance, renewable energy and artificial intelligence.Read moreRead less
Efficient spatial data management for enabling true ride-sharing. This data management project aims to examine ride-sharing as a model of a complex decision system that can be optimised to deliver better outcomes. Popular ride-sharing apps have quickly evolved into ride-sourcing services that are comparable to calling a taxi on a mobile phone. Such arrangements miss many of the key benefits of true ride-sharing for the society. The project will model incentives by helping people agree on points ....Efficient spatial data management for enabling true ride-sharing. This data management project aims to examine ride-sharing as a model of a complex decision system that can be optimised to deliver better outcomes. Popular ride-sharing apps have quickly evolved into ride-sourcing services that are comparable to calling a taxi on a mobile phone. Such arrangements miss many of the key benefits of true ride-sharing for the society. The project will model incentives by helping people agree on points of interest rather than directly seeking trips from others to set destinations. It also aims to introduce privacy-aware dynamic matching of sharers, and expand to transportation at large, to generate new shared transportation services. The expected outcome of this project is to elevate today's taxi-like ride-sharing services to true ride-sharing arrangements. This is expected to provide benefits such as reduced traffic and emissions, as well as addressing parking issues and other traffic problems.Read moreRead less
Personalised data analytics for the Internet of Me. This project aims to develop data mining methods for extracting comprehensive personalised knowledge, without breaching trust. The Internet of Things will lead to the Internet of Me. Billions of smart devices connected to the Internet record people’s lives. Companies wish to provide highly personalised services that engage their customers, while individuals wish to understand their health, lifestyle, education and personal performance. The chal ....Personalised data analytics for the Internet of Me. This project aims to develop data mining methods for extracting comprehensive personalised knowledge, without breaching trust. The Internet of Things will lead to the Internet of Me. Billions of smart devices connected to the Internet record people’s lives. Companies wish to provide highly personalised services that engage their customers, while individuals wish to understand their health, lifestyle, education and personal performance. The challenge is to analyse individuals’ personal data, and discover how they differentiate from and overlap with others’. This project expects to enable businesses to deepen customer satisfaction and individuals to better understand their personal place in a connected world.Read moreRead less
Fast effective clustering technologies for highly dynamic massive networks. Clustering is a fundamental data mining and analysis task. In an interconnected evolving world, friendships and information flows are modelled as large dynamic networks. Structural clustering and correlation clustering are important and well-studied approaches for static networks; for evolving networks, where links appear and disappear over time, we lack efficient techniques. Anticipated outcomes are new practical cluste ....Fast effective clustering technologies for highly dynamic massive networks. Clustering is a fundamental data mining and analysis task. In an interconnected evolving world, friendships and information flows are modelled as large dynamic networks. Structural clustering and correlation clustering are important and well-studied approaches for static networks; for evolving networks, where links appear and disappear over time, we lack efficient techniques. Anticipated outcomes are new practical clustering algorithms for dynamic networks – with performance guarantees of efficiency and clustering quality – and prototype software, guiding us to pick a good clustering. Expected benefits include better understanding of spread in evolving social networks, accelerating the software testing cycle, and improved topic detection.Read moreRead less
Constraint-based Reasoning for Multi-agent Pathfinding. Automation is a transformative technology for logistics -- using robots to manipulate inventory allows warehouses to be more efficient, and larger-scale, than ever before. But doing this in practice requires efficient, reliable methods for coordinating ever-larger fleets of robots. These problems are extremely difficult, and current approaches either scale poorly or give weak or no guarantees on solution quality. The project will develop t ....Constraint-based Reasoning for Multi-agent Pathfinding. Automation is a transformative technology for logistics -- using robots to manipulate inventory allows warehouses to be more efficient, and larger-scale, than ever before. But doing this in practice requires efficient, reliable methods for coordinating ever-larger fleets of robots. These problems are extremely difficult, and current approaches either scale poorly or give weak or no guarantees on solution quality. The project will develop transformative approaches to multi-agent pathfinding which can handle industrial size problems, and handle all of the complications that arise in practical applications. This will deliver improved cost-effectiveness and productivity to automated warehouse logistics and other agent coordination problems.Read moreRead less
Advancing Analytical Query Processing with Urban Trajectory Data. This project aims to provide accurate, rapid, and comprehensive information to analyze transport and related infrastructure use in real time. This project expects to develop innovative solutions by exploiting massive urban trajectory data derived from public transport usage, route mapping, GPS tracking and road-side sensors. Expected outcomes include a new algorithmic framework to support complex trajectory-driven analytical tasks ....Advancing Analytical Query Processing with Urban Trajectory Data. This project aims to provide accurate, rapid, and comprehensive information to analyze transport and related infrastructure use in real time. This project expects to develop innovative solutions by exploiting massive urban trajectory data derived from public transport usage, route mapping, GPS tracking and road-side sensors. Expected outcomes include a new algorithmic framework to support complex trajectory-driven analytical tasks in public transport network planning, traffic congestion prevention, and facility deployment. This should significantly benefit both government and industry in data-driven decision makings and evaluations on the impact of decisions made, and ultimately materialize Australian government’s Smart Cities Plan.Read moreRead less
Discovery Early Career Researcher Award - Grant ID: DE160100568
Funder
Australian Research Council
Funding Amount
$360,000.00
Summary
Towards reliability in combinatorial optimisation. This project intends to develop techniques to ensure that the solutions reported by optimisation tools are correct and verifiable. Combinatorial optimisation problems, where the best solution must be found from a vast set of possibilities, are central to critical sectors of the economy, including shipping, transit, mining and emergency response. Automated tools for these problems can now solve large industrial examples, however, they are incredi ....Towards reliability in combinatorial optimisation. This project intends to develop techniques to ensure that the solutions reported by optimisation tools are correct and verifiable. Combinatorial optimisation problems, where the best solution must be found from a vast set of possibilities, are central to critical sectors of the economy, including shipping, transit, mining and emergency response. Automated tools for these problems can now solve large industrial examples, however, they are incredibly complex artefacts which are prone to error and difficult to test. New methods for ensuring the correctness of automated tools would allow users to trust that the results returned by these tools are correct when making critical decisions.Read moreRead less
Searching for near-exact protein models. This project aims to develop novel and efficient heuristic-based algorithms leading to near accurate protein tertiary structure models. Knowledge about protein structures is fundamental to our understanding of living systems. The progress on experimental determination of these structures has been extremely limited and remains an open challenge in molecular biology. Computational prediction of protein structures from sequences is emerging as a promising ap ....Searching for near-exact protein models. This project aims to develop novel and efficient heuristic-based algorithms leading to near accurate protein tertiary structure models. Knowledge about protein structures is fundamental to our understanding of living systems. The progress on experimental determination of these structures has been extremely limited and remains an open challenge in molecular biology. Computational prediction of protein structures from sequences is emerging as a promising approach, but its accuracy is far from satisfactory. The software systems developed in this project will be used in structural identification of target proteins in drug design. This will make drug design process more efficient, saving time and cost, potentially saving lives.Read moreRead less