𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Astronomy and Big Data: A Data Clustering Approach to Identifying Uncertain Galaxy Morphology

✍ Scribed by Kieran Jay Edwards, Mohamed Medhat Gaber (auth.)


Publisher
Springer International Publishing
Year
2014
Tongue
English
Leaves
112
Series
Studies in Big Data 6
Edition
1
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


With the onset of massive cosmological data collection through media such as the Sloan Digital Sky Survey (SDSS), galaxy classification has been accomplished for the most part with the help of citizen science communities like Galaxy Zoo. Seeking the wisdom of the crowd for such Big Data processing has proved extremely beneficial. However, an analysis of one of the Galaxy Zoo morphological classification data sets has shown that a significant majority of all classified galaxies are labelled as β€œUncertain”.

This book reports on how to use data mining, more specifically clustering, to identify galaxies that the public has shown some degree of uncertainty for as to whether they belong to one morphology type or another. The book shows the importance of transitions between different data mining techniques in an insightful workflow. It demonstrates that Clustering enables to identify discriminating features in the analysed data sets, adopting a novel feature selection algorithms called Incremental Feature Selection (IFS). The book shows the use of state-of-the-art classification techniques, Random Forests and Support Vector Machines to validate the acquired results. It is concluded that a vast majority of these galaxies are, in fact, of spiral morphology with a small subset potentially consisting of stars, elliptical galaxies or galaxies of other morphological variants.

✦ Table of Contents


Front Matter....Pages 1-10
Introduction....Pages 1-3
Astronomy, Galaxies and Stars: An Overview....Pages 5-14
Astronomical Data Mining....Pages 15-30
Adopted Data Mining Methods....Pages 31-42
Research Methodology....Pages 43-48
Development of Data Mining Models....Pages 49-81
Experimentation Results....Pages 83-88
Conclusion and FutureWork....Pages 89-93
Back Matter....Pages 95-104

✦ Subjects


Computational Intelligence; Artificial Intelligence (incl. Robotics); Astronomy, Observations and Techniques; Data Mining and Knowledge Discovery


πŸ“œ SIMILAR VOLUMES


Clustering for Data Mining: A Data Recov
✍ Boris Mirkin πŸ“‚ Library πŸ“… 2005 πŸ› Chapman and Hall/CRC 🌐 English

This book gives a smooth, motivated and example-richintroduction to clustering, which is innovative in many aspects.Answers to important questions that are very rarely addressed if addressed at all, are provided.Examples:(a) what to do if the user has no idea of the numberof clusters and/or their lo

Clustering for Data Mining: A Data Recov
✍ Boris Mirkin πŸ“‚ Library πŸ“… 2005 πŸ› Chapman & Hall – CRC 🌐 English

Often considered more as an art than a science, the field of clustering has been dominated by learning through examples and by techniques chosen almost through trial-and-error. Even the most popular clustering methods--K-Means for partitioning the data set and Ward's method for hierarchical clusteri

Big Data, Algorithms and Food Safety: A
✍ Salvatore Sapienza πŸ“‚ Library πŸ“… 2022 πŸ› Springer 🌐 English

<span>This book identifies the principles that should be applied when processing Big Data in the context of food safety risk assessments. Food safety is a critical goal in the protection of individuals’ right to health and the flourishing of the food and feed market. Big Data is fostering new applic

SQL Server Big Data Clusters: Data Virtu
✍ Benjamin Weissman, Enrico van de Laar πŸ“‚ Library πŸ“… 2020 πŸ› APress 🌐 English

Use this guide to one of SQL Server 2019's most impactful features--Big Data Clusters. You will learn about data virtualization and data lakes for this complete artificial intelligence (AI) and machine learning (ML) platform within the SQL Server database engine. You will know how to use Big Data Cl

SQL Server Big Data Clusters : Data Virt
✍ Benjamin Weissman, Enrico van de Laar πŸ“‚ Library πŸ“… 2020 πŸ› Apress 🌐 English

<p>Use this guide to one of SQL Server 2019’s most impactful featuresβ€”Big Data Clusters. You will learn about data virtualization and data lakes for this complete artificial intelligence (AI) and machine learning (ML) platform within the SQL Server database engine. You will know how to use Big Data