𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Advances and Innovations in Statistics and Data Science

✍ Scribed by Wenqing He, Liqun Wang, Jiahua Chen, Chunfang Devon Lin


Publisher
Springer
Year
2022
Tongue
English
Leaves
338
Series
ICSA Book Series in Statistics
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


This book highlights selected papers from the 4th ICSA-Canada Chapter Symposium, as well as invited articles from established researchers in the areas of statistics and data science. It covers a variety of topics, including methodology development in data science, such as methodology in the analysis of high dimensional data, feature screening in ultra-high dimensional data and natural language ranking; statistical analysis challenges in sampling, multivariate survival models and contaminated data, as well as applications of statistical methods. With this book, readers can make use of frontier research methods to tackle their problems in research, education, training and consultation.

✦ Table of Contents


Preface
Part I: Methodology Development in Data Science (Chapters 1–6)
Part II: Challenges in Statistical Analysis (Chapters 7–15)
Contents
Contributors
Part I Methodology Development in Data Science
MiRNA–Gene Activity Interaction Networks (miGAIN): Integrated Joint Models of miRNA–Gene Targeting and Disturbance in Signaling Pathways
1 Introduction
2 Data
3 Methods
4 Results
4.1 Simulations
4.2 Data Analysis
5 Discussion
References
Robust Feature Screening for Ultrahigh-Dimensional Censored Data Subject to Measurement Error
1 Introduction
2 Notation and Framework
2.1 Survival Data
2.2 Distance Correlation of Two Random Variables
2.3 Feature Screening for Censored Data with Precise Measurements
3 Feature Screening for Censored Data with Error-Prone Covariates
3.1 Measurement Error Model
3.2 Feature Screening with Measurement Error Effects Accommodated
3.3 Asymptotic Results
4 Iteration Algorithm
5 Numerical Studies
5.1 Simulation Setup
5.2 Simulation Results
5.3 Analysis of Mantle Cell Lymphoma Microarray Data
6 Discussion
Appendix
A. Technical Lemmas
B. Proofs of Main Theorems
B.1 Proof of Theorem 1
B.2 Proof of Theorem 2
References
Simultaneous Control of False Discovery Rate and Sensitivity Using Least Angle Regressions in High-Dimensional Data Analysis
1 Introduction
2 Methodology
2.1 Cosin Distribution
2.2 Selection Criteria
3 Numerical Studies
3.1 Simulation Studies
3.1.1 Scenario I: Compound Symmetry Structure of
3.1.2 Scenario II: Auto-Regressive Correlation
3.2 A Real-Data Application
4 Discussion
References
Minimum Wasserstein Distance Estimator Under Finite Location-Scale Mixtures
1 Introduction
2 Wasserstein Distance and the Minimum Distance Estimator
2.1 Wasserstein Distance
2.2 Minimum Wasserstein Distance Estimator
2.3 Consistency of MWDE
2.4 Numerical Solution to MWDE
2.5 Penalized Maximum Likelihood Estimator
3 Experiments
3.1 Performance Measure
3.2 Performance Under Homogeneous Model
3.3 Efficiency and Robustness Under Finite Location-Scale Mixtures
3.3.1 Efficiency
3.3.2 Robustness
3.4 Image Segmentation
4 Conclusion
Appendix
Numerically Friendly Expression of W2(FN, F(Β·|G))
References
An Entropy-Based Comment Ranking Method with Word Embedding Clustering
1 Introduction
1.1 How to Judge a Comment's Quality?
1.2 Ranking Comments Using Entropy
1.3 Text Representation
2 Bag-of-Words Model with Word Embedding Clusters
3 Ranking Comments with General Entropy
4 Experiment with Amazon Review Data
References
A Robust Approach to Statistical Quality Control for High-Dimensional Non-Normal Data
1 Introduction
2 Notations and Preliminaries
3 Modification and Robustness
3.1 Model and Assumptions
3.2 Statistic, Its Limit, and Robustness
4 Simulations
5 Discussion
6 Basic Moments
7 Proof of Lemma 1
8 Proof of Theorem 1
9 Proof of Theorem 2
References
Part II Challenges in Statistical Analysis
Functional Linear Regression for Partially Observed Functional Data
1 Introduction
2 Functional Linear Model
3 Estimation Methods
3.1 Partially Observed Functional Data Without Measurement Error
3.2 Partially Observed Functional Data with Measurement Error
4 Simulation Studies
5 Real Data Analysis
6 Discussion
Appendix
References
Profile Estimation of Generalized Semiparametric Varying-Coefficient Additive Models for Longitudinal Data with Within-Subject Correlations
1 Introduction
2 Profile GEE Estimation Procedures
2.1 Model Estimation Using Fixed Working Covariance Matrices
2.2 Estimation of the Within-Subject Covariance Matrix
2.2.1 Estimation of Marginal Variance
2.2.2 Estimation of Correlation Coefficients
2.3 Profile Estimation via Quadratic Inference Function
2.4 Computational Algorithms
2.5 Choices of Kernel Function, Bandwidth, and Link Function
3 Simulation Studies
3.1 Continuous Longitudinal Responses
3.2 Discrete Longitudinal Responses
4 Concluding Remarks
References
Sieve Estimation of Semiparametric Linear Transformation Model with Left-Truncated and Current Status Data
1 Introduction
2 Sieve Maximum Likelihood Estimation
3 Efficient Estimation
4 Asymptotic Properties
5 Simulation Study
6 Real Data Analysis
7 Concluding Remarks and Future Work
Appendix
References
A Review of Flexible Transformations for Modeling Compositional Data
1 Introduction
2 Transformations for Compositional Data
2.1 Isometric Log-Ratio Transformation
2.2 Ξ±-transformation
2.3 Ξ±-Folding Transformation
3 Real-life Data Applications
4 Conclusion
References
Identifiability and Estimation of Autoregressive ARCH Models with Measurement Error
1 Introduction
2 The Model and Identifiability
3 GMM Estimation
4 Testing for Measurement Error
5 Impact of Measurement Error
6 Finite Sample Properties
7 Conclusions and Discussion
Appendix
Regularity Assumptions and Mathematical Proofs
Regularity Assumptions
Proof of Theorem 1
Proof of Theorem 2
References
Modal Regression for Skewed, Truncated, or Contaminated Data with Outliers
1 Introduction
2 Linear Modal Regression
2.1 Introduction of Linear Modal Regression
2.2 Asymptotic Properties
2.3 Estimation Algorithm
2.4 Prediction Intervals Based on Modal Regression
3 Nonparametric Modal Regression
3.1 Estimating Uni-Modal Regression
3.2 Estimating Multi-Modal Regression
4 Semiparametric Modal Regression
5 Discussion
Appendix
References
Spatial Multilevel Modelling in the Galveston Bay Recovery Study Survey
1 Introduction
2 Sampling Design
3 Survey Weights
4 Inferring Inclusion Probabilities from the Weights
5 Spatial Multilevel Model
6 A Bayesian Analysis
7 Frequentist Composite Likelihood Analysis
7.1 Estimating Function System for Mean Parameters
7.2 Decomposition of the Error Term
7.3 Estimating Equation System for Variance Components
7.4 Point Estimation
7.5 Uncertainty Estimation
8 Discussion and Conclusions
References
Efficient Experimental Design for Lasso Regression
1 Introduction
2 Methodology
2.1 A Construction Method Using Orthogonal Arrays
2.2 A Construction Method Using the Kronecker Product
3 Numerical Illustration
4 Discussion
References
A Selective Overview of Statistical Methods for Identification of the Treatment-Sensitive Subsets of Patients
1 Introduction
2 Statistical Methods for Treatment-Sensitive Subset Identification Based on Survival Times
2.1 An Approach Based on a Biomarker-Adaptive Threshold Design
2.2 A Hierarchical Bayesian Method
2.3 A Procedure Based on a Single-index Threshold Cox Model
2.4 An Interaction Tree Approach
3 Statistical Methods for Treatment-Sensitive Subset Identification Based on Longitudinal Measurements
3.1 A Procedure Based on Multilevel Models
3.2 A Prediction Model Approach
3.3 A Procedure Based on a Threshold Linear Mixed Model
4 Discussions and Future Work
References
Index


πŸ“œ SIMILAR VOLUMES


New Advances in Statistics and Data Scie
✍ Ding-Geng Chen,Zhezhen Jin,Gang Li,Yi Li,Aiyi Liu,Yichuan Zhao (eds.) πŸ“‚ Library πŸ“… 2017 πŸ› Springer International Publishing 🌐 English

<p><p>This book is comprised of the presentations delivered at the 25<sup>th</sup> ICSA Applied Statistics Symposium held at the Hyatt Regency Atlanta, on June 12-15, 2016. This symposium attracted more than 700 statisticians and data scientists working in academia, government, and industry from all

New Advances in Statistics and Data Scie
✍ Ding-Geng Chen; Zhezhen Jin; Gang Li; Yi Li; Aiyi Liu; Yichuan Zhao πŸ“‚ Library πŸ“… 2018 πŸ› Springer 🌐 English

This book is comprised of the presentations delivered at the 25th ICSA Applied Statistics Symposium held at the Hyatt Regency Atlanta, on June 12-15, 2016. This symposium attracted more than 700 statisticians and data scientists working in academia, government, and industry from all over the world.

Advances in Data Science: Symbolic, Comp
✍ Edwin Diday (editor), Rong Guan (editor), Gilbert Saporta (editor), Huiwen Wang πŸ“‚ Library πŸ“… 2020 πŸ› Wiley-ISTE 🌐 English

<p>Data science unifies statistics, data analysis and machine learning to achieve a better understanding of the masses of data which are produced today, and to improve prediction. Special kinds of data (symbolic, network, complex, compositional) are increasingly frequent in data science. These data

Advanced Statistical Methods in Data Sci
✍ Ding-Geng Chen, Jiahua Chen, Xuewen Lu, Grace Y. Yi, Hao Yu (eds.) πŸ“‚ Library πŸ“… 2016 πŸ› Springer Singapore 🌐 English

<p>This book gathers invited presentations from the 2nd Symposium of the ICSA- CANADA Chapter held at the University of Calgary from August 4-6, 2015. The aim of this Symposium was to promote advanced statistical methods in big-data sciences and to allow researchers to exchange ideas on statistics a

Frame Theory in Data Science (Advances i
✍ Zhihua Zhang, Palle E. T. Jorgensen πŸ“‚ Library πŸ“… 2024 πŸ› Springer 🌐 English

<span>This book establishes brand-new frame theory and technical implementation in data science, with a special focus on spatial-scale feature extraction, network dynamics, object-oriented analysis, data-driven environmental prediction, and climate diagnosis. Given that data science is unanimously r

Advancements and Innovations in Health S
✍ Mieczyslaw Pokorski πŸ“‚ Library πŸ“… 2019 πŸ› Springer International Publishing 🌐 English

<p><p></p><p>This book focuses on recent advances regarding clinical conditions and ailments whose mechanisms remain unclear, limiting our ability to treat them.</p><p>The respective chapters address a range of multidisciplinary topics related to timely or emergent research areas, such as osteoporos