<span>This book constitutes the post-conference proceedings of the 23rd International Conference on Data Analytics and Management in Data Intensive Domains, DAMDID/RCDL 2021, held in Moscow, Russia, in October 2021*.<br>The 16 revised full papers were carefully reviewed and selected from 61 submissi
Data Analytics and Management in Data Intensive Domains: 22nd International Conference, DAMDID/RCDL 2020, Voronezh, Russia, October 13–16, 2020, ... in Computer and Information Science)
✍ Scribed by Alexander Sychev (editor), Sergey Makhortov (editor), Bernhard Thalheim (editor)
- Publisher
- Springer
- Year
- 2021
- Tongue
- English
- Leaves
- 241
- Category
- Library
No coin nor oath required. For personal study only.
✦ Synopsis
This book constitutes the post-conference proceedings of the 22nd International Conference on Data Analytics and Management in Data Intensive Domains, DAMDID/RCDL 2020, held in Voronezh, Russia, in October 2020.
The 16 revised full papers and two keynotes were carefully reviewed and selected from 60 submissions. The papers are organized in the following topical sections: data Integration, conceptual models and ontologies; data management in semantic web; data analysis in medicine; data analysis in astronomy; information extraction from text.
The conference was held virtually due to the COVID-19 pandemic.
✦ Table of Contents
Preface
Organization
Keynote Speakers’ Bios
Ladjel Bellatreche (National Engineering School for Mechanics and Aerotechnics of Aerospace Engineering Group, France)
Óscar Pastor (Polytechnic University of Valencia, Spain)
Keynote Abstracts
Towards Green Data Management Systems
Conceptual Modeling and Life Engineering: Facing Data Intensive Domains Under a Common Perspective
Contents
Data Integration, Conceptual Models and Ontologies
Managing Data-Intensive Research Problem-Solving Lifecycle
1 Introduction
2 Current Trends in Data Management for Research Problem Solving
3 Levels of Resource Integration
4 The Proposed Approach to Problem-Solving
4.1 Specification Registries
4.2 Stages of Problem-Solving
5 Developing Problem Specifications
6 Selection and Integration of Relevant Resources
6.1 Data Model Integration
6.2 Schema Integration
6.3 Resource Integration on the Level of Objects
7 Selecting Methods and Solving the Problem
7.1 Research Experiments
8 Data Publishing
9 Managing Problem-Solving
9.1 Digital Objects
9.2 Necessary Services for the Problem-Solving Lifecycle Implementation
10 Conclusion
References
Algebraic Models for Big Data and Knowledge Management
1 Introduction
2 Foundations of the Fuzzy LP-Structures Theory
3 Logical-Production Equations in the FLP-Structure
4 On Fuzzy LP-Inference and Relevance Indicators
5 Conclusion
References
A Cloud-Native Serverless Approach for Implementation of Batch Extract-Load Processes in Data Lakes
1 Introduction
2 Batch Data Ingestion in Cloud Data Lakes
3 Related Works
4 Serverless Batch Extract-Load System Architecture and Lifecycle
5 Evaluation
6 Discussion and Conclusions
References
Data Management in Semantic Web
Pragmatic Interoperability and Translation of Industrial Engineering Problems into Modelling and Simulation Solutions
1 Introduction
2 Interoperability in Materials Modelling
2.1 Review of Materials Modelling, MODA, and Ontologies
2.2 Specification of Roles and Processes
3 Materials Modelling Translation Ontology
4 Key Performance Indicators
5 Conclusion
References
Analysis of the Semantic Distance of Words in the RuWordNet Thesaurus
1 Introduction
2 Methodology
3 Results and Discussion
3.1 Skipping of Word Meanings
3.2 Denotation to Different Semantic Areas
3.3 Skipping of Relationships
3.4 Skipping of Concepts
4 Conclusion
References
A Transformation of the RDF Mapping Language into a High-Level Data Analysis Language for Execution in a Distributed Computing Environment
1 Introduction
2 Related Work
3 RDF, RML and Pig
3.1 RDF
3.2 RML
3.3 Pig Latin
4 Mapping of RML into Pig Latin
4.1 Skeleton Mapping Algorithm
4.2 RML Constructs Mapping
5 Transformation
6 Evaluation
7 Conclusions
References
Data Analysis in Medicine
EMG and EEG Pattern Analysis for Monitoring Human Cognitive Activity during Emotional Stimulation
1 Introduction
2 The Experiment
3 Analysis of EMG Patterns
4 Analysis of EEG Patterns
5 The Results EEG Pattern Analysis
6 Conclusion
References
Finding the TMS-Targeted Group of Fibers Reconstructed from Diffusion MRI Data
1 Introduction
2 Methods and Materials
2.1 The Study Pipeline
2.2 Description of the Experimental Data
2.3 Data Pre-processing
2.4 Calculation of the TMS-Induced Effects
2.5 Finding TMS-Targeted Groups of Fibers
3 Results
4 Discussion
5 Conclusion
References
Data Analysis in Astronomy
Data for Binary Stars from Gaia DR2
1 Introduction
2 Catalogues of Binary Stars and Its Cross Identification with Gaia DR2
2.1 ILB
2.2 ORB6
3 Catalogues of Co-moving Stars from Gaia DR2
4 Discussion
4.1 Astrometric Solutions for Binary Stars
4.2 ILB Update with Gaia DR2 Data and Gaia DR2 Binaries
5 Results and Conclusions
References
Classification Problem and Parameter Estimating of Gamma-Ray Bursts
1 Introduction
2 The Ep,i – Eiso Correlation
2.1 Constructing and Fitting the Correlation
2.2 Using the Correlation to Classify GRBs, EH Parameter
2.3 Using the Correlation to Estimate Redshift
3 The T90,i – EH Diagram
3.1 Using the Diagram to Classify GRBs, EHD Parameter
3.2 Using the Diagram to Estimate Redshift
4 GRB 200415A
4.1 Observations
4.2 Analyzing the Ep,i – Eiso Correlation
4.3 Analyzing the T90,i – EH Diagram
4.4 Discussion
5 GRB 200422A
5.1 Observations
5.2 Analyzing the Ep,i – Eiso Correlation
5.3 Analyzing the T90,i – EH Diagram
5.4 Discussion
6 Conclusions
References
Databases of Gamma-Ray Bursts’ Optical Observations
1 Introduction
2 Databases in Optic
2.1 Why Databases Are Necessary?
2.2 Available Optical Databases
2.3 IKI GRB-FuN Observations and Data Collection (Database)
3 Discussion
3.1 Criteria and Databases Following These Criteria
3.2 Covering by Observations All Phases of GRB Emission
3.3 Multiwavelength Observations
3.4 Searching for GRB Accompanying Gravitational Wave Events
References
Information Extraction from Text
Part of Speech and Gramset Tagging Algorithms for Unknown Words Based on Morphological Dictionaries of the Veps and Karelian Languages
1 Introduction
2 Data Organization and Text Tagging in the VepKar Corpus
3 Corpus Tagging Peculiarities
4 Part of Speech and Gramset Search by Analogy Algorithms
4.1 The POSGuess Algorithm for Part of Speech Tagging with A Suffix
4.2 The GramGuess Algorithm for Gramset Tagging with a Suffix
4.3 The GramPseudoGuess Algorithm for Gramset Tagging with a Pseudo-ending
5 Experiments
5.1 Data Preparation
5.2 Part of Speech Search by a Suffix (POSGuess Algorithm)
5.3 Gramset Search by a Suffix (GramGuess Algorithm) and by a Pseudo-ending (GramPseudoGuess Algorithm)
6 Morphological Analysis Results
7 Conclusion
References
Extrinsic Evaluation of Cross-Lingual Embeddings on the Patent Classification Task
1 Introduction
1.1 IPC Taxonomy
1.2 Structure of Patent Documents
2 Related Work
2.1 Patent Classification
2.2 Cross-Lingual Embeddings
3 Data
3.1 Data Description
3.2 Data Preprocessing
4 Experiments
4.1 Multilingual Unsupervised and Supervised Embeddings
4.2 Language-Agnostic Sentence Representations
4.3 Multilingual BERT and XLM-RoBERTa
5 Results Discussion
6 Conclusions
References
An Approach to Extracting Ontology Concepts from Requirements
1 Introduction
2 Related Works
3 Analysis of Automatic Russian Text Processing Tools Possibilities for Extracting Ontology Concepts
4 An Approach to Extracting Ontology Concepts from the Results of Automatic Processing of Textual Requirements
5 Conclusion and Future Work
References
Data Driven Detection of Technological Trajectories
1 Introduction
2 Related Work
3 Dataset
4 Description of Methods
4.1 Method for an Identification of a Technology
4.2 Method for Revealing of Technology Dynamics
5 Results
6 Conclusion
References
Comparison of Cross-Lingual Similar Documents Retrieval Methods
1 Introduction
2 Related Work
3 Document Retrieval Methods
3.1 Preprocessing
3.2 Cross-Lingual Embeddings
3.3 Inverted Index Based Approach
3.4 Translation Method
3.5 Machine Translation
3.6 Document as Vector
3.7 Sentence as Vector
3.8 Explicit Semantic Analysis (ESA)
3.9 Clustering of Word Embeddings
4 Datasets
4.1 Wiki Dataset
4.2 Essays Dataset
5 Experiments Setup
5.1 Indexing of Wikipedia
5.2 Document as Vector
5.3 Sentence as Vector
5.4 ESA
5.5 Clustering of Word Embeddings
5.6 Parameters Tuning
6 Evaluation Results
7 Conclusion
References
Author Index
📜 SIMILAR VOLUMES
<span>This book constitutes the post-conference proceedings of the 23rd International Conference on Data Analytics and Management in Data Intensive Domains, DAMDID/RCDL 2021, held in Moscow, Russia, in October 2021*.<br>The 16 revised full papers were carefully reviewed and selected from 61 submissi
<p>This book constitutes the refereed proceedings of the 20th International Conference on Data Analytics and Management in Data Intensive Domains, DAMDID/RCDL 2018, held in Moscow, Russia, in October 2018.<p>The 9 revised full papers presented together with three invited papers were carefully review
<p>This book constitutes the post-conference proceedings of the 21st International Conference on Data Analytics and Management in Data Intensive Domains, DAMDID/RCDL 2019, held in Kazan, Russia, in October 2019.<p>The 11 revised full papers presented together with four invited papers were carefully
<p>This book constitutes the refereed proceedings of the 19th International Conference on Data Analytics and Management in Data Intensive Domains, DAMDID/RCDL 2017, held in Moscow, Russia, in October 2017.<p>The 16 revised full papers presented together with three invited papers were carefully revie
This book constitutes the refereed proceedings of the 28th International Conference on Data Analytics and Management in Data Intensive Domains, DAMDID/RCDL 2016, held in Ershovo, Moscow, Russia, in October 2016.<br><br>The 16 revised full papers presented together with one invited talk and two keyno