๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

[ACM Press the 18th ACM SIGKDD international conference - Beijing, China (2012.08.12-2012.08.16)] Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '12 - Model mining for robust feature selection

โœ Scribed by Woznica, Adam; Nguyen, Phong; Kalousis, Alexandros


Book ID
120769462
Publisher
ACM Press
Year
2012
Tongue
English
Weight
685 KB
Category
Article
ISBN
1450314627

No coin nor oath required. For personal study only.

โœฆ Synopsis


A common problem with most of the feature selection methods is that they often produce feature sets-models-that are not stable with respect to slight variations in the training data. Different authors tried to improve the feature selection stability using ensemble methods which aggregate different feature sets into a single model. However, the existing ensemble feature selection methods suffer from two main shortcomings: (i) the aggregation treats the features independently and does not account for their interactions, and (ii) a single feature set is returned, nevertheless, in various applications there might be more than one feature sets, potentially redundant, with similar information content. In this work we address these two limitations. We present a general framework in which we mine over different feature models produced from a given dataset in order to extract patterns over the models. We use these patterns to derive more complex feature model aggregation strategies that account for feature interactions, and identify core and distinct feature models. We conduct an extensive experimental evaluation of the proposed framework where we demonstrate its effectiveness over a number of high-dimensional problems from the fields of biology and text-mining.


๐Ÿ“œ SIMILAR VOLUMES


[ACM Press the 18th ACM SIGKDD internati
โœ Li, Jianzhong ๐Ÿ“‚ Article ๐Ÿ“… 2012 ๐Ÿ› ACM Press ๐ŸŒ English โš– 373 KB

With the rapid development of advanced data acquisition techniques such as high-throughput biological experiments and wireless sensor networks, large amount of graph-structured data, graph data for short, have been collected in a wide range of applications. Discovering knowledge from graph data has

[ACM Press the 18th ACM SIGKDD internati
โœ Chen, Hung-Hsuan; Ciou, Yan-Bin; Lin, Shou-De ๐Ÿ“‚ Article ๐Ÿ“… 2012 ๐Ÿ› ACM Press ๐ŸŒ English โš– 673 KB

With the popularity of online social network services, influence maximization on social networks has drawn much attention in recent years. Most of these studies approximate a greedy based sub-optimal solution by proving the submodular nature of the utility function. Instead of using the analytical t