CRC Press - Computational Methods of Feature Selection - 2008
Chapman & Hall/CRC
Taylor & Francis Group
6000 Broken Sound Parkway NW, Suite 300
Boca Raton, FL 33487‑2742
© 2008 by Taylor & Francis Group, LLC
Chapman & Hall/CRC is an imprint of Taylor & Francis Group, an Informa business
No claim to original U.S. Government works
Printed in the United States of America on acid‑free paper
10 9 8 7 6 5 4 3 2 1
International Standard Book Number‑13: 978‑1‑58488‑878‑9 (Hardcover)
This book contains information obtained from authentic and highly regarded sources. Reprinted
material is quoted with permission, and sources are indicated. A wide variety of references are
listed. Reasonable efforts have been made to publish reliable data and information, but the author
and the publisher cannot assume responsibility for the validity of all materials or for the consequences
of their use.
No part of this book may be reprinted, reproduced, transmitted, or utilized in any form by any
electronic, mechanical, or other means, now known or hereafter invented, including photocopying,
microfilming, and recording, or in any information storage or retrieval system, without written
permission from the publishers.
For permission to photocopy or use material electronically from this work, please access www.
copyright.com (
http://www.copyright.com/) or contact the Copyright Clearance Center, Inc. (CCC)
222 Rosewood Drive, Danvers, MA 01923, 978‑750‑8400. CCC is a not‑for‑profit organization that
provides licenses and registration for a variety of users. For organizations that have been granted a
photocopy license by the CCC, a separate system of payment has been arranged.
Trademark Notice: Product or corporate names may be trademarks or registered trademarks, and
are used only for identification and explanation without intent to infringe.
Library of Congress Cataloging‑in‑Publication Data
Liu, Huan, 1958‑
Computational methods of feature selection / authors/editors, Huan Liu and
Hiroshi Motoda.
p. cm. ‑‑ (Chapman & Hall/CRC data mining and knowledge
discovery)
Includes bibliographical references and index.
ISBN 978‑1‑58488‑878‑9 (alk. paper)
1. Database management. 2. Data mining. 3. Machine learning. I. Motoda,
Hiroshi. II. Title. III. Series.
QA76.9.D3L5652 2007
005.74‑‑dc22 2007027465
Visit the Taylor & Francis Web site at
http://www.taylorandfrancis.com
and the CRC Press Web site at
http://www.crcpress.com
Preface
It has been ten years since we published our first two books on feature selection
in 1998. In the past decade, we witnessed a great expansion of feature
selection research in multiple dimensions. We experienced the fast data evolution
in which extremely high-dimensional data, such as high-throughput data
of bioinformatics and Web/text data, became increasingly common. They
stretch the capabilities of conventional data processing techniques, pose new
challenges, and stimulate accelerated development of feature selection research
in two major ways. One trend is to improve and expand the existing techniques
to meet the new challenges. The other is to develop brand new algorithms
directly targeting the arising challenges. In this process, we observe
many feature-selection-centered activities, such as one well-received competition,
two well-attended tutorials at top conferences, and two multi-disciplinary
workshops, as well as a special development section in a recent issue of IEEE
Intelligent Systems, to name a few.
This collection bridges the widening gap between existing texts and the
rapid developments in the field, by presenting recent research works from various
disciplines. It features excellent survey work, practical guides, exciting
new directions, and comprehensive tutorials from leading experts. The book
also presents easy-to-understand illustrations, state-of-the-art methodologies,
and algorithms, along with real-world case studies ranging from text classification,
to Web mining, to bioinformatics where high-dimensional data are
pervasive. Some vague ideas suggested in our earlier book have been developed
into mature areas with solid achievements, along with progress that
could not have been imagined ten years ago. With the steady and speedy
development of feature selection research, we sincerely hope that this book
presents distinctive and representative achievements; serves as a convenient
point for graduate students, practitioners, and researchers to further the research
and application of feature selection; and sparks a new phase of feature
selection research. We are truly optimistic about the impact of feature selection
on massive, high-dimensional data and processing in the near future, and
we have no doubt that in another ten years, when we look back, we will be
humbled by the newfound power of feature selection, and by its indelible contributions
to machine learning, data mining, and many real-world challenges.
Huan Liu and Hiroshi Motoda
© 2008 by Taylor & Francis Group, LLC
Acknowledgments
The inception of this book project was during SDM 2006’s feature selection
workshop. Randi Cohen, an editor of Chapman and Hall/CRC Press,
eloquently convinced one of us that it was a time for a new book on feature
selection. Since then, she closely worked with us to make the process easier
and smoother and allowed us to stay focused. With Randi’s kind and expert
support, we were able to adhere to the planned schedule when facing unexpected
difficulties. We truly appreciate her generous support throughout the
project.
This book is a natural extension of the two successful feature selection
workshops held at SDM 20051 and SDM 2006.2 The success would not be
a reality without the leadership of two workshop co-organizers (Robert Stine
of Wharton School and Leonard Auslender of SAS); the meticulous work of
the proceedings chair (Lei Yu of Binghamton University); and the altruistic
efforts of PC members, authors, and contributors. We take this opportunity
to thank all who helped to advance the frontier of feature selection research.
The authors, contributors, and reviewers of this book played an instrumental
role in this project. Given the limited space of this book, we could
not include all quality works. Reviewers’ detailed comments and constructive
suggestions significantly helped improve the book’s consistency in content,
format, comprehensibility, and presentation. We thank the authors who patiently
and timely accommodated our (sometimes many) requests.
We would also like to express our deep gratitude for the gracious help we
received from our colleagues and students, including Zheng Zhao, Lei Tang,
Quan Nguyen, Payam Refaeilzadeh, and Shankara B. Subramanya of Arizona
State University; Kozo Ohara of Osaka University; and William Nace and
Kenneth Gorreta of AFOSR/AOARD, Air Force Research Laboratory.
Last but not least, we thank our families for their love and support. We
are grateful and happy that we can now spend more time with our families.
Huan Liu and Hiroshi Motoda
1The 2005 proceedings are at
http://enpub.eas.asu.edu/workshop/.
2The 2006 proceedings are at
http://enpub.eas.asu.edu/workshop/2006/.
xuexi...................................................