Entity information life cycle for big data : master data management and information integration / [electronic resource]
by Talburt, John R [author.]; Zhou, Yinle [author.].
Material type: Book
Publisher: Waltham, MA : Elsevier, 2015
Description: 1 online resource.
ISBN: 9780128006658; 012800665X.
Subject(s): Big data | Semantic Web | Pattern recognition systems | Data mining | LANGUAGE ARTS & DISCIPLINES -- Library & Information Science -- General | Electronic books
Online resources: ScienceDirect
Vendor-supplied metadata.
Entity Information Life Cycle for Big Data walks you through the ins and outs of managing entity information so you can successfully achieve master data management (MDM) in the era of big data. This book explains big data's impact on MDM and the critical role of an entity information management system (EIMS) in successful MDM. Expert authors Dr. John R. Talburt and Dr. Yinle Zhou provide a thorough background in the principles of managing the entity information life cycle, along with practical tips and techniques for implementing an EIMS, strategies for exploiting distributed processing to handle big data for EIMS, and examples from real applications. Additional material on the theory of entity identity information management (EIIM) and methods for assessing and evaluating EIMS performance also make this book appropriate for use as a textbook in courses on entity and identity management, data management, customer relationship management (CRM), and related topics.
Front Cover; Entity Information Life Cycle for Big Data; Copyright; Contents; Foreword; Preface; THE CHANGING LANDSCAPE OF INFORMATION QUALITY; MOTIVATION FOR THIS BOOK; AUDIENCE; ORGANIZATION OF THE MATERIAL; Acknowledgements; Chapter 1 -- The Value Proposition for MDM and Big Data; DEFINITION AND COMPONENTS OF MDM; THE BUSINESS CASE FOR MDM; DIMENSIONS OF MDM; THE CHALLENGE OF BIG DATA; MDM AND BIG DATA -- THE N-SQUARED PROBLEM; CONCLUDING REMARKS; Chapter 2 -- Entity Identity Information and the CSRUD Life Cycle Model; ENTITIES AND ENTITY REFERENCES; MANAGING ENTITY IDENTITY INFORMATION.
ENTITY IDENTITY INFORMATION LIFE CYCLE MANAGEMENT MODELS; CONCLUDING REMARKS; Chapter 3 -- A Deep Dive into the Capture Phase; AN OVERVIEW OF THE CAPTURE PHASE; BUILDING THE FOUNDATION; UNDERSTANDING THE DATA; DATA PREPARATION; SELECTING IDENTITY ATTRIBUTES; ASSESSING ER RESULTS; DATA MATCHING STRATEGIES; CONCLUDING REMARKS; Chapter 4 -- Store and Share -- Entity Identity Structures; ENTITY IDENTITY INFORMATION MANAGEMENT STRATEGIES; DEDICATED MDM SYSTEMS; THE IDENTITY KNOWLEDGE BASE; MDM ARCHITECTURES; CONCLUDING REMARKS; Chapter 5 -- Update and Dispose Phases -- Ongoing Data Stewardship.
DATA STEWARDSHIP; THE AUTOMATED UPDATE PROCESS; THE MANUAL UPDATE PROCESS; ASSERTED RESOLUTION; EIS VISUALIZATION TOOLS; MANAGING ENTITY IDENTIFIERS; CONCLUDING REMARKS; Chapter 6 -- Resolve and Retrieve Phase -- Identity Resolution; IDENTITY RESOLUTION; IDENTITY RESOLUTION ACCESS MODES; CONFIDENCE SCORES; CONCLUDING REMARKS; Chapter 7 -- Theoretical Foundations; THE FELLEGI-SUNTER THEORY OF RECORD LINKAGE; THE STANFORD ENTITY RESOLUTION FRAMEWORK; ENTITY IDENTITY INFORMATION MANAGEMENT; CONCLUDING REMARKS; Chapter 8 -- The Nuts and Bolts of Entity Resolution; THE ER CHECKLIST.
CLUSTER-TO-CLUSTER CLASSIFICATION; SELECTING AN APPROPRIATE ALGORITHM; CONCLUDING REMARKS; Chapter 9 -- Blocking; BLOCKING; BLOCKING BY MATCH KEY; DYNAMIC BLOCKING VERSUS PRERESOLUTION BLOCKING; BLOCKING PRECISION AND RECALL; MATCH KEY BLOCKING FOR BOOLEAN RULES; MATCH KEY BLOCKING FOR SCORING RULES; CONCLUDING REMARKS; Chapter 10 -- CSRUD for Big Data; LARGE-SCALE ER FOR MDM; THE TRANSITIVE CLOSURE PROBLEM; DISTRIBUTED, MULTIPLE-INDEX, RECORD-BASED RESOLUTION; AN ITERATIVE, NONRECURSIVE ALGORITHM FOR TRANSITIVE CLOSURE; ITERATION PHASE: SUCCESSIVE CLOSURE BY REFERENCE IDENTIFIER.
DEDUPLICATION PHASE: FINAL OUTPUT OF COMPONENTS; ER USING THE NULL RULE; THE CAPTURE PHASE AND IKB; THE IDENTITY UPDATE PROBLEM; PERSISTENT ENTITY IDENTIFIERS; THE LARGE COMPONENT AND BIG ENTITY PROBLEMS; IDENTITY CAPTURE AND UPDATE FOR ATTRIBUTE-BASED RESOLUTION; CONCLUDING REMARKS; Chapter 11 -- ISO Data Quality Standards for Master Data; BACKGROUND; GOALS AND SCOPE OF THE ISO 8000-110 STANDARD; FOUR MAJOR COMPONENTS OF THE ISO 8000-110 STANDARD; SIMPLE AND STRONG COMPLIANCE WITH ISO 8000-110; ISO 22745 INDUSTRIAL SYSTEMS AND INTEGRATION; BEYOND ISO 8000-110; CONCLUDING REMARKS.
Appendix A -- Some Commonly Used ER Comparators.
Includes bibliographical references and index.