Outlier Detection in Python

· 판매자: Simon and Schuster
eBook
560
페이지
적용 가능

eBook 정보

Learn how to identify the unusual, interesting, extreme, or inaccurate parts of your data.

Data scientists have two main tasks: finding patterns in data and finding the exceptions. These outliers are often the most informative parts of data, revealing hidden insights, novel patterns, and potential problems. Outlier Detection in Python is a practical guide to spotting the parts of a dataset that deviate from the norm, even when they're hidden or intertwined among the expected data points.

In Outlier Detection in Python you'll learn how to:

• Use standard Python libraries to identify outliers
• Select the most appropriate detection methods
• Combine multiple outlier detection methods for improved results
• Interpret your results effectively
• Work with numeric, categorical, time series, and text data

Outlier detection is a vital tool for modern business, whether it's discovering new products, expanding markets, or flagging fraud and other suspicious activities. This guide presents the core tools for outlier detection, as well as techniques utilizing the Python data stack familiar to data scientists. To get started, you'll only need a basic understanding of statistics and the Python data ecosystem.

About the technology

Outliers—values that appear inconsistent with the rest of your data—can be the key to identifying fraud, performing a security audit, spotting bot activity, or just assessing the quality of a dataset. This unique guide introduces the outlier detection tools, techniques, and algorithms you’ll need to find, understand, and respond to the anomalies in your data.

About the book

Outlier Detection in Python illustrates the principles and practices of outlier detection with diverse real-world examples including social media, finance, network logs, and other important domains. You’ll explore a comprehensive set of statistical methods and machine learning approaches to identify and interpret the unexpected values in tabular, text, time series, and image data. Along the way, you’ll explore scikit-learn and PyOD, apply key OD algorithms, and add some high value techniques for real world OD scenarios to your toolkit.

What's inside

• Python libraries to identify outliers
• Combine outlier detection methods
• Interpret your results

About the reader

For Python programmers familiar with tools like pandas and NumPy, and the basics of statistics.

About the author

Brett Kennedy is a data scientist with over thirty years’ experience in software development and data science.

Table fo Contents

Part 1
1 Introducing outlier detection
2 Simple outlier detection
3 Machine learning-based outlier detection
4 The outlier detection process
Part 2
5 Outlier detection using scikit-learn
6 The PyOD library
7 Additional libraries and algorithms for outlier detection
Part 3
8 Evaluating detectors and parameters
9 Working with specific data types
10 Handling very large and very small datasets
11 Synthetic data for outlier detection
12 Collective outliers
13 Explainable outlier detection
14 Ensembles of outlier detectors
15 Working with outlier detection predictions
Part 4
16 Deep learning-based outlier detection
17 Time-series data

저자 정보

Brett Kennedy is a data scientist with over thirty years’ experience in software development and data science. He has worked in outlier detection related to financial auditing, fraud detection, and social media analysis. He previously led a research team focusing on outlier detection.

이 eBook 평가

의견을 알려주세요.

읽기 정보

스마트폰 및 태블릿
AndroidiPad/iPhoneGoogle Play 북 앱을 설치하세요. 계정과 자동으로 동기화되어 어디서나 온라인 또는 오프라인으로 책을 읽을 수 있습니다.
노트북 및 컴퓨터
컴퓨터의 웹브라우저를 사용하여 Google Play에서 구매한 오디오북을 들을 수 있습니다.
eReader 및 기타 기기
Kobo eReader 등의 eBook 리더기에서 읽으려면 파일을 다운로드하여 기기로 전송해야 합니다. 지원되는 eBook 리더기로 파일을 전송하려면 고객센터에서 자세한 안내를 따르세요.