Data Scientist: The Definitive Guide to Becoming a Data Scientist

Technics Publications
1
Free sample

As our society transforms into a data-driven one, the role of the Data Scientist is becoming more and more important. If you want to be on the leading edge of what is sure to become a major profession in the not-too-distant future, this book can show you how. Each chapter is filled with practical information that will help you reap the fruits of big data and become a successful Data Scientist: • Learn what big data is and how it differs from traditional data through its main characteristics: volume, variety, velocity, and veracity. • Explore the different types of Data Scientists and the skillset each one has. • Dig into what the role of the Data Scientist requires in terms of the relevant mindset, technical skills, experience, and how the Data Scientist connects with other people. • Be a Data Scientist for a day, examining the problems you may encounter and how you tackle them, what programs you use, and how you expand your knowledge and know-how. • See how you can become a Data Scientist, based on where you are starting from: a programming, machine learning, or data-related background. • Follow step-by-step through the process of landing a Data Scientist job: where you need to look, how you would present yourself to a potential employer, and what it takes to follow a freelancer path. • Read the case studies of experienced, senior-level Data Scientists, in an attempt to get a better perspective of what this role is, in practice. At the end of the book, there is a glossary of the most important terms that have been introduced, as well as three appendices – a list of useful sites, some relevant articles on the web, and a list of offline resources for further reading.
Read more

About the author

Dr. Zacharias Voulgaris was born and raised in Greece. Upon completing a 5-year Engineering degree at the Technical University of Crete, he enrolled in the City University of London for a Masters course in Information Systems and Technology. Afterwards, he pursued a PhD in Birkbeck College (University of London), under the joint supervision of Prof. G. Magoulas and Prof. B. Mirkin, in the field of Machine Learning. Upon receiving his doctorate, he was recruited by the Georgia Institute of Technology as a research fellow. Since January 2013 he has been working as a Data Scientist.
Read more

Reviews

5.0
1 total
Loading...

Additional Information

Publisher
Technics Publications
Read more
Published on
May 1, 2014
Read more
Pages
278
Read more
ISBN
9781634620284
Read more
Read more
Best For
Read more
Language
English
Read more
Genres
Computers / Databases / Data Mining
Computers / Mathematical & Statistical Software
Read more
Content Protection
This content is DRM protected.
Read more

Reading information

Smartphones and Tablets

Install the Google Play Books app for Android and iPad/iPhone. It syncs automatically with your account and allows you to read online or offline wherever you are.

Laptops and Computers

You can read books purchased on Google Play using your computer's web browser.

eReaders and other devices

To read on e-ink devices like the Sony eReader or Barnes & Noble Nook, you'll need to download a file and transfer it to your device. Please follow the detailed Help center instructions to transfer the files to supported eReaders.
Zacharias Voulgaris, PhD
 Master how to use the Julia language to solve business critical data science challenges. After covering the importance of Julia to the data science community and several essential data science principles, we start with the basics including how to install Julia and its powerful libraries. Many examples are provided as we illustrate how to leverage each Julia command, dataset, and function.

Specialized script packages are introduced and described. Hands-on problems representative of those commonly encountered throughout the data science pipeline are provided, and we guide you in the use of Julia in solving them using published datasets. Many of these scenarios make use of existing packages and built-in functions, as we cover:

1.      1. An overview of the data science pipeline along with an example illustrating the key points, implemented in Julia

2.      2. Options for Julia IDEs

3.     3.  Programming structures and functions

4.     4.  Engineering tasks, such as importing, cleaning, formatting and storing data, as well as performing data preprocessing

5.      5. Data visualization and some simple yet powerful statistics for data exploration purposes

6.      6. Dimensionality reduction and feature evaluation

7.      7. Machine learning methods, ranging from unsupervised (different types of clustering) to supervised ones (decision trees, random forests, basic neural networks, regression trees, and Extreme Learning Machines)

8.      8. Graph analysis including pinpointing the connections among the various entities and how they can be mined for useful insights.

Each chapter concludes with a series of questions and exercises to reinforce what you learned. The last chapter of the book will guide you in creating a data science application from scratch using Julia.

 

Sebastian Raschka
Unlock deeper insights into Machine Leaning with this vital guide to cutting-edge predictive analyticsAbout This BookLeverage Python's most powerful open-source libraries for deep learning, data wrangling, and data visualizationLearn effective strategies and best practices to improve and optimize machine learning systems and algorithmsAsk – and answer – tough questions of your data with robust statistical models, built for a range of datasetsWho This Book Is For

If you want to find out how to use Python to start answering critical questions of your data, pick up Python Machine Learning – whether you want to get started from scratch or want to extend your data science knowledge, this is an essential and unmissable resource.

What You Will LearnExplore how to use different machine learning models to ask different questions of your dataLearn how to build neural networks using Keras and TheanoFind out how to write clean and elegant Python code that will optimize the strength of your algorithmsDiscover how to embed your machine learning model in a web application for increased accessibilityPredict continuous target outcomes using regression analysisUncover hidden patterns and structures in data with clusteringOrganize data using effective pre-processing techniquesGet to grips with sentiment analysis to delve deeper into textual and social media dataIn Detail

Machine learning and predictive analytics are transforming the way businesses and other organizations operate. Being able to understand trends and patterns in complex data is critical to success, becoming one of the key strategies for unlocking growth in a challenging contemporary marketplace. Python can help you deliver key insights into your data – its unique capabilities as a language let you build sophisticated algorithms and statistical models that can reveal new perspectives and answer key questions that are vital for success.

Python Machine Learning gives you access to the world of predictive analytics and demonstrates why Python is one of the world's leading data science languages. If you want to ask better questions of data, or need to improve and extend the capabilities of your machine learning systems, this practical data science book is invaluable. Covering a wide range of powerful Python libraries, including scikit-learn, Theano, and Keras, and featuring guidance and tips on everything from sentiment analysis to neural networks, you'll soon be able to answer some of the most important questions facing you and your organization.

Style and approach

Python Machine Learning connects the fundamental theoretical principles behind machine learning to their practical application in a way that focuses you on asking and answering the right questions. It walks you through the key elements of Python and its powerful machine learning libraries, while demonstrating how to get to grips with a range of statistical models.

Zacharias Voulgaris, PhD
 Master how to use the Julia language to solve business critical data science challenges. After covering the importance of Julia to the data science community and several essential data science principles, we start with the basics including how to install Julia and its powerful libraries. Many examples are provided as we illustrate how to leverage each Julia command, dataset, and function.

Specialized script packages are introduced and described. Hands-on problems representative of those commonly encountered throughout the data science pipeline are provided, and we guide you in the use of Julia in solving them using published datasets. Many of these scenarios make use of existing packages and built-in functions, as we cover:

1.      1. An overview of the data science pipeline along with an example illustrating the key points, implemented in Julia

2.      2. Options for Julia IDEs

3.     3.  Programming structures and functions

4.     4.  Engineering tasks, such as importing, cleaning, formatting and storing data, as well as performing data preprocessing

5.      5. Data visualization and some simple yet powerful statistics for data exploration purposes

6.      6. Dimensionality reduction and feature evaluation

7.      7. Machine learning methods, ranging from unsupervised (different types of clustering) to supervised ones (decision trees, random forests, basic neural networks, regression trees, and Extreme Learning Machines)

8.      8. Graph analysis including pinpointing the connections among the various entities and how they can be mined for useful insights.

Each chapter concludes with a series of questions and exercises to reinforce what you learned. The last chapter of the book will guide you in creating a data science application from scratch using Julia.

 

©2017 GoogleSite Terms of ServicePrivacyDevelopersArtistsAbout Google
By purchasing this item, you are transacting with Google Payments and agreeing to the Google Payments Terms of Service and Privacy Notice.