From Karen Lopez’s Foreword:
In this book, Thomas Frisendal raises important questions about the continued usefulness of traditional data modeling notations and approaches:Are Entity Relationship Diagrams (ERDs) relevant to analytical data requirements? Are ERDs relevant in the new world of Big Data? Are ERDs still the best way to work with business users to understand their needs? Are Logical and Physical Data Models too closely coupled? Are we correct in using the same notations for communicating with business users and developers? Should we refine our existing notations and tools to meet these new needs, or should we start again from a blank page? What new notations and approaches will we need? How will we use those to build enterprise database systems?
Frisendal takes us through the history of data modeling, enterprise data models and traditional modeling methods. He points out, quite contentiously, where he feels we have gone wrong and in a few places where we got it right. He then maps out the psychology of meaning and context, while identifying important issues about where data modeling may or may not fit in business modeling. The main subject of this work is a proposal for a new exploration-driven modeling approach and new modeling notations for business concept models, business solutions models, and physical data models with examples on how to leverage those for implementing into any target database or datastore. These new notations are based on a property graph approach to modeling data.
Thomas Frisendal is an experienced database consultant with more than 30 years on the IT vendor side and as an independent consultant. He has worked with databases and data modeling since the late 70s; since 1995 primarily on data warehouse projects. His area of excellence lies within the art of turning data into information and knowledge. Since 2005 he has specialized in business analysis, concept "harvesting" and mapping, modeling of information and data as well as design of analytical solutions. His approach to information-driven analysis and design is "New Nordic" in the sense that it represents the traditional Nordic values such as superior quality, functionality, reliability and innovation by new ways of communicating the structure and meaning of the business context. Thomas is an active writer and speaker, and at the time of writing also Chief Data Warehouse Architect at SimCorp. Thomas lives in Copenhagen, Denmark.
In stark contrast to this conventional wisdom, this book shows how views, just like base tables, can always be updated (so long as the updates don’t violate any integrity constraints). More generally, it shows how updating always ought to work, regardless of whether the target is a base table or a view. The proposed scheme is 100% consistent with the relational model, but rather different from the way updating works in SQL products today.
This book can:Help database products improve in the futureHelp with a "roll your own" implementation, absent such product improvementsMake you aware of the crucial role of predicates and constraintsShow you how relational products are really supposed to behave
Anyone with a professional interest in the relational model, relational technology, or database systems in general can benefit from this book.
The New Relational Database Dictionary defines all of these terms and many, many more. Carefully reviewed for clarity, accuracy, and completeness, this book is an authoritative and comprehensive resource for database professionals, with over 1700 entries (many with examples) dealing with issues and concepts arising from the relational model of data. DBAs, database designers, DBMS implementers, application developers, and database professors and students can find the information they need on a daily basis, information that isn’t readily available anywhere else.
If you are a student or Java developer or a budding data scientist who wishes to learn the fundamentals of data analysis and learn to perform data analysis with Java, this book is for you. Some familiarity with elementary statistics and relational databases will be helpful but is not mandatory, to get the most out of this book. A firm understanding of Java is required.What You Will LearnDevelop Java programs that analyze data sets of nearly any size, including textImplement important machine learning algorithms such as regression, classification, and clusteringInterface with and apply standard open source Java libraries and APIs to analyze and visualize dataProcess data from both relational and non-relational databases and from time-series dataEmploy Java tools to visualize data in various formsUnderstand multimedia data analysis algorithms and implement them in Java.In Detail
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the aim of discovering useful information. Java is one of the most popular languages to perform your data analysis tasks.
This book will help you learn the tools and techniques in Java to conduct data analysis without any hassle. After getting a quick overview of what data science is and the steps involved in the process, you'll learn the statistical data analysis techniques and implement them using the popular Java APIs and libraries. Through practical examples, you will also learn the machine learning concepts such as classification and regression.
In the process, you'll familiarize yourself with tools such as Rapidminer and WEKA and see how these Java-based tools can be used effectively for analysis. You will also learn how to analyze text and other types of multimedia. Learn to work with relational, NoSQL, and time-series data. This book will also show you how you can utilize different Java-based libraries to create insightful and easy to understand plots and graphs.
By the end of this book, you will have a solid understanding of the various data analysis techniques, and how to implement them using Java.Style and approach
The book takes a very comprehensive approach to enhance your understanding of data analysis. Sufficient real-world examples and use cases are included to help you grasp the concepts quickly and apply them easily in your day-to-day work. Packed with clear, easy-to-follow examples, this book will turn you into an ace data analyst in no time.
The Concept and Object Modeling Notation (COMN) is able to cover the full spectrum of analysis and design. A single COMN model can represent the objects and concepts in the problem space, logical data design, and concrete NoSQL and SQL document, key-value, columnar, and relational database implementations. COMN models enable an unprecedented level of traceability of requirements to implementation. COMN models can also represent the static structure of software and the predicates that represent the patterns of meaning in databases.
This book will teach you:the simple and familiar graphical notation of COMN with its three basic shapes and four line styles how to think about objects, concepts, types, and classes in the real world, using the ordinary meanings of English words that aren’t tangled with confused techno-speak how to express logical data designs that are freer from implementation considerations than is possible in any other notation how to understand key-value, document, columnar, and table-oriented database designs in logical and physical terms how to use COMN to specify physical database implementations in any NoSQL or SQL database with the precision necessary for model-driven development
NoSQL Distilled is a concise but thorough introduction to this rapidly emerging technology. Pramod J. Sadalage and Martin Fowler explain how NoSQL databases work and the ways that they may be a superior alternative to a traditional RDBMS. The authors provide a fast-paced guide to the concepts you need to know in order to evaluate whether NoSQL databases are right for your needs and, if so, which technologies you should explore further.
The first part of the book concentrates on core concepts, including schemaless data models, aggregates, new distribution models, the CAP theorem, and map-reduce. In the second part, the authors explore architectural and design issues associated with implementing NoSQL. They also present realistic use cases that demonstrate NoSQL databases at work and feature representative examples using Riak, MongoDB, Cassandra, and Neo4j.
In addition, by drawing on Pramod Sadalage’s pioneering work, NoSQL Distilled shows how to implement evolutionary design with schema migration: an essential technique for applying NoSQL databases. The book concludes by describing how NoSQL is ushering in a new age of Polyglot Persistence, where multiple data-storage worlds coexist, and architects can choose the technology best optimized for each type of data access.
"This book is a great contribution to the information management community. It combines a theoretical foundation with practical methods for dealing with important problems. This is rare and very useful. Conceptual models that communicate business reality effectively require some degree of creative imagination. As such, they combine the results of business analysis with communication design, as is extensively covered in this book."
Dr. Malcolm Chisholm, President at AskGet.com Inc.
“Truly understanding business requirements has always been a major stumbling block in business intelligence (BI) projects. In this book, Thomas Frisendal introduces a powerful technique—business concept mapping—that creates a virtual mind-meld between business users and business analysts. Frisendal does a wonderful explaining and demonstrating how this tool can improve the outcome of BI and other development projects ."
Wayne Eckerson, executive director, BI Leadership Forum