Titre : |
Using OpenRefine : The essential OpenRefine guide that takes you from data analysis and error fixing to linking your dataset to the Web |
Type de document : |
texte imprimé |
Auteurs : |
Ruben Verborgh, Auteur ; Max De Wilde, Auteur |
Editeur : |
Birmingham [Royaume-Uni] : Packt Publishing Limited |
Année de publication : |
cop. 2013 |
Importance : |
1 vol. (95 p.) |
Présentation : |
ill. |
Format : |
24 cm |
ISBN/ISSN/EAN : |
978-1-78328-908-0 |
Note générale : |
Index |
Langues : |
Français (fre) |
Catégories : |
Bases de données ** Gestion Bases de données sur le Web
|
Index. décimale : |
004C Informatique documentaire - Recherche de l'information |
Résumé : |
"Data is supposed to be the new gold, but how can you unlock the value in your data? Managing large datasets used to be a task for specialists, but you don't have to worry about inconsistencies or errors anymore. OpenRefine lets you clean, link, and publish your dataset in a breeze.
Using OpenRefine takes you on a practical tour of all the handy features of this well-known data transformation tool. It is a hands-on recipe book that teaches you data techniques by example. Starting from the basics, it gradually transforms you into an OpenRefine expert.
This book will teach you all the necessary skills to handle any large dataset and to turn it into high-quality data for the Web. After you learn how to analyze data and spot issues, we'll see how we can solve them to obtain a clean dataset. Messy and inconsistent data is recovered through advanced techniques such as automated clustering. We'll then show extract links from keyword and full-text fields using reconciliation and named-entity extraction.
Using OpenRefine is more than a manual: it's a guide stuffed with tips and tricks to get the best out of your data."(site d'éditeur)
|
Note de contenu : |
CHAPTER 1. Diving Into OpenRefine
Introducing OpenRefine
Installing OpenRefine
Creating a new project
Exploring your data
Manipulating columns
Using the project history
Exporting a project
Going for more memory
Summary
CHAPTER 2. Analyzing and Fixing Data
Sorting data
Faceting data
Detecting duplicates
Applying a text filter
Using simple cell transformations
Removing matching rows
Summary
CHAPTER 3. Advanced Data Operations
Handling multi-valued cells
Alternating between rows and records mode
Clustering similar cells
Transforming cell values
Adding derived columns
Splitting data across columns
Transposing rows and columns
Summary
CHAPTER 4. Linking Datasets
Reconciling values with Freebase
Installing extensions
Adding a reconciliation service
Reconciling with Linked Data
Extracting named entities
|
Permalink : |
http://catalogue.iessid.be/index.php?lvl=notice_display&id=20784 |
Using OpenRefine : The essential OpenRefine guide that takes you from data analysis and error fixing to linking your dataset to the Web [texte imprimé] / Ruben Verborgh, Auteur ; Max De Wilde, Auteur . - Birmingham (Livery Place, 35 Livery Street, B3 2PB, Royaume-Uni) : Packt Publishing Limited, cop. 2013 . - 1 vol. (95 p.) : ill. ; 24 cm. ISBN : 978-1-78328-908-0 Index Langues : Français ( fre)
Catégories : |
Bases de données ** Gestion Bases de données sur le Web
|
Index. décimale : |
004C Informatique documentaire - Recherche de l'information |
Résumé : |
"Data is supposed to be the new gold, but how can you unlock the value in your data? Managing large datasets used to be a task for specialists, but you don't have to worry about inconsistencies or errors anymore. OpenRefine lets you clean, link, and publish your dataset in a breeze.
Using OpenRefine takes you on a practical tour of all the handy features of this well-known data transformation tool. It is a hands-on recipe book that teaches you data techniques by example. Starting from the basics, it gradually transforms you into an OpenRefine expert.
This book will teach you all the necessary skills to handle any large dataset and to turn it into high-quality data for the Web. After you learn how to analyze data and spot issues, we'll see how we can solve them to obtain a clean dataset. Messy and inconsistent data is recovered through advanced techniques such as automated clustering. We'll then show extract links from keyword and full-text fields using reconciliation and named-entity extraction.
Using OpenRefine is more than a manual: it's a guide stuffed with tips and tricks to get the best out of your data."(site d'éditeur)
|
Note de contenu : |
CHAPTER 1. Diving Into OpenRefine
Introducing OpenRefine
Installing OpenRefine
Creating a new project
Exploring your data
Manipulating columns
Using the project history
Exporting a project
Going for more memory
Summary
CHAPTER 2. Analyzing and Fixing Data
Sorting data
Faceting data
Detecting duplicates
Applying a text filter
Using simple cell transformations
Removing matching rows
Summary
CHAPTER 3. Advanced Data Operations
Handling multi-valued cells
Alternating between rows and records mode
Clustering similar cells
Transforming cell values
Adding derived columns
Splitting data across columns
Transposing rows and columns
Summary
CHAPTER 4. Linking Datasets
Reconciling values with Freebase
Installing extensions
Adding a reconciliation service
Reconciling with Linked Data
Extracting named entities
|
Permalink : |
http://catalogue.iessid.be/index.php?lvl=notice_display&id=20784 |
| |