語系:
繁體中文
English
說明(常見問題)
圖資館首頁
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
SQL for data sciencedata cleaning, w...
~
Badia, Antonio.
SQL for data sciencedata cleaning, wrangling and analytics with relational databases /
紀錄類型:
書目-電子資源 : Monograph/item
正題名/作者:
SQL for data scienceby Antonio Badia.
其他題名:
data cleaning, wrangling and analytics with relational databases /
作者:
Badia, Antonio.
出版者:
Cham :Springer International Publishing :2020.
面頁冊數:
xi, 285 p. :ill., digital ;24 cm.
Contained By:
Springer Nature eBook
標題:
Database management.
電子資源:
https://doi.org/10.1007/978-3-030-57592-2
ISBN:
9783030575922$q(electronic bk.)
SQL for data sciencedata cleaning, wrangling and analytics with relational databases /
Badia, Antonio.
SQL for data science
data cleaning, wrangling and analytics with relational databases /[electronic resource] :by Antonio Badia. - Cham :Springer International Publishing :2020. - xi, 285 p. :ill., digital ;24 cm. - Data-centric systems and applications,2197-9723. - Data-centric systems and applications..
1. The Data Life Cycle -- 2. Relational Data -- 3. Data Cleaning and Pre-processing -- 4. Introduction to Data Analysis -- 5. More SQL -- 6. Databases and Other Tools.
This textbook explains SQL within the context of data science and introduces the different parts of SQL as they are needed for the tasks usually carried out during data analysis. Using the framework of the data life cycle, it focuses on the steps that are very often given the short shift in traditional textbooks, like data loading, cleaning and pre-processing. The book is organized as follows. Chapter 1 describes the data life cycle, i.e. the sequence of stages from data acquisition to archiving, that data goes through as it is prepared and then actually analyzed, together with the different activities that take place at each stage. Chapter 2 gets into databases proper, explaining how relational databases organize data. Non-traditional data, like XML and text, are also covered. Chapter 3 introduces SQL queries, but unlike traditional textbooks, queries and their parts are described around typical data analysis tasks like data exploration, cleaning and transformation. Chapter 4 introduces some basic techniques for data analysis and shows how SQL can be used for some simple analyses without too much complication. Chapter 5 introduces additional SQL constructs that are important in a variety of situations and thus completes the coverage of SQL queries. Lastly, chapter 6 briefly explains how to use SQL from within R and from within Python programs. It focuses on how these languages can interact with a database, and how what has been learned about SQL can be leveraged to make life easier when using R or Python. All chapters contain a lot of examples and exercises on the way, and readers are encouraged to install the two open-source database systems (MySQL and Postgres) that are used throughout the book in order to practice and work on the exercises, because simply reading the book is much less useful than actually using it. This book is for anyone interested in data science and/or databases. It just demands a bit of computer fluency, but no specific background on databases or data analysis. All concepts are introduced intuitively and with a minimum of specialized jargon. After going through this book, readers should be able to profitably learn more about data mining, machine learning, and database management from more advanced textbooks and courses.
ISBN: 9783030575922$q(electronic bk.)
Standard No.: 10.1007/978-3-030-57592-2doiSubjects--Topical Terms:
182428
Database management.
LC Class. No.: QA76.9.D3 / B33 2020
Dewey Class. No.: 005.74
SQL for data sciencedata cleaning, wrangling and analytics with relational databases /
LDR
:03549nmm a2200349 a 4500
001
589343
003
DE-He213
005
20201109200700.0
006
m d
007
cr nn 008maaau
008
210601s2020 sz s 0 eng d
020
$a
9783030575922$q(electronic bk.)
020
$a
9783030575915$q(paper)
024
7
$a
10.1007/978-3-030-57592-2
$2
doi
035
$a
978-3-030-57592-2
040
$a
GP
$c
GP
041
0
$a
eng
050
4
$a
QA76.9.D3
$b
B33 2020
072
7
$a
UN
$2
bicssc
072
7
$a
COM021000
$2
bisacsh
072
7
$a
UN
$2
thema
072
7
$a
UMT
$2
thema
082
0 4
$a
005.74
$2
23
090
$a
QA76.9.D3
$b
B136 2020
100
1
$a
Badia, Antonio.
$3
348285
245
1 0
$a
SQL for data science
$h
[electronic resource] :
$b
data cleaning, wrangling and analytics with relational databases /
$c
by Antonio Badia.
260
$a
Cham :
$b
Springer International Publishing :
$b
Imprint: Springer,
$c
2020.
300
$a
xi, 285 p. :
$b
ill., digital ;
$c
24 cm.
490
1
$a
Data-centric systems and applications,
$x
2197-9723
505
0
$a
1. The Data Life Cycle -- 2. Relational Data -- 3. Data Cleaning and Pre-processing -- 4. Introduction to Data Analysis -- 5. More SQL -- 6. Databases and Other Tools.
520
$a
This textbook explains SQL within the context of data science and introduces the different parts of SQL as they are needed for the tasks usually carried out during data analysis. Using the framework of the data life cycle, it focuses on the steps that are very often given the short shift in traditional textbooks, like data loading, cleaning and pre-processing. The book is organized as follows. Chapter 1 describes the data life cycle, i.e. the sequence of stages from data acquisition to archiving, that data goes through as it is prepared and then actually analyzed, together with the different activities that take place at each stage. Chapter 2 gets into databases proper, explaining how relational databases organize data. Non-traditional data, like XML and text, are also covered. Chapter 3 introduces SQL queries, but unlike traditional textbooks, queries and their parts are described around typical data analysis tasks like data exploration, cleaning and transformation. Chapter 4 introduces some basic techniques for data analysis and shows how SQL can be used for some simple analyses without too much complication. Chapter 5 introduces additional SQL constructs that are important in a variety of situations and thus completes the coverage of SQL queries. Lastly, chapter 6 briefly explains how to use SQL from within R and from within Python programs. It focuses on how these languages can interact with a database, and how what has been learned about SQL can be leveraged to make life easier when using R or Python. All chapters contain a lot of examples and exercises on the way, and readers are encouraged to install the two open-source database systems (MySQL and Postgres) that are used throughout the book in order to practice and work on the exercises, because simply reading the book is much less useful than actually using it. This book is for anyone interested in data science and/or databases. It just demands a bit of computer fluency, but no specific background on databases or data analysis. All concepts are introduced intuitively and with a minimum of specialized jargon. After going through this book, readers should be able to profitably learn more about data mining, machine learning, and database management from more advanced textbooks and courses.
650
0
$a
Database management.
$3
182428
650
0
$a
Big data.
$3
609582
650
0
$a
SQL (Computer program language)
$3
189402
650
1 4
$a
Database Management.
$3
273994
650
2 4
$a
Big Data/Analytics.
$3
742047
710
2
$a
SpringerLink (Online service)
$3
273601
773
0
$t
Springer Nature eBook
830
0
$a
Data-centric systems and applications.
$3
568971
856
4 0
$u
https://doi.org/10.1007/978-3-030-57592-2
950
$a
Computer Science (SpringerNature-11645)
筆 0 讀者評論
全部
電子館藏
館藏
1 筆 • 頁數 1 •
1
條碼號
館藏地
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
000000191778
電子館藏
1圖書
電子書
EB QA76.9.D3 B136 2020 2020
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
多媒體檔案
https://doi.org/10.1007/978-3-030-57592-2
評論
新增評論
分享你的心得
Export
取書館別
處理中
...
變更密碼
登入