Course Summary
After you’ve completed this course, you’ll be ready to dive into the Data Analyst Nanodegree, which is the ultimate stepping stone to a career as a Data Analyst!
上完這一部分的課,如果要更專精就可以上 Data Analyst Nanodegree 這門課。
Why Take This Course?
This course will introduce you to the world of data analysis. You'll learn how to go through the entire data analysis process, which includes:
- Posing a question
- Wrangling your data into a format you can use and fixing any problems with it
- Exploring the data, finding patterns in it, and building your intuition about it
- Drawing conclusions and/or making predictions
- Communicating your findings
You'll also learn how to use the Python libraries NumPy, Pandas, and Matplotlib to write code that's cleaner, more concise, and runs faster.
上這門課你可以學到 data analysis 的流程:從假設問題、針對問題整理相關資料、探索資料找出 pattern、作結論與預測,最後跟大家說你的發現。
Syllabus
Dictionaries
In this lesson you will get an awareness of an dictionaries, an important in Python that you will going to encounter in the rest of this path.
會介紹 Python 中很重要的 dictionaries
Data Analysis Process
In this lesson, you will learn about the data analysis process, which includes posing a question, wrangling and exploring your data, drawing conclusions and/or making predictions, and communicating your findings. You will complete an analysis of Udacity student data using pure Python, with minimal reliance on additional libraries.
學習資料分析的流程,會實作一個分析,是關於 Udacity 學生的資料。
NumPy and Pandas for 1D Data
In this lesson, you will start learning to use NumPy and Pandas to make the data analysis process easier. This lesson focuses on features that apply to one-dimensional data. You'll learn to use NumPy arrays, Pandas Series, and vectorized operations.
NumPy and Pandas for 2D Data
In this lesson, you'll continue learning about NumPy and Pandas, this time focusing on two-dimensional data. You'll learn to use two-dimensional NumPy arrays and Pandas DataFrames. You'll also learn to group your data and to combine data from multiple files.
我們會學 NumPy 和 Pandas幫助我們做資料分析。
Getting Started with Data Analysis
The following content is broken into two core pieces:
課程可以分成兩個部分,一個是 dictionaries 的介紹,你需要擁有 dictionaries 的知識才能做資料分析。
Introduction To Dictionaries
Before you dive into the full Intro to Data Analysis course, you'll need a solid understanding of dictionaries. If you decide to enroll in the Data Analyst Nanodegree program, you will need this as a prerequisite as well. The following content on dictionaries comes from the same course as much of the concepts you covered in Code Your Own Quiz.
Intro to Data Analysis
You will be taking Intro to Data Analysis with Caroline. This course is a good first step towards understanding the data analysis process as a whole. Before delving into each individual phase, it is important to learn the difference between all phases of the process and how they relate to each other. This course also covers the Python libraries: NumPy, Pandas, and Matplotlib, which are indispensable tools for doing data analysis in Python. Their many convenient functions and high performance make writing data analysis code a lot easier!
接著就是了解資料分析的流程與每一部分,然後了解每一個部分彼此的關聯,課程就包含 Python 的 libraries NumPy、Pandas 和 Matplotlib,他們都是不可或缺的資料分析工具。
This course will require that you search and utilize documentation. Don’t hesitate to search through the documentation that exists for the Python libraries covered in this course:
- NumPy
- Pandas
- Matplotlib
課程會需要用的 documentation
The next two lessons will direct you to set up your computer by installing the Anaconda distribution and Jupyter Notebook -- two very useful tools for data analysts.
下面會教我們安裝 Anaconda distribution 跟 Jupyter Notebook 這兩個好用的資料分析工具。