日韩性视频-久久久蜜桃-www中文字幕-在线中文字幕av-亚洲欧美一区二区三区四区-撸久久-香蕉视频一区-久久无码精品丰满人妻-国产高潮av-激情福利社-日韩av网址大全-国产精品久久999-日本五十路在线-性欧美在线-久久99精品波多结衣一区-男女午夜免费视频-黑人极品ⅴideos精品欧美棵-人人妻人人澡人人爽精品欧美一区-日韩一区在线看-欧美a级在线免费观看

歡迎訪問 生活随笔!

生活随笔

當前位置: 首頁 > 编程资源 > 编程问答 >内容正文

编程问答

课堂笔记——Data Mining(1)

發布時間:2024/8/23 编程问答 36 豆豆
生活随笔 收集整理的這篇文章主要介紹了 课堂笔记——Data Mining(1) 小編覺得挺不錯的,現在分享給大家,幫大家做個參考.

一、Introduction

……

1、Major Issues in Data Mining

User Interaction

Presentation and visualization of data mining results : Efficiency and Scalability

Diversity of data types: complex types of data; Mining dynamic, networked, and global data repositories?

Data mining and society: Privacy-preserving; Social impacts of data mining; Invisible data mining

?

二、Getting to Know Your Data

1、Type of Data?Sets

Record:Relational records; Data matrix; Text documents; Transaction data

2、 Important Characteristics of Structured Data

Dimensionality: Curse of?dimensionality;

Sparsity: Only presnce counts;

Resolution: Patterns depend on the scale;

Distribution: Centrality and dispersion?

3、Attribute (dimensions features varibles)

types: Nominal; Ordinal; Binary: Symmetric, Asymmetric; Quantity: Interval, Ratio

Discrete Attribute

Continuous Attribute

4、Basic Statistical Descriptions of Data

Data dispersion characterstics: median, max, min, quantiles, outliers, variance

mean:Weighted arithmetic mean; Trimmed mean

5、Measuring the Dispersion of Data

Quartiles:Q1(25th percentile)、Q3(75th percentile)

Inter-quartile range(IQR):最當中的50%

Five number summary :min、Q1,median、Q3、max

6、Graphic Displays of Basic Statistcal Description?

7、五種數據分析圖

boxplot analysis:

Histogram Analysis

Quantile Plot

Quantile-Quantile Plot(Q-Q Plot)

Scatter Plot

8、?Categorization of visualization methods

Pixel-orirnted:?

① The m dimension values of a record are mapped to m pixels at the corresponding positions in the windows

② The color of pixel reflect corresponding values

③?For? a dataset of m dimensions, create m windows on the screen, one for each dimension

Parallel Coordinates:用于畫k維屬性的圖。

Geometric projection

Icon-based

Chenoff Faces:

?Stick Figures:A 5-piece stick figure

Hierarchical:

Dimensional Stacking

Worlds-within-Worlds

Tree-Map

Infocube

8、Similarity and? Dissimilarity

① Data matrix

② Dissimilarity matrix

Proximity Measure of Nominal Attributes

a. Simple matching

b. Use a large number of binary attributes: create a new binary attribute for each??

Standardizing Numeric Data: z-score

?

?

總結

以上是生活随笔為你收集整理的课堂笔记——Data Mining(1)的全部內容,希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網站內容還不錯,歡迎將生活随笔推薦給好友。