在线咨询
eetop公众号 创芯大讲堂 创芯人才网
切换到宽版

EETOP 创芯网论坛 (原名:电子顶级开发网)

手机号码,快捷登录

手机号码,快捷登录

找回密码

  登录   注册  

快捷导航
搜帖子
查看: 5104|回复: 20

国外大学大数据、数据挖掘类课程--【信息存储和信息检索】

[复制链接]
发表于 2015-3-2 14:10:03 | 显示全部楼层 |阅读模式

马上注册,结交更多好友,享用更多功能,让你轻松玩转社区。

您需要 登录 才可以下载或查看,没有账号?注册

x
[size=1em]CSCE 670 :Information Storage and Retrieval Spring 2014Tues/Thurs 12:45-2:00pm in HRBB 113Instructor: James Caverlee, HRBB 403Office Hours: Tues 4-5pm, or by appointmentDepartment of Computer Science and EngineeringTexas A&M University
TA: Haokai Lu, 408AOffice Hours: Mon/Wed 4-5pm






















Course Summary

In this course, we'll study the theory, design, and implementation of text-based and Web-based information retrieval systems, including an examination of web and social media mining algorithms and techniques at the core of modern search and data mining applications. By the end of the semester you will be able to:
  • Define and explain the key concepts and models relevant to information storage and retrieval, including efficient text indexing, boolean, vector space and probabilistic retrieval models, relevance feedback, document clustering and text categorization, Web search, including crawling, indexing, and link-based algorithms like PageRank.
  • Design, implement, and evaluate the core algorithms underlying a fully functional web search / data mining system, including the indexing, retrieval, and ranking components, as well as advanced algorithms like document clustering and text categorization.
  • Identify the salient features and apply recent research results in web search and data mining, including topics such as collaborative filtering, adversarial information retrieval, location-based services, and social information management.

Communication
All course communication will be via Piazza. We will post often to Piazza, so you should plan to check it often (every day).
Prerequisites
I expect all students to have had some previous exposure to basic probability, statistics, algorithms, and data structures. You should be able to design and develop large programs and learn new software libraries on your own.
Textbooks
The primary textbook is IIR: Introduction to Information Retrieval, Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schutze, Cambridge University Press. 2008. Available at Cambridge University Press, at Amazon, and other fine booksellers.
We'll also read some selections from:
  • MMD: Mining of Massive Datasets, Anand Rajarman and Jeffrey D. Ullman.
  • DITP: Data-Intensive Text Processing with MapReduce, by Lin and Dyer, 2010.
  • NCM: Networks, Crowds, and Markets: Reasoning About a Highly Connected World, David Easley and Jon Kleinberg, Cambridge University Press. 2010.
  • As well as several papers and other resources provided in the course schedule (with links).

You may find some of these optional textbooks helpful, though none are required:
  • Modern Information Retrieval, by Baeza-Yates and Ribeiro-Neto.
  • Managing Gigabytes, by Witten, Moffat, and Bell.
  • Foundations of Statistical Natural Language Processing, by Manning and Schutze.
  • Search Engines: Information Retrieval in Practice, by Croft, Metzler, and Strohman.

It is critically important that you study the relevant course readings before class so that we can make the most of our limited class time together. I treat our class meetings as opportunities to highlight significant aspects of the material, to answer questions, to engage in discussions about particular topics, and so on. We cannot cover all of the material in class, so it is up to you to stay on top of the readings and the assignments.
 楼主| 发表于 2015-3-2 20:52:20 | 显示全部楼层
回复 1# netshell

CSCE670.part1.rar (14.2 MB, 下载次数: 55 )

CSCE670.part2.rar (14.2 MB, 下载次数: 56 )


CSCE670.part3.rar (14.2 MB, 下载次数: 54 )


CSCE670.part4.rar (14.2 MB, 下载次数: 52 )


CSCE670.part5.rar (14.2 MB, 下载次数: 53 )


CSCE670.part6.rar (9.46 MB, 下载次数: 34 )
   
6个包!完整~
发表于 2015-3-4 07:48:21 | 显示全部楼层
Here goes the credit
发表于 2015-3-4 07:52:14 | 显示全部楼层
Big data is useless !
 楼主| 发表于 2015-3-4 09:26:11 | 显示全部楼层
回复 4# pupukid


   你啥意思?大数据没有用吗?
   今年ISSCC的主题都是大数据!!!
发表于 2017-2-9 14:42:43 | 显示全部楼层
回复 2# netshell

谢谢暗暗啊啊啊
发表于 2017-2-9 14:43:36 | 显示全部楼层
回复 2# netshell

谢谢啊暗暗啊啊啊
发表于 2017-2-9 14:52:09 | 显示全部楼层
回复 1# netshell

谢谢啊啊暗暗啊
发表于 2017-2-25 17:06:11 | 显示全部楼层
回复 8# caltech_usa


   CESC
发表于 2017-6-12 18:11:36 | 显示全部楼层
回复 8# caltech_usa


   国外大学大数据、数据挖掘类课程--【信息存储和信息检索】
您需要登录后才可以回帖 登录 | 注册

本版积分规则

关闭

站长推荐 上一条 /2 下一条

×

小黑屋| 手机版| 关于我们| 联系我们| 在线咨询| 隐私声明| EETOP 创芯网
( 京ICP备:10050787号 京公网安备:11010502037710 )

GMT+8, 2024-11-23 02:00 , Processed in 0.022760 second(s), 8 queries , Gzip On, Redis On.

eetop公众号 创芯大讲堂 创芯人才网
快速回复 返回顶部 返回列表