文章分类 | 推荐文章 | 最新文章 | 热点文章 | 最新软件 | 精品软件 | 下载排行 | 推荐下载 | firefox | WPS | 杀毒软件 | Picasa
清风网络
首 页 软件下载 网络学院 数码学院
QQ 电脑入门 游戏 操作系统 图形图像 办公软件 媒体动画 精文荟萃 常用软件 网页编程 技术开发 网络技术 认证考试 网站建设 文章专栏
当前位置:清风网络学院专栏GoogleGoogle的技术剖析
精品推荐
特别推荐
·Google展示其内部使用的网络工具
·Gmail 小技巧
·11种途径将提升英文网站PR值
·google提交Sitemaps的常见问题解答
·提高Google域名信任度的8个方法
·使用Google工具条有助于网站收录
·Google搜索引擎介绍
·google沙盒效应产生的原因及其避免方法
·Google搜索技巧2007版
·总结:Google使用技巧
·技巧:GoogleTalk快捷键列表!
·教你如何解除“该网站可能会损害您的计算机”提示
·网站赚钱:Google关键词广告创建的十二高招
·十个值得推荐的Google搜索技巧
·狂想Google未来十大功能
·《Google排名技巧》共十五课学习笔记
·Google AdSense优化的5个最重点提示
·如何让你的网站远离“该网站可能会损害您的计算机”警告?
·Gmail帐号被盗怎么办?几步即可找回
·Google Earth共享发布地标使用详解
热点TOP10
·Google展示其内部使用的网络工具
·GOOGLE的摄像头漏洞
·Google"全球偷窥"真相调查
·两行代码在任意页面实现谷歌卫星图
·Google搜索技巧2007版
·卫星地图Google中国 可能是一项“要命的创新”
·Gmail 小技巧
·Google Earth官方中文版试用(新增宇宙遨游功能)
·绝密隐私 有趣的网络摄像头大揭露
·活学活用Google
·Google搜索引擎介绍
·Google的秘密 招聘条件跟微软一样
·[Google Adsense]如何增加点击率
·Google搜索引擎,发现已经无法正常使用
·教你如何解除“该网站可能会损害您的计算机”提示
·总结:Google使用技巧
·《Google排名技巧》共十五课学习笔记
·33招Google技巧玩法
·c#实现google样式的分页
·google maps api document 中文翻译

Google的技术剖析

日期:2007年7月23日 作者: 查看:[大字体 中字体 小字体]


创始人Sergey Brin 和 Lawrence Page的研究论文
来源:www.51web.biz

The Anatomy of a Large-Scale

Hypertextual Web Search Engine
Sergey Brin and Lawrence Page

{sergey, page}@cs.stanford.edu

Computer Science Department, Stanford University, Stanford, CA 94305

Abstract
In this paper, we present Google, a prototype of a large-scale search engine which makes heavy use of the strUCture present in hypertext. Google is designed to crawl and index the Web efficiently and produce much more satisfying search results than existing systems. The prototype with a full text and hyperlink database of at least 24 million pages is available at http://google.stanford.edu/
To engineer a search engine is a challenging task. Search engines index tens to hundreds of millions of web pages involving a comparable number of distinct terms. They answer tens of millions of queries every day. Despite the importance of large-scale search engines on the web, very little academic research has been done on them. Furthermore, due to rapid advance in technology and web proliferation, creating a web search engine today is very different from three years ago. This paper provides an in-depth description of our large-scale web search engine -- the first such detailed public description we know of to date.
Apart from the problems of scaling traditional search techniques to data of this magnitude, there are new technical challenges involved with using the additional information present in hypertext to produce better search results. This paper addresses this question of how to build a practical large-scale system which can eXPloit the additional information present in hypertext. Also we look at the problem of how to effectively deal with uncontrolled hypertext collections where anyone can publish anything they want.
 

KeyWords: World Wide Web, Search Engines, Information Retrieval, PageRank, Google


 

1. Introduction
(Note: There are two versions of this paper -- a longer full version and a shorter printed version. The full version is available on the web and the conference CD-ROM.)
The web creates new challenges for information retrieval. The amount of information on the web is growing rapidly, as well as the number of new users inexperienced in the art of web research. People are likely to surf the web using its link graph, often starting with high quality human maintained indices such as Yahoo! or with search engines. Human maintained lists cover popular topics effectively but are subjective, expensive to build and maintain, slow to improve, and cannot cover all esoteric topics. Automated search engines that rely on keyword matching usually return too many low quality matches. To make matters worse, some advertisers attempt to gain people's attention by taking measures meant to mislead automated search engines. We have built a large-scale search engine which addresses many of the problems of existing systems. It makes especially heavy use of the additional structure present in hypertext to provide much higher quality search results. We chose our system name, Google, because it is a common spelling of googol, or 10100 and fits well with our goal of building very large-scale search engines.

[1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] [19] [20] [21] 下一页 




上一篇:关键词在GOOGLE中排名查询工具

下一篇:Google的原罪--网页序列等级

Google的技术剖析 相关文章:
·SCEA超大作《战神》完全权威评析(ps2) - 战神攻略秘籍 - 战神
·魔法门7 攻略解析
·输入验证+重启验证的软件破解 算法分析
·Windows系统进程列表完全解析
·[宠物]问道宝宝,肉盾,法宠,攻宠全面解析
·C语言的常用库函数使用方法分析及用途
·全面分析解决硬盘故障
·设计理念剖析:什么是“平面构成”
·股票分析专家 同花顺2008功能概述
·[任务]20-80级剧情任务分析
Google的技术剖析 相关软件:
·《红楼梦》对联赏析
·苏州19岁的美少女写真 RMVB高清析 极品美女吐血推荐
·赢证股市分析软件v4.0
·3D 动画与建模:人体的综合与分析技术
·文物典藏系列-故宫馆藏文房四宝赏析
·《股票常识与技术分析》
·股票常识与技术分析
·如何进行上市公司财务分析及公告解读
·系统分析师考试培训视频教程1
·哈佛经理的心理分析word格式

特别声明:本站除部分特别声明禁止转载的专稿外的其他文章可以自由转载,但请务必注明出处和原始作者。文章版权归文章原始作者所有。对于被本站转载文章的个人和网站,我们表示深深的谢意。如果本站转载的文章有版权问题请联系编辑人员,我们尽快予以更正。
[打印本页] [关闭窗口] 转载请注明来源:http://www.viphot.com
| 帮助(?) | 版权声明 | 友情连接 | 关于我们 | 信息发布
Copyright 2007 www.viphot.com All Rights Reserved. 鄂ICP备05000083号Powered by:vipcn