Videos tagged with Web Mining




Boosting Performance of Web Search Engines Using Query Logs

Boosting Performance of Web Search Engines Using Query Logs

Posted in Science

Lecture slides: Introduction What is a Query Log Our Logs What Does it Contain? How We Exploited Caching Policies Probability Driven Caching Static Dynamic Caching SDC Sections SDC and Prefetching Hit Ratio Throughput Collection Partitioning and Selection Innovations Contingency Matrix Co-clustering Results Experiments Precision Better Term Partitioning Why Term Partitioning? Encouraging Figure...

Tags: Science, Lectures, Computer Science, VideoLectures.Net, Web Mining


Ranking Web Sites with Real User Traffic

Ranking Web Sites with Real User Traffic

Posted in Science

Lecture slides: Outline Sources for Ranking Data: The Link Graph Sources for Ranking Data: Dynamic Sources Sources for Ranking Data: Packet Inspection Data Collection Host graphs Structural properties: Degree Caveat: Sampling Bias Structural properties: Strength (Site Traffic) Structural properties: Weights (Link Traffic) Behavioral patterns (HUMAN) Ratios are stable Validation of PageRank Kend...

Tags: Science, Lectures, Computer Science, VideoLectures.Net, Web Mining


Graph Fibrations, graph isomorphism and PageRank

Graph Fibrations, graph isomorphism and PageRank

Posted in Science

Lecture slides: Things related to PageRank Covering projections in algebraic topology Covering projections in modern mathematics From covering projections to fibrations My own personal relation with fibrations A graph is a graph is a graph... Graph morphisms Graph fibration A graph fibration is... A basic ingredient: universal total graph Basic property of universal total graphs Minimum base Ma...

Tags: Science, Lectures, Math, Computer Science, VideoLectures.Net, Graph Theory, Web Mining




Theoretical analysis of Link Analysis Ranking

Theoretical analysis of Link Analysis Ranking

Posted in Science

Lecture slides: Link Analysis Ranking Why theoretical analysis of Link Analysis Ranking? Link Analysis Ranking algorithm Popular LAR algorithms Properties of Interest Distance between LAR vectors Stability: graph distance Stability Stability: Results Perturbations of PageRank Instability of PageRank Singular Value Decomposition Instability of HITS Stability of HITS Similarity Similarity: Result...

Tags: Science, Lectures, Computer Science, VideoLectures.Net, Network Analysis, Web Mining


Using Rank Propagation and Probabilistic Counting for Link-based Spam Detection

Using Rank Propagation and Probabilistic Counting for Link-based Spam Detection

Posted in Companies, Science

Lecture slides: Content What is on the Web? Web spam (keywords + links) Web spam (mostly keywords) Search engine? Fake search engine Problem: “normal” pages that are spam Link farms Motivation Metrics Test collection Degree-based measures Degree Edge reciprocity Assortativity Automatic classifier PageRank Maximum PageRank in the Host Variance of PageRank Variance of PageRank of in-n...

Tags: Yahoo!, Science, Lectures, Computer Science, VideoLectures.Net, Network Analysis, Web Mining


Mixture Models and Collaborative Filtering Algorithms

Mixture Models and Collaborative Filtering Algorithms

Posted in Science

Lecture slides: The Wisdom of Crowds Francis Galton visits a Country Fair (1906) Wisdom of Crowds on the Web Wisdom of Crowds on the Web: Google Mining the Internet People and Music Collaborative filtering Mixture model Biclustering Genes Bi-clustering algorithms Collaborative Filtering in Mixture Models A Porfolio of Iterative Biclustering Algorithms Tests on generated data Results for the dis...

Tags: Science, Lectures, Computer Science, Clustering, Machine Learning, VideoLectures.Net, Web Mining


Applications of Query Mining

Applications of Query Mining

Posted in Companies, Science

Lecture slides: European Yahoo! Research Lab Yahoo! World Yahoo! Numbers Crawled Data Produced data Observed Data The power of social media Fight Spam My Motivations for Web Mining Mining Queries for ... Web Queries Relevance of the Context Context Using the Context Context in Web Queries User Goals Features Clustering Queries Our Approach Clusters Examples Query Recommendation Simple Query Rec...

Tags: Yahoo!, Science, Lectures, Computer Science, VideoLectures.Net, Web Mining


Efficient and Decentralized PageRank Approximation in a P2P Web Search Network

Efficient and Decentralized PageRank Approximation in a P2P Web Search Network

Posted in Science

Lecture slides: Outline Motivation Related Work JXP Algorithm World Node The Algorithm Example Peer Selection Strategy MIPs MIPs Example Mathematical Analysis Setup Overall performance comparison JXP in P2P Search Results Conclusions and Ongoing Work Author: Josiane Parreira, Max Planck Institute

Tags: Science, Lectures, Computer Science, VideoLectures.Net, Network Analysis, Web Mining, P2P, PageRank