Antonio Gulli

Curriculum Vitae 

Name: Antonio Gulli
whois: AG2-ORG
Nationality: Italian
Net Citizen: since 1989 ...
Date of Birth: January 16, 1971
e-mail: gulli@di.unipi.it
HTTP: http://www.unipi.it/~gulli/
Address: Dipartimento di Informatica 56100 PISA (PI) - ITALY
Search Engine: Antonio Gulli Antonio
Who is that guy?

Overview

  • October 1997, Laurea degree in Computer Science from the University of Pisa, Italy.
  • December 2003, Master's degree in Engineering from the University of Pisa, Italy.
  • November 2002-present, second-year Ph.D. student in Computer Science at the University of Pisa.
My Ph.D. advisor is Prof. Paolo Ferragina. My research is currently in the field of web search engines, specifically
the design of algorithms and data structures for the efficient computation and update of PageRank values
of massive web graphs, as well as the design and engineering of novel algorithms for clustering and labeling search engine
snippets (à la Vivisimo).

Research & Work Experiences

  • November 1999 - December 2002, CTO, Ideare S.p.A., Tiscali group, Pisa (Italy). Web search engine.
  • September 1999, Fireball Search Engine, Lycos group, Hamburg (Germany). Automatic classification of web pages.
  • July 1998, Arianna Search Engine, Wind group, Pisa – Milan (Italy). First Italian Web spider.

Search Engine Activities

I have worked in the search engine field since 1997. I was the chief architect of many products, such as:
  • Spidering system, used by Arianna
  • Automatic Directory, used by Arianna, Fireball
  • Audio Search Engine, used by Tiscali, Jumpy, CiaoWeb, Arianna
  • Video Search Engine, used by Tiscali, Jumpy, CiaoWeb
  • Image Search Engine, used by Tiscali, Jumpy, CiaoWeb
  • Usenet Search Engine, used by Arianna
  • Pseudo Real time news engine, used by Arianna, Corriere della Sera, La Repubblica
  • Chat & Search Engine on IRC, used by Tiscali, EdisonTel
  • Shopping Comparison Engine, used by Tiscali, SuperEva
  • Web Search Engine, used by Tiscali, SuperEva, Interfree, Infocamere
These solutions were licensed to Grun+Jahr, Fireball (Lycos group), SuperEva (Dada group), Arianna MP3, Arianna Usenet, Arianna Infostrada, CiaoWeb (Fiat group), Interfree (CDC group), Infocamere, Jumpy (Mediaset group), Excite MP3, EdisonTel, Tiscali, and to the most important Italian newspapers: Corriere della Sera, Nazione, Giorno, Il Resto del Carlino, and Repubblica.

Papers

Clustering: a flat list of search results is not enough...

  • The Anatomy of a Clustering Engine for Web Snippets. TR-04-04 Technical Report, Dipartimento di Informatica.
  • The Anatomy of SnakeT: a hierarchical clustering engine for web-page snippets. In Proceedings of PKDD/ECML 2004
  • Experimenting SnakeT: a hierarchical clustering engine for web-page snippets. In Proceedings of PKDD/ECML 2004

Ranking: how to accelerate Google's PageRank...

  • Exploiting Web Matrix Permutations to Speedup PageRank Computation.
    IIT TR-04/2004 Technical Report, Istituto di Informatica e Telematica.
  • Fast PageRank Computation via a sparse linear system. In Proceedings of Third Workshop on Algorithms and Models for the Web-Graph (WAW 2004).

Categorization: how to build a Web Directory automatically...

  • Automatic Web page categorization by link and context analysis. In Proceedings of THAI'99, European Symposium on Telematics, Hypermedia and Artificial Intelligence, Varese, IT, pp. 105-119.
  • THESEUS: categorization by context. In Poster Proceedings of WWW'99, 8th International Conference on the World Wide Web, Toronto, CA, pp. 136-137.

Spidering: how to accelerate a search engine's gathering...

  • Web Host Enumeration Through DNS. In Proceedings of WebNet 97 - World Conference on the WWW, Internet & Intranet, Toronto, Canada, November 1-5, 1997.

Balance: how to balance a web server's load...

  • Jamming.Net: a Server to Balance WWW Load. In Proceedings of WebNet 98 - World Conference on the WWW and Internet & Intranet, Orlando, Florida, USA, November 7-12, 1998.

SQL: how to accelerate SQL using an ad-hoc Web server...

  • SqlWWW: un server Internet per accedere alla base di dati della Soprintendenza ai Beni Ambientali, Architettonici, Artistici e Storici di Pisa, con connessioni di tipo Keep-Alive. In Bollettino VI, n. 1, Scuola Normale Superiore.

Talks

  • "Ranking the Web", Invited Tutorial, Fun04

Teaching

  • Ph.D. Course "Algorithms for Internet and the Web: Web Search Engine and Ranking", Dottorato in Informatica ed Applicazioni. Spring 2004 (Florence)
  • Master Degree Course "Laboratorio di Programmazione di Sistema", Dipartimento di Informatica. Spring 2004 (Pisa) -- assistant
  • Master Degree Course "Fondamenti di Programmazione", Dipartimento di Matematica. Spring 2004 (Pisa) -- assistant
  • Master Degree Course "Algoritmi per Internet e Web: Indicizzazione e Ricerca nei Testi", Dipartimento di Matematica. Spring 2004 (Pisa) -- assistant

Open Source Software

  • Net Knife: an Internet suite for network analysis. More than 10,000 copies downloaded.
  • Jamming.Net: a multithreaded Java server for HTTP load balancing.