Espresso: Explaining Relationships between Entity Sets
Espresso is a system to compute semantically meaningful substructures (so-called relatedness cores) from a knowledge graph. The purpose of the system is to answer questions of the form «Which European politicians are related to politicians in the United States and how?» or «How can one summarize the relationship between China and countries from the Middle East over the last five years?» In this setting, a question is specified by means of two sets of query entities. These sets (e.g. "European politicians" or "United States politicians") can be determined by an initial graph query over a knowledge graph capturing relationships between real-world entities. As a next step, we analyze the (indirect) relationships that connect entities from both sets (e. g. membership in organizations, statements made on TV, etc.), generate an informative and concise result, and finally provide a user-friendly explanation of the answer. As output, we aim to return concise subgraphs corresponding to important event complexes, that connect entities from the two sets and explain their relationships. Espresso provides a user interface for the specification of entity sets, computes informative relatedness cores that summarize the relationship between the query entities, and finally displays a visually appealing visualization of the extracted subgraph to the user. Applications of the proposed system include scenarios that require to provide background information on the current state-of-affairs between real-world entities such as politicians, organizations, and the like, e. g. to a journalist preparing an article involving the entities of interest.
Publications
- Espresso: Explaining Relationships between Entity Sets
Stephan Seufert, Klaus Berberich, Srikanta J. Bedathur, Sarath Kumar Kondreddi, Patrick Ernst, and Gerhard Weikum
Proceedings of the 25th International Conference on Information and Knowledge Management (CIKM 2016),
Indianapolis, IN, United States, October 24-28, 2016. ACM. - Instant Espresso: Interactive Analysis of Relationships in Knowledge Graphs (Demo)
Stephan Seufert, Patrick Ernst, Srikanta J. Bedathur, Sarath Kumar Kondreddi, Klaus Berberich, and Gerhard Weikum
Proceedings of the 25th International World Wide Web Conference (WWW 2016),
Montreal, QC, Canada, April 11-15, 2016. ACM. - Efficient Computation of Relationship-Centrality in Large Entity-Relationship Graphs (poster)
Stephan Seufert, Srikanta J. Bedathur, Johannes Hoffart, Andrey Gubichev, and Klaus Berberisch
Posters and Demonstrations Track of the 12th International Semantic Web Conference (ISWC 2013),
Sydney, NSW, Australia, October 21-25, 2013.
People
- Seufert, Stephan
- Gubichev, Andrey
- Kondreddi, Sarath
- Weikum, Gerhard
Datasets
Name | Description | Fields | Link | Size |
---|---|---|---|---|
Entity | Collection of all entities | id, yagoid, freebaseid, wpid, name, readable yagoid, event (t/f) | Download | 117M |
YAGO Types | Collection of YAGO types | id,name | Download | 4.6M |
Entity-YAGO Type | Association of entity with YAGO type | entity,type | Download | 169M |
Freebase Types | Collection of Freebase types | id,name | Download | 64K |
Entity-Freebase Type | Association of entity with Freebase type | entity,type | Download | 29M |
Links | Links between entities | source,target,MW-similarity,KORE-similarity | Download | 798M |
Relations | Collection of relations | id,name,count | Download | 423 |
Link-Relations | Association of links with relations | source,target,relation | Download | 259M |
Popularity | Entity popularities based on pageviews | entity,pop | Download | 34M |
Views | Pageviews for entities | entity,day,count,Z-score,relative popularity | Download | 32G |
Snippets | Short textual entity descriptions from Wikipedia | entity,snippet | Download | 462M |
ClueWeb12 Counts | Number of entity occurrences in ClueWeb | entity,count | Download | 8.4M |
ClueWeb12 Cooccurrence | Entity-cooccurences in ClueWeb | entity1,entity2,count | Download | 728M |
Disclaimer
Provided files are BZ2-compressed CSV files with header and double quoting.
- The datasets contain material from Wikipedia, which is released under the Creative Commons Attribution-Share-Alike License 3.0.
- The datasets contain data derived from ClueWeb12.
- The datasets contain material from Freebase.
- The datasets contain material from the Wikipedia Pageview project.