WC3(WC-triple):Wikipedia Category Consistency Checker (DBpedia
This program checks the consistency of Wikipedia category information by
using DBpedia information.
WC3 tries to construct the most appropriate SPARQL query that
describes the meaning of the category. (i.e., an ideal constructed
query is one that retrieves all pages that belong to the target
category and no pages for other categories.)
System outputs include:
It is better to check the contents of NotFound and Error for improving
consistency of the category pages.
- The SPARQL query for the given category
- Found: which shows pages satify the constructed sparql query and
that belong to the target category
- NotFound: which list pages that do not satify the constructed
sparql query and belongs to the target category
- Error: which displays pages that satify the constructed sparql
query, but they do not belong to the target category
You can check the example result for analyzing category
"Songs written by Paul McCartney" example result of
The usage of the system is simple.
- Input the name of the category in Category: text box and click the
Check button. When you type the first two characters in the text box,
candidate category names start with input characters are shown as a list.
The system may take several minutes to retrieve the results
- The results are shown at the bottom of the page.
Detailed information of Found, NotFound, Error are displayed by clicking
- If the user is not satisfied with the automatically constructed query,
he/she can modify the sparql query in the SPARQL textbox (you can only
use the simple constraints of "?s") and check "No automatic SPARQL
query generation (Use SPARQL query below) box before clicking the
Results of the constructed query may vary because the system uses
random sampling for sibling categories of the query category.
- For each page, you can check the Wikipedia and/or DBpedia information
by using the Wikipedia or DBpedia link in the table.
- For any new searches, please ensure to clear both the Category:
and SPARQL textboxes by clicking "Clear SPARQL".
NotFound information shows the reason why specific category pages are not
retrieved by the query.
Error information shows a list of pages that are retrieved by the query
and do not belongs to the category. The section also includes common
attribute information related to the error page. That may be used for
excluding those pages from the retrieved results. First number shows
number of page that shares the attribute for error page and second
number shows for pages that belongs to the target category page.
Typical Analysis Cases