Assignment 2

Student presentations will take place on Friday, Feb 8, Pinni B0016, 14.00-16.00, and Thursday, Feb 14, Pinni B0016, 14.00-16.00.

For the second assignment, we will work in 6 groups.

The reading material for this assignment is the following (please read it carefully):

Each group is associated with a sub-assignment.

The presenters (P) of each group will each prepare a presentation of about 25 minutes that answers the questions associated with the group's sub-assignment. Presenters will work independently of each other, and each presenter will prepare his/her own presentation.
After each presentation, the reviewers (R) of the corresponding group will explain their own point of view on the sub-assignment, e.g., they can add more details and explanations to the presented material, explain whether they understood some concepts of the original paper differently, or even offer their vision for alternative ideas or solutions. Answers of the form “I do not actually have anything to add” will not be well received. At the very least, you can propose your own preliminary solution to the problem under discussion.

Sub-assignment 1:
Group 1 will focus mainly on [Paper 1]. Specifically, the target here is to discuss the quality and performance of the evaluated blocking methods, as well as the results of blocking when different kinds of links, other than owl:sameAs, are used as ground truth (Section 5). Please also briefly discuss in your presentation the datasets and measures used (Section 4) and the lessons learned from your analysis.

Sub-assignment 2:
Group 2 will focus mainly on [Paper 2]. Specifically, the target here is to discuss a way of organizing blocking methods into a taxonomy. Following this taxonomy, the presentation will focus on lazy blocking methods, block-refinement methods, comparison-refinement methods and proactive blocking methods (Section 4).

Sub-assignment 3:
Group 3 will focus mainly on [Paper 2]. Specifically, the target here is to discuss the effectiveness, efficiency and scalability of the evaluated blocking methods (Sections 6.2, 6.3 and 6.4). Please also briefly discuss in your presentation the datasets used and how the parameters are tuned (Section 5).

Sub-assignment 4:
Group 4 will focus mainly on [Paper 3]. Specifically, the target here is to discuss two methods, namely graph partitioning and redundancy pruning, that lead to scalable Meta-blocking (Sections 4.1 and 4.2).

Sub-assignment 5:
Group 5 will focus mainly on [Paper 3]. Specifically, the target here is to discuss two methods, namely reciprocal pruning and block filtering, that lead to scalable Meta-blocking (Sections 4.3 and 4.4).

Sub-assignment 6:
Group 6 will focus mainly on [Paper 4]. Specifically, the target here is to discuss the multi-core techniques for parallelizing the pruning algorithms of Meta-blocking (Section 4).

Please send your presentations to konstantinos.stefanidis@tuni.fi before 10.00 am on Friday, February 8.

Groups and group members are as follows (P stands for a presenter, R stands for a reviewer):

Group 1
P – Gong Jin
P – Bagale Krishna
R – Jari Haapaniemi
R – Nayan Subba

Group 2
P – Thiago Braguim Neves
P – Guo Bujia
R – Maria Stratigi
R – Lea Kahkonen
R – Jimi Laakso

Group 3
P – Jari Haapaniemi
P – Samuli Lumirae
R – Jakub Hruska
R – Heidi Mikkola

Group 4
P – Thi Nguyen
P – Heidi Mikkola
R – Bagale Krishna
R – Gong Jin

Group 5
P – Jakub Hruska
P – Nayan Subba
P – Jimi Laakso
R – Thiago Braguim Neves
R – Guo Bujia

Group 6
P – Maria Stratigi
P – Lea Kahkonen
R – Thi Nguyen
R – Samuli Lumirae

Each student will work independently of the others. Please send an email to konstantinos.stefanidis@tuni.fi if you cannot find your name in the group assignments above.