Personal tools
You are here: Home Courses Distributed Systems Seminar -- Spring 2018

Distributed Systems Seminar -- Spring 2018

last modified Sep 02, 2018 03:38 PM

Wednesday 14.15-16.00 Ülikooli 17 - 220

Organization and requirements


14.02.2018: First Meeting, topics introduction

21.02.2018: Topic choice, consultation on how to proceed

28.02.2018: No meeting

7.03.2018: No meeting 

14.03.2018: Meeting for Research Plan and initial Literature Review presentations 

21.03.2018: (No meeting, Individual work, Meetings with supervisors)

28.03.2018: No meeting, Do the tests and experiments, Start writing the final report, provide the supervisors with an access to the code as well as to the report draft

4.04.2018: Meeting: Intermediate presentations / Technical Demo, First draft of the final report ready for Peer-review

11.04.2018 (No meeting, writing the report) 

18.04.2018 (No meeting, writing the report)

25.04.2018: Last day to share the reports between the reviewers

02.05.2018: Last day to the reviewers to give their feedback

9.05.2018: The day of the final presentation and handing out the final report.




You can see the final reports online at: Previous: Distributed Systems Seminar (Spring 2017)


Seminar supervisors: Prof. Eero Vainikko, Dr Benson Muite, Dr Amnir Hadachi, Artjom Lind

Topic Areas (Concerned persons):

  1. Parallel Scientific Applications and Concurrent Computing (Eero Vainikko, Benson Muite)
  2. Network Applications and Protocols (Artjom Lind)
  3. Applied Computer Vision (Artjom Lind)
  4. Parallel Machine learning algorithms (Artjom Lind, Amnir Hadachi, Benson Muite)
  5. Geographic information systems (GIS) and Intelligent Transportation Systems (ITS) (Amnir Hadachi)
  6. Topics of Dmitri Danilov and Toivo Vajakas are to be confirmed first

LaTeX Resources


LaTeX Site

LaTeX Wikibook


Literature Search Resources


List of possible topics

Possible themes with some suggested materials to start with:


1. Parallel Scientific Applications and Concurrent Computing (eero at


  • Parallel programming environments, languages and practices
    • Parallel profiling tools and best practices
    • Best practices for parallel python programming using numpy and mpi4py
  • Recent hot topics in Distributed Systems development


2. Parallel Scientific Applications and Concurrent Computing (benson punkt muite at ut punkt ee)


  • Scripting and high level language interfaces (eg Java and Python) to the Message Passing Interface and/or to OpenCL
  • Evaluation of Simscale  for simulation in a web browser
  • Information security on distributed systems
  • Build a data visualization solution using pbdR
  • Machine learning for in-situ visualization of solutions to differential equations
  • Evaluate performance of parallel file systems
  • Read the paper on distmesh, and re-implement the algorithms in another language such as Fortran or Python.
  • Experiment with RSVDPACK and compare performance for image compression or another application of your choice. A useful starting point is here
  • Efficient time integration schemes, geometric integration schemes
  • Examine open hardware for parallel computing, such as Nyuzi, Risc-V or OpenSPARC
  • Good parallel software development practices: Examine development of the visualization software ParaView and VisIt
  • Testing and improvement of FortranCL 
  • Financially sustainable open source parallel software development models and practices
  • Testing and improvement of Seedme for distributed computing
  • Testing MPPA accelerators such as Kalray and Parallela
  • Testing FPGAs from Altera and Xilinix
  • Parallel Graph Algorithms, for example GraphBLAS
  • Try out XMT or Habanero

3. Network Applications and Protocols (Artjom Lind)

Covering the topics related to distributed computing in peer-to-peer networks. Here I will focus you on own lab framework, however you can propose different topic (some existing framework with similar features or your own design).

  • Individual topic -> Contact me!


4. Applied Computer Vision (CV) (Artjom Lind)

Mostly the topics related to the application of latest results in CV. In this area we mostly use OpenCV library, which is recommended but not obligatory. The several topics we can focus on:

  • Structure from motion
  • Object detection/classification
  • Object tracking
  • Optical Character Recognition (OCR)
  • Augmented Reality

 More information ...

5. Parallel Machine learning algorithms (Artjom Lind, Amnir Hadachi, Benson Muite)


  • Machine digitization and translation for Estonian and non-latin scripts such as Arabic / Cyrillic / Chinese / Farsi / Hebrew / Hindi / Japanese / Korean
  • Character recognition algorithms


6. Exploratory search (Dimitri Danilov)

  • Collaborative search
  • Search Patterns
  • New Development in Search Engines
  • The Vision of Ted Nelson (the inventor of the internet?)
  • Xanadu (and undanax)
  • Graph Based Information Storage
  • New Search Interfaces in Mozilla (practical and theoretical topics available)
  • Machine learning based topic modeling in text documents (using the program Mallet).
  • More topics on demand


7. Modeling and analyzing semantic trajectories (Amnir Hadachi)

8. Mobility data modelling (Toivo Vajakas)

Fast data structure for trajectory data (further development of existing code)
o    Currently existing code for direct read of 1 individual trajectory
o    Add support for batch full-scan and batch subset.
o    Add support for indexing by time and space – to avoid (with high probability) the analysis of trajectories that do not intersect with time-space volume of given query.
Exploratory data analysis on mobile positioning data, using results of Jilles Vreeken group (JV was a speaker on ESSCASS summer school)
o    Separation of time of each person as „currently in routine“ and „non-typical behavior (tourist mode)“.
o    Describe the data, ie the behavior of radio network combined with behavior of people -- pattern mining based on information theory.
o    Clustering of (relatively small) directed graphs, each edge and vertex has also attributes. (graphs are HMM of humans in timespace „states“ like work and home, after removing absolute location and travel direction info (but keeping relative travel distance info)
o    Clurstering of vertices in directed graphs (states of HMM in previous entry). Graphs are HMM of humans in timespace „states“ like work and home, after removing absolute location and travel direction info (but keeping relative travel distance info). Clustering of states gives something like „here many people spend there night“, „many people come here for work“, ...
Combination of traffic simuluation package and data from mobile positioning data, to get local traffic density estimates.

8A. GPS and INS, Lidar and panorama data coregistration (Toivo Vajakas)

Coregistration refers to any method for realigning images. This particular set of problems stems from Streetview-like application where GPS receiver, inertial sensors, 360degree camera and LIDAR sensor are fixed to top of vehicle. Vehicle drives along streets to collect images, lidar point cloud, location and orientation data. Different data sources must be combined to get 3D landscape model.

* combine GPS and inertial sensor readings to get maximum accuracy estimate of location and orientation at any time moment

* align images for stitching and/or to estimate orientation errors

* align images with lidar data

The student has good chances to become a co-author of a publication later.


9. Mobility data modeling and representation (Amnir Hadachi)


Document Actions
« April 2020 »