Sitemap

A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.

Posts

publications

Musical signal segmentation method based on self-similarity matrices

Published in 67th Conference on Electrical, Electronic, and Computing Engineering ETRAN, 2023

This paper deals with the problem of automated segmentation and structure analysis of musical pieces on the basis of extracted features from audio recordings of the piece. The complexity of musical piece segmentation problems is a consequence of the multiple elements by which the segments differ, and those can be melody, harmony, rhythm, and timbre. Self-similarity matrices (SSM), based on selected features are used as a cornerstone for segmentation. The features used are MFCC (Mel-Frequency Cepstral Coefficients) and chroma features which follow changes in timbre and harmonic structure. A self-similarity matrix segmentation procedure based on straightforward methods for morphological image processing is proposed. The Dynamic Time Warping algorithm (DTW) was used as a tool for evaluating the degree of similarity of the segments. The proposed procedure was applied to 2 pieces of classical music.

M. Ratković, M. Marijan, T. Miljković, M. Bjelić, "Musical signal segmentation method based on self-similarity matrices" 67th Conference on Electrical, Electronic, and Computing Engineering ETRAN.

Download Paper

A rule-based algorithm for automated renal calculus extraction on CT scans

Published in 67th Conference on Electrical, Electronic, and Computing Engineering ETRAN, 2023

Planning the optimal approach for performing percutaneous nephrolithotomy (PCNL), minimally invasive surgery to remove kidney stones, is critical to reducing the risk of surgical complications. This paper presents the 3D Gastro CT Auto tool for automatic three-dimensional (3D) segmentation and visualization of renal structures with calculus using contrast-enhanced computerized tomography (CT) scans. A novel algorithm for 3D segmentation based on histogram analysis of preprocessed CT scans was implemented as an upgrade of a previously developed 3D Gastro CT Extended application with the aim of simplifying its use, reducing study processing times, and bringing the use of a minimally invasive PCNL planning tool closer to clinical practice. The results of using the tool are shown on the CT scan database of eight patients.

M. Marijan, K. Milićević, O. Durutović, A. Filipović, M. Janković, "A rule-based algorithm for automated renal calculus extraction on CT scans" 67th Conference on Electrical, Electronic, and Computing Engineering ETRAN.

Download Paper

Impact of Protein Representations on Drug–Target Affinity Prediction

Published in 5th Belgrade Bioinformatics Conference, 2024

Accurate and rapid prediction of the binding affinity between potential drug candidates and target proteins can significantly hasten the drug discovery and development process. Utilizing artificial intelligence (AI) models to predict drug-target affinity (DTA) is an affordable and efficient strategy for sifting out undesirable molecules and identifying promising drug candidates. This approach allows researchers to focus on the most promising compounds for further in silico and wet lab experiments, thereby streamlining the overall workflow. Advancements in AI research, such as the development and implementation of graph neural networks (GNN) and attention mechanisms, have significantly improved methods for processing small molecules as potential drug candidates. These developments now allow for very efficient and accurate DTA prediction, without the need for extensive protein processing resources. While this progress marks a significant step forward in computational drug discovery, models that heavily rely on efficient molecule processing may still lack the incorporation of highly specific protein information into their algorithms, which could be crucial for further improvement. In this study, we present a comprehensive analysis of the impact of different protein representations on the accuracy of DTA prediction using two datasets, by implementing and modifying AI models that are based on GNNs and large language models (LLM). Motivated by the intuitive resemblance between traditional motif search methods for protein sequence analysis and conventional one-dimensional convolution in AI signal processing, we propose a protein representation model based on transposed convolutional neural network (NN) layers. Preliminary results indicate that such embeddings improve the overall affinity prediction accuracy, compared to similar models from the literature. Additionally, implementing LLMs to generate protein embeddings independently of other NN layers has demonstrated potential to significantly enhance the accuracy of predicting drug-target pairs that have a very low or unmeasurable affinity.

M. Marijan, I. Tanasijević, "Impact of Protein Representations on Drug–Target Affinity Prediction" 5th Belgrade Bioinformatics Conference.

Download Paper

Message Passing Neural Networks for Sound Source Localization

Published in 33rd Telecommunications Forum TELFOR, 2025

Sound source localization (SSL) is a fundamental problem in audio signal processing. Traditional SSL approaches often degrade in challenging acoustic conditions, while more recent deep learning-based methods struggle to generalize to unseen microphone arrays or to fully exploit the relationships between microphones in an array. We propose Graph-RelNet, a graph-based neural network that extends relation networks with residual graph convolutional layers to regress the direction of arrival (DoA) from generalized cross-correlation with phase transform (GCC-PHAT) features. Experiments on the TIMIT speech corpus demonstrate that Graph-RelNet consistently outperforms a regression-based adaptation of GNN-SSL, while also surpassing conventional steered response power (SRP) with delay-and-sum (D&S) beamforming at medium and high signal-to-noise ratios (SNRs), especially in scenarios with few microphones. At 5 microphones and SNR = 30 dB, Graph-RelNet reduces mean absolute DoA error by up to 45%. The model also shows strong generalization to unseen microphone arrays and remains competitive under low-SNR conditions.

M. Marijan, M. Bjelić, "Message Passing Neural Networks for Sound Source Localization" 33rd Telecommunications Forum TELFOR.

Download Paper

Drone-Enabled Non-Invasive Ultrasound Method for Rodent Deterrence

Published in Drones, 2026

Unmanned aerial vehicles open new possibilities for developing technologies that support more sustainable and efficient agriculture. This papproaches often degrade in challenging acoustic conditions, while more recent deep learning-based methods struggle to generalize to unseen microphone arrays or to fully exploit the relationships between microphones in an array. We propose Graph-RelNet, a graph-based neural network that extends relation networks with residual graph convolutional layers to regress the direction of arrival (DoA) from generalized cross-correlation with phase transform (GCC-PHAT) features. Experiments on the TIMIT speech corpus demonstrate that Graph-RelNet consistently outperforms a regression-based adaptation of GNN-SSL, while also surpassing conventional steered response power (SRP) with delay-and-sum (D&S) beamforming at medium and high signal-to-noise ratios (SNRs), especially in scenarios with few microphones. At 5 microphones and SNR = 30 dB, Graph-RelNet reduces mean absolute DoA error by up to 45%. The model also shows strong generalization to unseen microphone arrays and remains competitive under low-SNR conditions.

M. Ratković, V. Kovačević, M. Marijan, M. Kostadinov, T. Miljković, M. Bjelić, "Drone-Enabled Non-Invasive Ultrasound Method for Rodent Deterrence" Drones.

Download Paper

Matija Marijan

Sitemap

Pages

Page Not Found

Hi!

CV

Markdown

Page Archive

Publications

Research

Sitemap

Terms and Privacy Policy

Jupyter notebook markdown generator

Posts

publications

Musical signal segmentation method based on self-similarity matrices

A rule-based algorithm for automated renal calculus extraction on CT scans

Impact of Protein Representations on Drug–Target Affinity Prediction

Message Passing Neural Networks for Sound Source Localization

Drone-Enabled Non-Invasive Ultrasound Method for Rodent Deterrence