Bayesian optimization for state engineering of quantum gases

Authors: Gabriel Müller, V. J. Martínez-Lahuerta, Ivan Sekulic, Sven Burger, Philipp-Immanuel Schneider, Naceur Gaaloul

Abstract: State engineering of quantum objects is a central requirement in most implementations. In the cases where the quantum dynamics can be described by analytical solutions or simple approximation models, optimal state preparation protocols have been theoretically proposed and experimentally realized. For more complex systems, however, such as multi-component quantum gases, simplifying assumptions do not apply anymore and the optimization techniques become computationally impractical. Here, we propose Bayesian optimization based on multi-output Gaussian processes to learn the quantum state's physical properties from only a few simulations. We evaluate its performance on an optimization study case of diabatically transporting a Bose-Einstein condensate while keeping it in its ground state, and show that within only a few hundred executions of the underlying physics simulation, we reach a competitive performance with other protocols. While restricting this benchmarking to well-known approximations for straightforward comparisons, we expect a similar performance when employing more involved models, which are computationally more challenging. This paves the way to efficient state engineering of complex quantum systems.

Submitted 28 April, 2024; originally announced April 2024.

Comments: 12 pages, 5 figures

arXiv:2403.01747 [pdf, other]

doi 10.1145/3627508.3638300

Towards Self-Contained Answers: Entity-Based Answer Rewriting in Conversational Search

Authors: Ivan Sekulić, Krisztian Balog, Fabio Crestani

Abstract: Conversational information-seeking (CIS) is an emerging paradigm for knowledge acquisition and exploratory search. Traditional web search interfaces enable easy exploration of entities, but this is limited in conversational settings due to the limited-bandwidth interface. This paper explores ways to rewrite answers in CIS, so that users can understand them without having to resort to external services or sources. Specifically, we focus on salient entities -- entities that are central to understanding the answer. As our first contribution, we create a dataset of conversations annotated with entities for saliency. Our analysis of the collected data reveals that the majority of answers contain salient entities. As our second contribution, we propose two answer rewriting strategies aimed at improving the overall user experience in CIS. One approach expands answers with inline definitions of salient entities, making the answer self-contained. The other approach complements answers with follow-up questions, offering users the possibility to learn more about specific entities. Results of a crowdsourcing-based study indicate that rewritten answers are clearly preferred over the original ones. We also find that inline definitions tend to be favored over follow-up questions, but this choice is highly subjective, thereby providing a promising future direction for personalization.

Submitted 4 March, 2024; originally announced March 2024.

arXiv:2402.13374 [pdf, other]

Reliable LLM-based User Simulator for Task-Oriented Dialogue Systems

Authors: Ivan Sekulić, Silvia Terragni, Victor Guimarães, Nghia Khau, Bruna Guedes, Modestas Filipavicius, André Ferreira Manso, Roland Mathis

Abstract: In the realm of dialogue systems, user simulation techniques have emerged as a game-changer, redefining the evaluation and enhancement of task-oriented dialogue (TOD) systems. These methods are crucial for replicating real user interactions, enabling applications like synthetic data augmentation, error detection, and robust evaluation. However, existing approaches often rely on rigid rule-based methods or on annotated data. This paper introduces DAUS, a Domain-Aware User Simulator. Leveraging large language models, we fine-tune DAUS on real examples of task-oriented dialogues. Results on two relevant benchmarks showcase significant improvements in terms of user goal fulfillment. Notably, we have observed that fine-tuning enhances the simulator's coherence with user goals, effectively mitigating hallucinations -- a major source of inconsistencies in simulator responses.

Submitted 20 February, 2024; originally announced February 2024.

arXiv:2401.11463 [pdf, ps, other]

Estimating the Usefulness of Clarifying Questions and Answers for Conversational Search

Authors: Ivan Sekulić, Weronika Łajewska, Krisztian Balog, Fabio Crestani

Abstract: While the body of research directed towards constructing and generating clarifying questions in mixed-initiative conversational search systems is vast, research aimed at processing and comprehending users' answers to such questions is scarce. To this end, we present a simple yet effective method for processing answers to clarifying questions, moving away from previous work that simply appends answers to the original query and thus potentially degrades retrieval performance. Specifically, we propose a classifier for assessing the usefulness of the prompted clarifying question and the answer given by the user. Useful questions or answers are further appended to the conversation history and passed to a transformer-based query rewriting module. Results demonstrate significant improvements over strong non-mixed-initiative baselines. Furthermore, the proposed approach mitigates the performance drops when non-useful questions and answers are utilized.

Submitted 21 January, 2024; originally announced January 2024.

Comments: This is the author's version of the work. The definitive version is published in: Proceedings of the 46th European Conference on Information Retrieval (ECIR '24), March 24-28, 2024, Glasgow, Scotland

arXiv:2401.04524 [pdf, other]

Analyzing Coherency in Facet-based Clarification Prompt Generation for Search

Authors: Oleg Litvinov, Ivan Sekulić, Mohammad Aliannejadi, Fabio Crestani

Abstract: Clarifying users' information needs is an essential component of modern search systems. While most of the approaches for constructing clarifying prompts rely on query facets, the impact of the quality of the facets is relatively unexplored. In this work, we concentrate on facet quality through the notion of facet coherency and assess its importance for overall usefulness for clarification in search. We find that existing evaluation procedures do not account for facet coherency, as evidenced by the poor correlation of coherency with automated metrics. Moreover, we propose a coherency classifier and assess the prevalence of incoherent facets in a well-established dataset on clarification. Our findings can serve as motivation for future work on the topic.

Submitted 9 January, 2024; originally announced January 2024.

arXiv:2312.13397 [pdf, ps, other]

doi 10.1088/2632-2153/ad3cb6

Review and experimental benchmarking of machine learning algorithms for efficient optimization of cold atom experiments

Authors: Oliver Anton, Victoria A. Henderson, Elisa Da Ros, Ivan Sekulic, Sven Burger, Philipp-Immanuel Schneider, Markus Krutzik

Abstract: The generation of cold atom clouds is a complex process which involves the optimization of noisy data in high dimensional parameter spaces. Optimization can be challenging both in and especially outside of the lab due to lack of time, expertise, or access for lengthy manual optimization. In recent years, it was demonstrated that machine learning offers a solution since it can optimize high dimensional problems quickly, without knowledge of the experiment itself. In this paper we present results showing the benchmarking of nine different optimization techniques and implementations, alongside their ability to optimize a Rubidium (Rb) cold atom experiment. The investigations are performed on a 3D $^{87}$Rb molasses with 10 and 18 adjustable parameters, respectively, where the atom number obtained by absorption imaging was chosen as the test problem. We further compare the best performing optimizers under different effective noise conditions by reducing the Signal-to-Noise ratio of the images via adapting the atomic vapor pressure in the 2D+ MOT and the detection laser frequency stability.

Submitted 20 December, 2023; originally announced December 2023.

Journal ref: Mach. Learn. Sci. Technol. 5, 025022 (2024)

arXiv:2306.09938 [pdf, other]

GRM: Generative Relevance Modeling Using Relevance-Aware Sample Estimation for Document Retrieval

Authors: Iain Mackie, Ivan Sekulic, Shubham Chatterjee, Jeffrey Dalton, Fabio Crestani

Abstract: Recent studies show that Generative Relevance Feedback (GRF), using text generated by Large Language Models (LLMs), can enhance the effectiveness of query expansion. However, LLMs can generate irrelevant information that harms retrieval effectiveness. To address this, we propose Generative Relevance Modeling (GRM) that uses Relevance-Aware Sample Estimation (RASE) for more accurate weighting of expansion terms. Specifically, we identify similar real documents for each generated document and use a neural re-ranker to estimate their relevance. Experiments on three standard document ranking benchmarks show that GRM improves MAP by 6-9% and R@1k by 2-4%, surpassing previous methods.

Submitted 16 June, 2023; originally announced June 2023.

arXiv:2304.13874 [pdf, other]

doi 10.1145/3539618.3591683

Exploiting Simulated User Feedback for Conversational Search: Ranking, Rewriting, and Beyond

Authors: Paul Owoicho, Ivan Sekulić, Mohammad Aliannejadi, Jeffrey Dalton, Fabio Crestani

Abstract: This research aims to explore various methods for assessing user feedback in mixed-initiative conversational search (CS) systems. While CS systems enjoy profuse advancements across multiple aspects, recent research fails to successfully incorporate feedback from the users. One of the main reasons for that is the lack of system-user conversational interaction data. To this end, we propose a user simulator-based framework for multi-turn interactions with a variety of mixed-initiative CS systems. Specifically, we develop a user simulator, dubbed ConvSim, that, once initialized with an information need description, is capable of providing feedback to a system's responses, as well as answering potential clarifying questions. Our experiments on a wide variety of state-of-the-art passage retrieval and neural re-ranking models show that effective utilization of user feedback can lead to 16% retrieval performance increase in terms of nDCG@3. Moreover, we observe consistent improvements as the number of feedback rounds increases (35% relative improvement in terms of nDCG@3 after three rounds). This points to a research gap in the development of specific feedback processing modules and opens a potential for significant advancements in CS. To support further research in the topic, we release over 30,000 transcripts of system-simulator interactions based on well-established CS datasets.

Submitted 7 May, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

Comments: 11 pages, 2 figures, to be published in SIGIR 2023

ACM Class: H.3.3

arXiv:2208.13977 [pdf, other]

$\mathcal{T}$-matrix method for computation of second-harmonic generation upon optical wave scattering from clusters of arbitrary particles

Authors: Ivan Sekulic, Jian Wei You, Nicolae C. Panoiu

Abstract: We derive the $\mathcal{T}$-matrix formalism tailored for the numerical analysis of second-harmonic (SH) generation from arbitrarily-shaped particles made of centrosymmetric optical materials. First, the transfer matrix of a single particle is computed via the extended boundary condition method, in which the electromagnetic fields both at fundamental frequency and SH are expanded in vector spherical wave functions and the integral formulation is satisfied away from the surface of the scatterer. We allow for the accurate physical description of the SH sources by taking into account both local surface and nonlocal bulk polarization contributions to the nonlinear polarization density source responsible for the generation of the SH signal in a particle. This single-particle formalism is then extended to arbitrary distributions of particles by incorporating into the formalism linear and nonlinear electromagnetic wave scattering from the particles in the cluster. Importantly from a practical point of view, our method can be applied to particles of arbitrary shape made of optical materials characterized by general frequency-dispersion relations, so that it can describe the linear and nonlinear optical response of clusters of metallic, semiconductor, or polaritonic particles, as well as mixtures of such particles. The approach proposed here is faster and more memory-efficient than well-established numerical techniques, especially in the analysis of spheroidal particles, due to the favourable symmetries of spherical wave basis functions used in the wave scattering analysis.

Submitted 30 August, 2022; originally announced August 2022.

Comments: 19 pages, 12 figures

arXiv:2204.08046 [pdf, ps, other]

doi 10.1145/3488560.3498440

Evaluating Mixed-initiative Conversational Search Systems via User Simulation

Authors: Ivan Sekulić, Mohammad Aliannejadi, Fabio Crestani

Abstract: Clarifying the underlying user information need by asking clarifying questions is an important feature of modern conversational search systems. However, evaluation of such systems through answering prompted clarifying questions requires significant human effort, which can be time-consuming and expensive. In this paper, we propose a conversational User Simulator, called USi, for automatic evaluation of such conversational search systems. Given a description of an information need, USi is capable of automatically answering clarifying questions about the topic throughout the search session. Through a set of experiments, including automated natural language generation metrics and crowdsourcing studies, we show that responses generated by USi are both in line with the underlying information need and comparable to human-generated answers. Moreover, we make the first steps towards multi-turn interactions, where a conversational search system asks multiple questions to the (simulated) user with a goal of clarifying the user need. To this end, we expand on currently available datasets for studying clarifying questions, i.e., Qulac and ClariQ, by performing a crowdsourcing-based multi-turn data acquisition. We show that our generative, GPT2-based model is capable of providing accurate and natural answers to unseen clarifying questions in the single-turn setting and discuss capabilities of our model in the multi-turn setting. We provide the code, data, and the pre-trained model to be used for further research on the topic.

Submitted 20 April, 2022; v1 submitted 17 April, 2022; originally announced April 2022.

arXiv:2104.00218 [pdf, other]

Integrating Subgraph-aware Relation and Direction Reasoning for Question Answering

Authors: Xu Wang, Shuai Zhao, Bo Cheng, Jiale Han, Yingting Li, Hao Yang, Ivan Sekulic, Guoshun Nan

Abstract: Question Answering (QA) models over Knowledge Bases (KBs) are capable of providing more precise answers by utilizing relation information among entities. Although effective, most of these models solely rely on fixed relation representations to obtain answers for different question-related KB subgraphs. Hence, the rich structured information of these subgraphs may be overlooked by the relation representation vectors. Meanwhile, the direction information of reasoning, which has been proven effective for the answer prediction on graphs, has not been fully explored in existing work. To address these challenges, we propose a novel neural model, Relation-updated Direction-guided Answer Selector (RDAS), which converts relations in each subgraph to additional nodes to learn structure information. Additionally, we utilize direction information to enhance the reasoning ability. Experimental results show that our model yields substantial improvements on two widely used datasets.

Submitted 31 March, 2021; originally announced April 2021.

Comments: Accepted by ICASSP 2021

arXiv:2102.09266 [pdf, other]

doi 10.1016/j.jqsrt.2021.107643

$T$-matrix method for calculation of second-harmonic generation in clusters of spherical particles

Authors: Ivan Sekulic, Jian Wei You, Nicolae C. Panoiu

Abstract: In this article, we present a $T$-matrix method for numerical computation of second-harmonic generation from clusters of arbitrarily distributed spherical particles made of centrosymmetric optical materials. The electromagnetic fields at the fundamental and second-harmonic (SH) frequencies are expanded in series of vector spherical wave functions, and the single sphere $T$-matrix entries are computed by imposing field boundary conditions at the surface of the particles. Different from previous approaches, we compute the SH fields by taking into account both local surface and nonlocal bulk polarization sources, which allows one to accurately describe the generation of SH in arbitrary clusters of spherical particles. Our numerical method can be used to efficiently analyze clusters of spherical particles made of various optical materials, including metallic, dielectric, semiconductor, and polaritonic materials.

Submitted 18 February, 2021; originally announced February 2021.

Comments: 14 pages, 11 figures

arXiv:2102.04163 [pdf, other]

User Engagement Prediction for Clarification in Search

Authors: Ivan Sekulić, Mohammad Aliannejadi, Fabio Crestani

Abstract: Clarification is increasingly becoming a vital factor in various topics of information retrieval, such as conversational search and modern Web search engines. Prompting the user for clarification in a search session can be very beneficial to the system as the user's explicit feedback helps the system improve retrieval massively. However, it comes with a very high risk of frustrating the user in case the system fails in asking decent clarifying questions. Therefore, it is of great importance to determine when and how to ask for clarification. To this aim, in this work, we model search clarification prediction as a user engagement problem. We assume that the better a clarification is, the higher user engagement with it would be. We propose a Transformer-based model to tackle the task. The comparison with competitive baselines on large-scale real-life clarification engagement data proves the effectiveness of our model. Also, we analyse the effect of all result page elements on the performance and find that, among others, the ranked list of the search engine leads to considerable improvements. Our extensive analysis of task-specific features guides future research.

Submitted 8 February, 2021; originally announced February 2021.

arXiv:2009.09392 [pdf, ps, other]

Longformer for MS MARCO Document Re-ranking Task

Authors: Ivan Sekulić, Amir Soleimani, Mohammad Aliannejadi, Fabio Crestani

Abstract: Two-step document ranking, where the initial retrieval is done by a classical information retrieval method, followed by a neural re-ranking model, is the new standard. The best performance is achieved by using transformer-based models as re-rankers, e.g., BERT. We employ Longformer, a BERT-like model for long documents, on the MS MARCO document re-ranking task. The complete code used for training the model can be found at: https://github.com/isekulic/longformer-marco

Submitted 20 September, 2020; originally announced September 2020.

arXiv:2005.06312 [pdf, other]

Reasoning with Latent Structure Refinement for Document-Level Relation Extraction

Authors: Guoshun Nan, Zhijiang Guo, Ivan Sekulić, Wei Lu

Abstract: Document-level relation extraction requires integrating information within and across multiple sentences of a document and capturing complex interactions between inter-sentence entities. However, effective aggregation of relevant information in the document remains a challenging research question. Existing approaches construct static document-level graphs based on syntactic trees, co-references or heuristics from the unstructured text to model the dependencies. Unlike previous methods that may not be able to capture rich non-local interactions for inference, we propose a novel model that empowers the relational reasoning across sentences by automatically inducing the latent document-level graph. We further develop a refinement strategy, which enables the model to incrementally aggregate relevant information for multi-hop reasoning. Specifically, our model achieves an F1 score of 59.05 on a large-scale document-level dataset (DocRED), significantly improving over the previous results, and also yields new state-of-the-art results on the CDR and GDA datasets. Furthermore, extensive analyses show that the model is able to discover more accurate inter-sentence relations.

Submitted 28 July, 2020; v1 submitted 13 May, 2020; originally announced May 2020.

Comments: Appeared in the proceedings of ACL 2020 (Long paper)

arXiv:2003.07634 [pdf, other]

doi 10.18653/v1/D19-5542

Adapting Deep Learning Methods for Mental Health Prediction on Social Media

Authors: Ivan Sekulić, Michael Strube

Abstract: Mental health poses a significant challenge for an individual's well-being. Text analysis of rich resources, like social media, can contribute to deeper understanding of illnesses and provide means for their early detection. We tackle a challenge of detecting social media users' mental status through deep learning-based models, moving away from traditional approaches to the task. In a binary classification task on predicting if a user suffers from one of nine different disorders, a hierarchical attention network outperforms previously set benchmarks for four of the disorders. Furthermore, we explore the limitations of our model and analyze phrases relevant for classification by inspecting the model's word-level attention weights.

Submitted 17 March, 2020; originally announced March 2020.

Comments: W-NUT at EMNLP 2019

Journal ref: Proceedings of the 5th Workshop on Noisy User-generated Text, 2019, 322-327

arXiv:1811.04655 [pdf, ps, other]

Not Just Depressed: Bipolar Disorder Prediction on Reddit

Authors: Ivan Sekulić, Matej Gjurković, Jan Šnajder

Abstract: Bipolar disorder, an illness characterized by manic and depressive episodes, affects more than 60 million people worldwide. We present a preliminary study on bipolar disorder prediction from user-generated text on Reddit, which relies on users' self-reported labels. Our benchmark classifiers for bipolar disorder prediction outperform the baselines and reach accuracy and F1-scores of above 86%. Feature analysis shows interesting differences in language use between users with bipolar disorders and the control group, including differences in the use of emotion-expressive words.

Submitted 27 March, 2019; v1 submitted 12 November, 2018; originally announced November 2018.

Comments: WASSA at EMNLP 2018

Journal ref: WASSA@EMNLP 2018: 72-78

Showing 1–17 of 17 results for author: Sekulic, I