Skip to main content

Showing 1–50 of 53 results for author: Bi, K

  1. arXiv:2409.14744  [pdf, other

    cs.CL

    LINKAGE: Listwise Ranking among Varied-Quality References for Non-Factoid QA Evaluation via LLMs

    Authors: Sihui Yang, Keping Bi, Wanqing Cui, Jiafeng Guo, Xueqi Cheng

    Abstract: Non-Factoid (NF) Question Answering (QA) is challenging to evaluate due to diverse potential answers and no objective criterion. The commonly used automatic evaluation metrics like ROUGE or BERTScore cannot accurately measure semantic similarities or answers from different perspectives. Recently, Large Language Models (LLMs) have been resorted to for NFQA evaluation due to their compelling perform… ▽ More

    Submitted 30 September, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

    Comments: Published as a conference paper at EMNLP Findings 2024

  2. arXiv:2408.09817  [pdf, other

    cs.IR cs.AI

    Contextual Dual Learning Algorithm with Listwise Distillation for Unbiased Learning to Rank

    Authors: Lulu Yu, Keping Bi, Shiyu Ni, Jiafeng Guo

    Abstract: Unbiased Learning to Rank (ULTR) aims to leverage biased implicit user feedback (e.g., click) to optimize an unbiased ranking model. The effectiveness of the existing ULTR methods has primarily been validated on synthetic datasets. However, their performance on real-world click data remains unclear. Recently, Baidu released a large publicly available dataset of their web search logs. Subsequently,… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 12 pages, 2 figures

  3. arXiv:2408.09773  [pdf, other

    cs.CL

    Are Large Language Models More Honest in Their Probabilistic or Verbalized Confidence?

    Authors: Shiyu Ni, Keping Bi, Lulu Yu, Jiafeng Guo

    Abstract: Large language models (LLMs) have been found to produce hallucinations when the question exceeds their internal knowledge boundaries. A reliable model should have a clear perception of its knowledge boundaries, providing correct answers within its scope and refusing to answer when it lacks knowledge. Existing research on LLMs' perception of their knowledge boundaries typically uses either the prob… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  4. arXiv:2408.09388  [pdf

    cond-mat.supr-con cond-mat.str-el

    Crystal growth and characterization of Fe$_{1+δ}$Se$_{1-x}$Te$_x$ (0.5 $\leq$ $x$ $\leq$ 1) from LiCl/KCl flux

    Authors: Qiaoyu Wang, Kexin Bi, Lewei Chen, Yunqing Shi, Junkun Yi, Yadong Gu, Menghu Zhou, Binbin Ruan, Xingye Lu, Mingwei Ma, Genfu Chen, Zhian Ren

    Abstract: An eutectic LiCl/KCl flux method in a horizontal configuration has been used to grow a series of homogeneous Fe$_{1+δ}$Se$_{1-x}$Te$_x$ single crystals of high quality with 0.5 $\leq$ $x$ $\leq$ 1. Compared with previously used melt-growth method, the stable crystallization process in LiCl/KCl flux below their peritectic temperatures results in better homogeneity and crystalline perfection identif… ▽ More

    Submitted 18 August, 2024; originally announced August 2024.

  5. arXiv:2406.14023  [pdf, other

    cs.CL cs.AI

    Evaluating Implicit Bias in Large Language Models by Attacking From a Psychometric Perspective

    Authors: Yuchen Wen, Keping Bi, Wei Chen, Jiafeng Guo, Xueqi Cheng

    Abstract: As Large Language Models (LLMs) become an important way of information seeking, there have been increasing concerns about the unethical content LLMs may generate. In this paper, we conduct a rigorous evaluation of LLMs' implicit bias towards certain groups by attacking them with carefully crafted instructions to elicit biased responses. Our attack methodology is inspired by psychometric principles… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Code and datasets are available at https://github.com/wen112358/ImplicitBiasPsychometricEvaluation

  6. arXiv:2406.11290  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    Iterative Utility Judgment Framework via LLMs Inspired by Relevance in Philosophy

    Authors: Hengran Zhang, Keping Bi, Jiafeng Guo, Xueqi Cheng

    Abstract: Utility and topical relevance are critical measures in information retrieval (IR), reflecting system and user perspectives, respectively. While topical relevance has long been emphasized, utility is a higher standard of relevance and is more useful for facilitating downstream tasks, e.g., in Retrieval-Augmented Generation (RAG). When we incorporate utility judgments into RAG, we realize that the t… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 22 pages

  7. arXiv:2402.13625  [pdf, other

    cs.CL

    MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning

    Authors: Wanqing Cui, Keping Bi, Jiafeng Guo, Xueqi Cheng

    Abstract: Since commonsense information has been recorded significantly less frequently than its existence, language models pre-trained by text generation have difficulty to learn sufficient commonsense knowledge. Several studies have leveraged text retrieval to augment the models' commonsense ability. Unlike text, images capture commonsense information inherently but little effort has been paid to effectiv… ▽ More

    Submitted 13 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Published as a conference paper at ACL Findings 2024

  8. arXiv:2402.11457  [pdf, other

    cs.CL

    When Do LLMs Need Retrieval Augmentation? Mitigating LLMs' Overconfidence Helps Retrieval Augmentation

    Authors: Shiyu Ni, Keping Bi, Jiafeng Guo, Xueqi Cheng

    Abstract: Large Language Models (LLMs) have been found to have difficulty knowing they do not possess certain knowledge and tend to provide specious answers in such cases. Retrieval Augmentation (RA) has been extensively studied to mitigate LLMs' hallucinations. However, due to the extra overhead and unassured quality of retrieval, it may not be optimal to conduct RA all the time. A straightforward idea is… ▽ More

    Submitted 11 June, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

    Journal ref: Findings of ACL2024

  9. arXiv:2401.03648  [pdf, other

    cs.IR

    Reproducibility Analysis and Enhancements for Multi-Aspect Dense Retriever with Aspect Learning

    Authors: Keping Bi, Xiaojie Sun, Jiafeng Guo, Xueqi Cheng

    Abstract: Multi-aspect dense retrieval aims to incorporate aspect information (e.g., brand and category) into dual encoders to facilitate relevance matching. As an early and representative multi-aspect dense retriever, MADRAL learns several extra aspect embeddings and fuses the explicit aspects with an implicit aspect "OTHER" for final representation. MADRAL was evaluated on proprietary data and its code wa… ▽ More

    Submitted 16 January, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

    Comments: accepted by ecir2024 as a reproducibility paper

  10. arXiv:2312.02538  [pdf, other

    cs.IR

    A Multi-Granularity-Aware Aspect Learning Model for Multi-Aspect Dense Retrieval

    Authors: Xiaojie Sun, Keping Bi, Jiafeng Guo, Sihui Yang, Qishen Zhang, Zhongyi Liu, Guannan Zhang, Xueqi Cheng

    Abstract: Dense retrieval methods have been mostly focused on unstructured text and less attention has been drawn to structured data with various aspects, e.g., products with aspects such as category and brand. Recent work has proposed two approaches to incorporate the aspect information into item representations for effective retrieval by predicting the values associated with the item aspects. Despite thei… ▽ More

    Submitted 16 January, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Accepted by WSDM2024, update

  11. arXiv:2311.04244  [pdf

    cs.LG cs.AI

    HKTGNN: Hierarchical Knowledge Transferable Graph Neural Network-based Supply Chain Risk Assessment

    Authors: Zhanting Zhou, Kejun Bi, Yuyanzhen Zhong, Chao Tang, Dongfen Li, Shi Ying, Ruijin Wang

    Abstract: The strength of a supply chain is an important measure of a country's or region's technical advancement and overall competitiveness. Establishing supply chain risk assessment models for effective management and mitigation of potential risks has become increasingly crucial. As the number of businesses grows, the important relationships become more complicated and difficult to measure. This emphasiz… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 11pages, 3 figures, accepted and submitted by IEEE ISPA 2023(The 21st IEEE International Symposium on Parallel and Distributed Processing with Applications)

    ACM Class: I.2.4

  12. arXiv:2311.02834  [pdf, other

    cs.IR

    CAME: Competitively Learning a Mixture-of-Experts Model for First-stage Retrieval

    Authors: Yinqiong Cai, Yixing Fan, Keping Bi, Jiafeng Guo, Wei Chen, Ruqing Zhang, Xueqi Cheng

    Abstract: The first-stage retrieval aims to retrieve a subset of candidate documents from a huge collection both effectively and efficiently. Since various matching patterns can exist between queries and relevant documents, previous work tries to combine multiple retrieval models to find as many relevant results as possible. The constructed ensembles, whether learned independently or jointly, do not care wh… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

  13. arXiv:2310.20251  [pdf, other

    cs.MM

    An Implementation of Multimodal Fusion System for Intelligent Digital Human Generation

    Authors: Yingjie Zhou, Yaodong Chen, Kaiyue Bi, Lian Xiong, Hui Liu

    Abstract: With the rapid development of artificial intelligence (AI), digital humans have attracted more and more attention and are expected to achieve a wide range of applications in several industries. Then, most of the existing digital humans still rely on manual modeling by designers, which is a cumbersome process and has a long development cycle. Therefore, facing the rise of digital humans, there is a… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  14. arXiv:2310.11852  [pdf, other

    cs.IR

    CIR at the NTCIR-17 ULTRE-2 Task

    Authors: Lulu Yu, Keping Bi, Jiafeng Guo, Xueqi Cheng

    Abstract: The Chinese academy of sciences Information Retrieval team (CIR) has participated in the NTCIR-17 ULTRE-2 task. This paper describes our approaches and reports our results on the ULTRE-2 task. We recognize the issue of false negatives in the Baidu search data in this competition is very severe, much more severe than position bias. Hence, we adopt the Dual Learning Algorithm (DLA) to address the po… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 5 pages, 1 figure, NTCIR-17

  15. arXiv:2310.00703  [pdf, other

    cs.IR

    A Comparative Study of Training Objectives for Clarification Facet Generation

    Authors: Shiyu Ni, Keping Bi, Jiafeng Guo, Xueqi Cheng

    Abstract: Due to the ambiguity and vagueness of a user query, it is essential to identify the query facets for the clarification of user intents. Existing work on query facet generation has achieved compelling performance by sequentially predicting the next facet given previously generated facets based on pre-trained language generation models such as BART. Given a query, there are mainly two types of train… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

  16. arXiv:2308.11512  [pdf, other

    cs.IR

    L^2R: Lifelong Learning for First-stage Retrieval with Backward-Compatible Representations

    Authors: Yinqiong Cai, Keping Bi, Yixing Fan, Jiafeng Guo, Wei Chen, Xueqi Cheng

    Abstract: First-stage retrieval is a critical task that aims to retrieve relevant document candidates from a large-scale collection. While existing retrieval models have achieved impressive performance, they are mostly studied on static data sets, ignoring that in the real-world, the data on the Web is continuously growing with potential distribution drift. Consequently, retrievers trained on static old dat… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: accepted by CIKM2023

  17. arXiv:2308.11474  [pdf, other

    cs.IR

    Pre-training with Aspect-Content Text Mutual Prediction for Multi-Aspect Dense Retrieval

    Authors: Xiaojie Sun, Keping Bi, Jiafeng Guo, Xinyu Ma, Fan Yixing, Hongyu Shan, Qishen Zhang, Zhongyi Liu

    Abstract: Grounded on pre-trained language models (PLMs), dense retrieval has been studied extensively on plain text. In contrast, there has been little research on retrieving data with multiple aspects using dense models. In the scenarios such as product search, the aspect information plays an essential role in relevance matching, e.g., category: Electronics, Computers, and Pet Supplies. A common way of le… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: accepted by cikm2023

  18. arXiv:2306.08641  [pdf, other

    cs.CV cs.AI

    Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models

    Authors: Lingxi Xie, Longhui Wei, Xiaopeng Zhang, Kaifeng Bi, Xiaotao Gu, Jianlong Chang, Qi Tian

    Abstract: The AI community has been pursuing algorithms known as artificial general intelligence (AGI) that apply to any kind of real-world problem. Recently, chat systems powered by large language models (LLMs) emerge and rapidly become a promising direction to achieve AGI in natural language processing (NLP), but the path towards AGI in computer vision (CV) remains unclear. One may owe the dilemma to the… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: 17 pages, 14 figures, technical report, expected to be updated in the near future

  19. arXiv:2304.11414  [pdf, other

    cs.DC cs.LG

    Pipeline MoE: A Flexible MoE Implementation with Pipeline Parallelism

    Authors: Xin Chen, Hengheng Zhang, Xiaotao Gu, Kaifeng Bi, Lingxi Xie, Qi Tian

    Abstract: The Mixture of Experts (MoE) model becomes an important choice of large language models nowadays because of its scalability with sublinear computational complexity for training and inference. However, existing MoE models suffer from two critical drawbacks, 1) tremendous inner-node and inter-node communication overhead introduced by all-to-all dispatching and gathering, and 2) limited scalability f… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.

    Comments: A novel framework for MoE models. Work in progress

  20. arXiv:2302.09340  [pdf, other

    cs.IR

    Ensemble Ranking Model with Multiple Pretraining Strategies for Web Search

    Authors: Xiaojie Sun, Lulu Yu, Yiting Wang, Keping Bi, Jiafeng Guo

    Abstract: An effective ranking model usually requires a large amount of training data to learn the relevance between documents and queries. User clicks are often used as training data since they can indicate relevance and are cheap to collect, but they contain substantial bias and noise. There has been some work on mitigating various types of bias in simulated user clicks to train effective learning-to-rank… ▽ More

    Submitted 18 February, 2023; originally announced February 2023.

    Comments: 4 pages, 2 figures, WSDM Cup 2023

  21. arXiv:2302.07530  [pdf, other

    cs.IR

    Feature-Enhanced Network with Hybrid Debiasing Strategies for Unbiased Learning to Rank

    Authors: Lulu Yu, Yiting Wang, Xiaojie Sun, Keping Bi, Jiafeng Guo

    Abstract: Unbiased learning to rank (ULTR) aims to mitigate various biases existing in user clicks, such as position bias, trust bias, presentation bias, and learn an effective ranker. In this paper, we introduce our winning approach for the "Unbiased Learning to Rank" task in WSDM Cup 2023. We find that the provided data is severely biased so neural models trained directly with the top 10 results with clic… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: 5 pages, 1 figure, WSDM Cup 2023

  22. arXiv:2211.02556  [pdf, other

    physics.ao-ph cs.AI cs.CV cs.LG

    Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather Forecast

    Authors: Kaifeng Bi, Lingxi Xie, Hengheng Zhang, Xin Chen, Xiaotao Gu, Qi Tian

    Abstract: In this paper, we present Pangu-Weather, a deep learning based system for fast and accurate global weather forecast. For this purpose, we establish a data-driven environment by downloading $43$ years of hourly global weather data from the 5th generation of ECMWF reanalysis (ERA5) data and train a few deep neural networks with about $256$ million parameters in total. The spatial resolution of forec… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: 19 pages, 13 figures: the first ever AI-based method that outperforms traditional numerical weather prediction methods

  23. arXiv:2203.05220  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Quantifying the Dzyaloshinskii-Moriya Interaction Induced by the Bulk Magnetic Asymmetry

    Authors: Qihan Zhang, Jinghua Liang, Kaiqi Bi, Le Zhao, He Bai, Qirui Cui, Heng-An Zhou, Hao Bai, Hongmei Feng, Wenjie Song, Guozhi Chai, O. Gladii, H. Schultheiss, Tao Zhu, Junwei Zhang, Yong Peng, Hongxin Yang, Wanjun Jiang

    Abstract: A broken interfacial inversion symmetry in ultrathin ferromagnet/heavy metal (FM/HM) bilayers is generally believed to be a prerequisite for accommodating the Dzyaloshinskii-Moriya interaction (DMI) and for stabilizing chiral spin textures. In these bilayers, the strength of the DMI decays as the thickness of the FM layer increases and vanishes around a few nanometers. In the present study, throug… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: Physical Review Letters, in press

  24. arXiv:2201.02702  [pdf

    math.DS cs.LG math.OC stat.AP stat.ME

    An Improved Mathematical Model of Sepsis: Modeling, Bifurcation Analysis, and Optimal Control Study for Complex Nonlinear Infectious Disease System

    Authors: Yuyang Chen, Kaiming Bi, Chih-Hang J. Wu, David Ben-Arieh, Ashesh Sinha

    Abstract: Sepsis is a life-threatening medical emergency, which is a major cause of death worldwide and the second highest cause of mortality in the United States. Researching the optimal control treatment or intervention strategy on the comprehensive sepsis system is key in reducing mortality. For this purpose, first, this paper improves a complex nonlinear sepsis model proposed in our previous work. Then,… ▽ More

    Submitted 7 January, 2022; originally announced January 2022.

    Comments: 25 pages, 7 figures, 1 table

  25. arXiv:2201.00147  [pdf

    cs.LG math.OC stat.AP stat.ME

    High-dimensional Bayesian Optimization Algorithm with Recurrent Neural Network for Disease Control Models in Time Series

    Authors: Yuyang Chen, Kaiming Bi, Chih-Hang J. Wu, David Ben-Arieh, Ashesh Sinha

    Abstract: Bayesian Optimization algorithm has become a promising approach for nonlinear global optimization problems and many machine learning applications. Over the past few years, improvements and enhancements have been brought forward and they have shown some promising results in solving the complex dynamic problems, systems of ordinary differential equations where the objective functions are computation… ▽ More

    Submitted 1 January, 2022; originally announced January 2022.

    Comments: 16 pages, 9 figures, 2 tables

  26. arXiv:2112.14969  [pdf

    physics.ins-det physics.app-ph

    Ultra-wideband electrostrictive mechanical antenna

    Authors: Jianchun Xu, Zhao Li, Xuchao Pan, Xi Wen, Jinqing Cao, Wen Gong, Shaolong Yang, Ming Lei, Fangzhou Yao, Ke Bi

    Abstract: Conventional mechanical antennas provide a strategy in long-wave communication with a surprisingly compact size below 1/1,000 of the wavelength. However, the narrow bandwidth and weak field intensity seriously hamper its practical applications. Here, we present a mechanical antenna based on the electrostrictive effect of PMN-PT-based relaxor ferroelectric ceramic to improve radiation capacity and… ▽ More

    Submitted 30 December, 2021; originally announced December 2021.

    Comments: 28 pages, 10 figures,

  27. arXiv:2110.12313  [pdf, other

    eess.SP

    High-Sensitivity Electric Potential Sensors for Non-Contact Monitoring of Physiological Signals

    Authors: Xinyao Tang, Wangbo Chen, Soumyajit Mandal, Kevin Bi, Tayfun Ozdemir

    Abstract: The paper describes highly-sensitive passive electric potential sensors (EPS) for non-contact detection of multiple biophysical signals, including electrocardiogram (ECG), respiration cycle (RC), and electroencephalogram (EEG). The proposed EPS uses an optimized transimpedance amplifier (TIA), a single guarded sensing electrode, and an adaptive cancellation loop (ACL) to maximize sensitivity (DC t… ▽ More

    Submitted 23 October, 2021; originally announced October 2021.

    Comments: Submitted to the IEEE Transactions on Instrumentation and Measurement

  28. arXiv:2108.02289  [pdf

    cs.LG math.OC stat.AP stat.ME

    High dimensional Bayesian Optimization Algorithm for Complex System in Time Series

    Authors: Yuyang Chen, Kaiming Bi, Chih-Hang J. Wu, David Ben-Arieh, Ashesh Sinha

    Abstract: At present, high-dimensional global optimization problems with time-series models have received much attention from engineering fields. Since it was proposed, Bayesian optimization has quickly become a popular and promising approach for solving global optimization problems. However, the standard Bayesian optimization algorithm is insufficient to solving the global optimal solution when the model i… ▽ More

    Submitted 4 August, 2021; originally announced August 2021.

    Comments: 18 pages, 13 figures

  29. arXiv:2108.00062  [pdf

    stat.ME math.OC stat.AP

    A New Bayesian Optimization Algorithm for Complex High-Dimensional Disease Epidemic Systems

    Authors: Yuyang Chen, Kaiming Bi, Chih-Hang J. Wu, David Ben-Arieh, Ashesh Sinha

    Abstract: This paper presents an Improved Bayesian Optimization (IBO) algorithm to solve complex high-dimensional epidemic models' optimal control solution. Evaluating the total objective function value for disease control models with hundreds of thousands of control time periods is a high computational cost. In this paper, we improve the conventional Bayesian Optimization (BO) approach from two parts. The… ▽ More

    Submitted 30 July, 2021; originally announced August 2021.

    Comments: 17 pages, 14 figures

  30. Asking Clarifying Questions Based on Negative Feedback in Conversational Search

    Authors: Keping Bi, Qingyao Ai, W. Bruce Croft

    Abstract: Users often need to look through multiple search result pages or reformulate queries when they have complex information-seeking needs. Conversational search systems make it possible to improve user satisfaction by asking questions to clarify users' search intents. This, however, can take significant effort to answer a series of questions starting with "what/why/how". To quickly identify user inten… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

    Comments: In the proceedings of ICTIR'21

  31. Leveraging User Behavior History for Personalized Email Search

    Authors: Keping Bi, Pavel Metrikov, Chunyuan Li, Byungki Byun

    Abstract: An effective email search engine can facilitate users' search tasks and improve their communication efficiency. Users could have varied preferences on various ranking signals of an email, such as relevance and recency based on their tasks at hand and even their jobs. Thus a uniform matching pattern is not optimal for all users. Instead, an effective email ranker should conduct personalized ranking… ▽ More

    Submitted 17 March, 2021; v1 submitted 14 February, 2021; originally announced February 2021.

    Comments: In proceedings of the Web Conference 2021

  32. arXiv:2008.01475  [pdf, other

    cs.CV cs.LG

    Weight-Sharing Neural Architecture Search: A Battle to Shrink the Optimization Gap

    Authors: Lingxi Xie, Xin Chen, Kaifeng Bi, Longhui Wei, Yuhui Xu, Zhengsu Chen, Lanfei Wang, An Xiao, Jianlong Chang, Xiaopeng Zhang, Qi Tian

    Abstract: Neural architecture search (NAS) has attracted increasing attentions in both academia and industry. In the early age, researchers mostly applied individual search methods which sample and evaluate the candidate architectures separately and thus incur heavy computational overheads. To alleviate the burden, weight-sharing methods were proposed in which exponentially many architectures share weights… ▽ More

    Submitted 4 August, 2020; v1 submitted 4 August, 2020; originally announced August 2020.

    Comments: 24 pages, 3 figures, 2 tables, meta data updated

  33. arXiv:2007.03331  [pdf, other

    cs.CV cs.LG cs.NE

    GOLD-NAS: Gradual, One-Level, Differentiable

    Authors: Kaifeng Bi, Lingxi Xie, Xin Chen, Longhui Wei, Qi Tian

    Abstract: There has been a large literature of neural architecture search, but most existing work made use of heuristic rules that largely constrained the search flexibility. In this paper, we first relax these manually designed constraints and enlarge the search space to contain more than $10^{160}$ candidates. In the new space, most existing differentiable search methods can fail dramatically. We then pro… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: 14 pages, 5 figures

  34. A Transformer-based Embedding Model for Personalized Product Search

    Authors: Keping Bi, Qingyao Ai, W. Bruce Croft

    Abstract: Product search is an important way for people to browse and purchase items on E-commerce platforms. While customers tend to make choices based on their personal tastes and preferences, analysis of commercial product search logs has shown that personalization does not always improve product search quality. Most existing product search techniques, however, conduct undifferentiated personalization ac… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: In the proceedings of SIGIR 2020

    ACM Class: H.3.3

  35. arXiv:2005.02146  [pdf, other

    cs.CL

    Artemis: A Novel Annotation Methodology for Indicative Single Document Summarization

    Authors: Rahul Jha, Keping Bi, Yang Li, Mahdi Pakdaman, Asli Celikyilmaz, Ivan Zhiboedov, Kieran McDonald

    Abstract: We describe Artemis (Annotation methodology for Rich, Tractable, Extractive, Multi-domain, Indicative Summarization), a novel hierarchical annotation process that produces indicative summaries for documents from multiple domains. Current summarization evaluation datasets are single-domain and focused on a few domains for which naturally occurring summaries can be easily found, such as news and sci… ▽ More

    Submitted 13 May, 2020; v1 submitted 5 May, 2020; originally announced May 2020.

  36. Learning a Fine-Grained Review-based Transformer Model for Personalized Product Search

    Authors: Keping Bi, Qingyao Ai, W. Bruce Croft

    Abstract: Product search has been a crucial entry point to serve people shopping online. Most existing personalized product models follow the paradigm of representing and matching user intents and items in the semantic space, where finer-grained matching is totally discarded and the ranking of an item cannot be explained further than just user/item level similarity. In addition, while some models in existin… ▽ More

    Submitted 3 June, 2021; v1 submitted 20 April, 2020; originally announced April 2020.

    Comments: To appear in SIGIR'2021

    MSC Class: 68T07; 68P20 ACM Class: H.3.3

  37. arXiv:2004.06176  [pdf, other

    cs.CL

    AREDSUM: Adaptive Redundancy-Aware Iterative Sentence Ranking for Extractive Document Summarization

    Authors: Keping Bi, Rahul Jha, W. Bruce Croft, Asli Celikyilmaz

    Abstract: Redundancy-aware extractive summarization systems score the redundancy of the sentences to be included in a summary either jointly with their salience information or separately as an additional sentence scoring step. Previous work shows the efficacy of jointly scoring and selecting sentences with neural sequence generation models. It is, however, not well-understood if the gain is due to better en… ▽ More

    Submitted 2 April, 2021; v1 submitted 13 April, 2020; originally announced April 2020.

    Comments: In proceedings of EACL'2021

  38. arXiv:1912.05804  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci physics.app-ph

    Topological Lattice Metamaterials -- A Platform For Novel Electromagnetic Material Design Based On An Artificial Topological "Atom"

    Authors: Wenjin Zhang, Ziyuan Meng, Zidong Zhang, Ke Bi, Runhua Fan, Yi Du, Weichang Hao

    Abstract: In nature, most materials are composed of atoms with periodic structures. Hence, it's impossible to introduce topological structures into their lattice compose, because the atoms as basic blocks cannot be modulated. However, the lattice compose of metamaterials can be designed conveniently. In our work, we propose to introduce topological non-trivial structures, Mobius unknots, as the basic block… ▽ More

    Submitted 4 October, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

    Comments: 9 pages, 5 figures

  39. arXiv:1910.11831  [pdf, other

    cs.LG cs.CV stat.ML

    Stabilizing DARTS with Amended Gradient Estimation on Architectural Parameters

    Authors: Kaifeng Bi, Changping Hu, Lingxi Xie, Xin Chen, Longhui Wei, Qi Tian

    Abstract: DARTS is a popular algorithm for neural architecture search (NAS). Despite its great advantage in search efficiency, DARTS often suffers weak stability, which reflects in the large variation among individual trials as well as the sensitivity to the hyper-parameters of the search process. This paper owes such instability to an optimization gap between the super-network and its sub-networks, namely,… ▽ More

    Submitted 4 May, 2020; v1 submitted 25 October, 2019; originally announced October 2019.

    Comments: 22 pages, 12 figures, submitted to ICML 2020, updated experiments on Penn Treebank

  40. arXiv:1909.07212  [pdf, other

    cs.IR

    Explainable Product Search with a Dynamic Relation Embedding Model

    Authors: Qingyao Ai, Yongfeng Zhang, Keping Bi, W. Bruce Croft

    Abstract: Product search is one of the most popular methods for customers to discover products online. Most existing studies on product search focus on developing effective retrieval models that rank items by their likelihood to be purchased. They, however, ignore the problem that there is a gap between how systems and customers perceive the relevance of items. Without explanations, users may not understand… ▽ More

    Submitted 16 September, 2019; originally announced September 2019.

  41. A Study of Context Dependencies in Multi-page Product Search

    Authors: Keping Bi, Choon Hui Teo, Yesh Dattatreya, Vijai Mohan, W. Bruce Croft

    Abstract: In product search, users tend to browse results on multiple search result pages (SERPs) (e.g., for queries on clothing and shoes) before deciding which item to purchase. Users' clicks can be considered as implicit feedback which indicates their preferences and used to re-rank subsequent SERPs. Relevance feedback (RF) techniques are usually involved to deal with such scenarios. However, these metho… ▽ More

    Submitted 9 January, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: Accepted by CIKM 2019. arXiv admin note: substantial text overlap with arXiv:1909.02065

  42. Conversational Product Search Based on Negative Feedback

    Authors: Keping Bi, Qingyao Ai, Yongfeng Zhang, W. Bruce Croft

    Abstract: Intelligent assistants change the way people interact with computers and make it possible for people to search for products through conversations when they have purchase needs. During the interactions, the system could ask questions on certain aspects of the ideal products to clarify the users' needs. For example, previous work proposed to ask users the exact characteristics of their ideal items b… ▽ More

    Submitted 4 September, 2019; originally announced September 2019.

    Comments: Accepted as a long paper in CIKM 2019

  43. arXiv:1909.02065  [pdf, other

    cs.IR

    Leverage Implicit Feedback for Context-aware Product Search

    Authors: Keping Bi, Choon Hui Teo, Yesh Dattatreya, Vijai Mohan, W. Bruce Croft

    Abstract: Product search serves as an important entry point for online shopping. In contrast to web search, the retrieved results in product search not only need to be relevant but also should satisfy customers' preferences in order to elicit purchases. Previous work has shown the efficacy of purchase history in personalized product search. However, customers with little or no purchase history do not benefi… ▽ More

    Submitted 9 January, 2020; v1 submitted 4 September, 2019; originally announced September 2019.

    Comments: Presented at 2019 SIGIR Workshop on eCommerce (ECOM'19)

  44. arXiv:1812.08870  [pdf, other

    cs.IR

    Iterative Relevance Feedback for Answer Passage Retrieval with Passage-level Semantic Match

    Authors: Keping Bi, Qingyao Ai, W. Bruce Croft

    Abstract: Relevance feedback techniques assume that users provide relevance judgments for the top k (usually 10) documents and then re-rank using a new query model based on those judgments. Even though this is effective, there has been little research recently on this topic because requiring users to provide substantial feedback on a result list is impractical in a typical web search scenario. In new enviro… ▽ More

    Submitted 20 December, 2018; originally announced December 2018.

    Journal ref: 41st European Conference on IR Research, ECIR 2019

  45. arXiv:1812.05731  [pdf, other

    cs.IR

    Revisiting Iterative Relevance Feedback for Document and Passage Retrieval

    Authors: Keping Bi, Qingyao Ai, W. Bruce Croft

    Abstract: As more and more search traffic comes from mobile phones, intelligent assistants, and smart-home devices, new challenges (e.g., limited presentation space) and opportunities come up in information retrieval. Previously, an effective technique, relevance feedback (RF), has rarely been used in real search scenarios due to the overhead of collecting users' relevance judgments. However, since users te… ▽ More

    Submitted 9 June, 2019; v1 submitted 13 December, 2018; originally announced December 2018.

  46. Unbiased Learning to Rank with Unbiased Propensity Estimation

    Authors: Qingyao Ai, Keping Bi, Cheng Luo, Jiafeng Guo, W. Bruce Croft

    Abstract: Learning to rank with biased click data is a well-known challenge. A variety of methods has been explored to debias click data for learning to rank such as click models, result interleaving and, more recently, the unbiased learning-to-rank framework based on inverse propensity weighting. Despite their differences, most existing studies separate the estimation of click bias (namely the \textit{prop… ▽ More

    Submitted 23 April, 2018; v1 submitted 16 April, 2018; originally announced April 2018.

  47. Learning a Deep Listwise Context Model for Ranking Refinement

    Authors: Qingyao Ai, Keping Bi, Jiafeng Guo, W. Bruce Croft

    Abstract: Learning to rank has been intensively studied and widely applied in information retrieval. Typically, a global ranking function is learned from a set of labeled data, which can achieve good performance on average but may be suboptimal for individual queries by ignoring the fact that relevant documents for different queries may have different distributions in the feature space. Inspired by the idea… ▽ More

    Submitted 23 April, 2018; v1 submitted 16 April, 2018; originally announced April 2018.

  48. arXiv:1702.06684  [pdf, other

    math.CO math.NT

    Residues modulo powers of two in the Young-Fibonacci lattice

    Authors: N. Karimilla Bi, Amritanshu Prasad, P. Giftson Santhosh

    Abstract: We study the subgraph of the Young-Fibonacci graph induced by elements with odd $f$-statistic (the $f$-statistic of an element $w$ of a differential graded poset is the number of saturated chains from the minimal element of the poset to $w$). We show that this subgraph is a binary tree. Moreover, the odd residues of the $f$-statistics in a row of this tree equidistibute modulo any power two. This… ▽ More

    Submitted 22 February, 2017; originally announced February 2017.

    Comments: 9 pages, 3 figures

    MSC Class: 06A10; 11N69; 05A15

  49. arXiv:1506.02780  [pdf, ps, other

    math.RT

    Cellularity of a Larger Class of Diagram Algebras

    Authors: N. Karimilla Bi

    Abstract: In this paper, we realize the algebra of $\mathbb{Z}_2$-relations, signed partition algebras and partition algebras as tabular algebras and prove the cellularity of these algebras using the method of \cite{GM1}. Using the results of Graham and Lehrer in \cite{GL}, we give the modular representations of the algebra of $\mathbb{Z}_2$-relations, signed partition algebras and partition algebras.

    Submitted 9 June, 2015; originally announced June 2015.

    Comments: 3 figures

    MSC Class: 16G30

  50. arXiv:1504.01377  [pdf, ps, other

    math.RA

    Eigenvalues of Gram Matrices of a class of Diagram Algebras

    Authors: N. Karimilla Bi, M. Parvathi

    Abstract: In this paper, we introduce symmetric diagram matrices $A_{s+r,s}$ of size ${_{(s+r)}}C_s$ whose entries are $\{x_i\}_{min\{s,r\}}$. We compute the eigenvalues of symmetric diagram matrices using elementary row and column operations inductively. As a byproduct, we obtain the eigenvalues of Gram matrices of a larger class of diagram algebras like the signed partition algebras, algebra of… ▽ More

    Submitted 6 April, 2015; originally announced April 2015.

    Comments: 32 pages, 11 figures. arXiv admin note: substantial text overlap with arXiv:1504.01241

    MSC Class: 16Gxx