Skip to main content

Showing 1–8 of 8 results for author: Tekin, S F

  1. arXiv:2410.03953  [pdf, other

    cs.CL cs.LG

    LLM-TOPLA: Efficient LLM Ensemble by Maximising Diversity

    Authors: Selim Furkan Tekin, Fatih Ilhan, Tiansheng Huang, Sihao Hu, Ling Liu

    Abstract: Combining large language models during training or at inference time has shown substantial performance gain over component LLMs. This paper presents LLM-TOPLA, a diversity-optimized LLM ensemble method with three unique properties: (i) We introduce the focal diversity metric to capture the diversity-performance correlation among component LLMs of an ensemble. (ii) We develop a diversity-optimized… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

  2. arXiv:2409.18169  [pdf, other

    cs.CR cs.AI cs.LG

    Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey

    Authors: Tiansheng Huang, Sihao Hu, Fatih Ilhan, Selim Furkan Tekin, Ling Liu

    Abstract: Recent research demonstrates that the nascent fine-tuning-as-a-service business model exposes serious safety concerns -- fine-tuning over a few harmful data uploaded by the users can compromise the safety alignment of the model. The attack, known as harmful fine-tuning, has raised a broad research interest among the community. However, as the attack is still new, \textbf{we observe from our misera… ▽ More

    Submitted 21 October, 2024; v1 submitted 26 September, 2024; originally announced September 2024.

  3. arXiv:2409.01586  [pdf, other

    cs.CL cs.AI

    Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation

    Authors: Tiansheng Huang, Sihao Hu, Fatih Ilhan, Selim Furkan Tekin, Ling Liu

    Abstract: Harmful fine-tuning issue \citep{qi2023fine} poses serious safety concerns for Large language models' fine-tuning-as-a-service. While existing defenses \citep{huang2024vaccine,rosati2024representation} have been proposed to mitigate the issue, their performances are still far away from satisfactory, and the root cause of the problem has not been fully recovered. For the first time in the literatur… ▽ More

    Submitted 18 September, 2024; v1 submitted 2 September, 2024; originally announced September 2024.

  4. arXiv:2405.18641  [pdf, other

    cs.LG

    Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning

    Authors: Tiansheng Huang, Sihao Hu, Fatih Ilhan, Selim Furkan Tekin, Ling Liu

    Abstract: Recent studies show that Large Language Models (LLMs) with safety alignment can be jail-broken by fine-tuning on a dataset mixed with harmful data. First time in the literature, we show that the jail-broken effect can be mitigated by separating states in the finetuning stage to optimize the alignment and user datasets. Unfortunately, our subsequent study shows that this simple Bi-State Optimizatio… ▽ More

    Submitted 26 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  5. arXiv:2404.04434  [pdf, other

    cs.CV cs.LG

    Robust Few-Shot Ensemble Learning with Focal Diversity-Based Pruning

    Authors: Selim Furkan Tekin, Fatih Ilhan, Tiansheng Huang, Sihao Hu, Ka-Ho Chow, Margaret L. Loper, Ling Liu

    Abstract: This paper presents FusionShot, a focal diversity optimized few-shot ensemble learning approach for boosting the robustness and generalization performance of pre-trained few-shot models. The paper makes three original contributions. First, we explore the unique characteristics of few-shot learning to ensemble multiple few-shot (FS) models by creating three alternative fusion channels. Second, we i… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  6. arXiv:2310.01152  [pdf, other

    cs.CR cs.AI

    Large Language Model-Powered Smart Contract Vulnerability Detection: New Perspectives

    Authors: Sihao Hu, Tiansheng Huang, Fatih İlhan, Selim Furkan Tekin, Ling Liu

    Abstract: This paper provides a systematic analysis of the opportunities, challenges, and potential solutions of harnessing Large Language Models (LLMs) such as GPT-4 to dig out vulnerabilities within smart contracts based on our ongoing research. For the task of smart contract vulnerability detection, achieving practical usability hinges on identifying as many true vulnerabilities as possible while minimiz… ▽ More

    Submitted 16 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 10 pages

    Journal ref: IEEE International Conference on Trust, Privacy and Security in Intelligent Systems, and Applications 2023

  7. arXiv:2111.14733  [pdf, other

    cs.LG cs.AI

    Crime Prediction with Graph Neural Networks and Multivariate Normal Distributions

    Authors: Selim Furkan Tekin, Suleyman Serdar Kozat

    Abstract: Existing approaches to the crime prediction problem are unsuccessful in expressing the details since they assign the probability values to large regions. This paper introduces a new architecture with the graph convolutional networks (GCN) and multivariate Gaussian distributions to perform high-resolution forecasting that applies to any spatiotemporal data. We tackle the sparsity problem in high re… ▽ More

    Submitted 16 December, 2021; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: Added references for coding libraries, and typos on equations and figure captions fixed

  8. arXiv:2102.00696  [pdf, other

    cs.LG cs.AI cs.CV

    Numerical Weather Forecasting using Convolutional-LSTM with Attention and Context Matcher Mechanisms

    Authors: Selim Furkan Tekin, Arda Fazla, Suleyman Serdar Kozat

    Abstract: Numerical weather forecasting using high-resolution physical models often requires extensive computational resources on supercomputers, which diminishes their wide usage in most real-life applications. As a remedy, applying deep learning methods has revealed innovative solutions within this field. To this end, we introduce a novel deep learning architecture for forecasting high-resolution spatio-t… ▽ More

    Submitted 4 October, 2023; v1 submitted 1 February, 2021; originally announced February 2021.

    Comments: - In our journal submission, we removed the integration of the observational data section since it was not used in the experiments. Thus, we also removed the authors from the paper who were responsible for that section. - In the second version, we also performed an experiment on WeatherBench. We compare our results with the Physical Weather Forecasting Models