This article proposes a linguistic augmentation scheme for typographic attacks using explicit instructional directives.

Exploiting Vision-LLM Vulnerability: Enhancing Typographic Attacks with Instructional Directives

2025/10/01 03:30

Abstract and 1. Introduction

  2. Related Work

    2.1 Vision-LLMs

    2.2 Transferable Adversarial Attacks

  3. Preliminaries

    3.1 Revisiting Auto-Regressive Vision-LLMs

    3.2 Typographic Attacks in Vision-LLMs-based AD Systems

  4. Methodology

    4.1 Auto-Generation of Typographic Attack

    4.2 Augmentations of Typographic Attack

    4.3 Realizations of Typographic Attacks

  5. Experiments

  6. Conclusion and References

4.2 Augmentations of Typographic Attack

Inspired by the success of instruction-prompting methodologies [37, 38] and the greedy reasoning in LLMs [39], and to further exploit the ambiguity between textual and visual tokens in Vision-LLMs, we propose to augment the typographic attack prompts within images by explicitly providing instruction keywords that emphasize text-to-text alignment over visual-language alignment. Our approach realizes this concept in the form of instructional directives: ❶ command directives, which emphasize a false answer, and ❷ conjunction directives, which additionally include further attack clauses. In particular, we develop the following:

• Command Directive. By embedding commands within the attacks, we aim to prompt Vision-LLMs into greedily producing erroneous answers. Our work investigates the "ANSWER:" directive as a prefix before the first attack prompt.

• Conjunction Directive. Conjunctions and connectors (or the lack thereof) link separate attack concepts so that the overall text appears more coherent, thereby increasing the likelihood of multi-task success. In our work, we investigate "AND," "OR," "WITH," or simply empty spaces as prefixes between attack prompts.

While other forms of directives may also help raise the attack success rate, this work focuses on basic directives related to typographic attacks.
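As a rough illustration of how the two directives compose a single attack string, consider the minimal sketch below. It is not the authors' implementation: the helper name `augment_attack`, its parameters, and the example prompts are hypothetical, and only the "ANSWER:" prefix and the "AND," "OR," "WITH," and empty-space connectors come from the text above.

```python
from typing import Sequence

# Conjunction directives named in the text; "" stands for plain empty space.
CONJUNCTIONS = ("AND", "OR", "WITH", "")


def augment_attack(prompts: Sequence[str],
                   command: str = "ANSWER:",
                   conjunction: str = "AND") -> str:
    """Compose attack prompts into one typographic string.

    Hypothetical helper for illustration only; the name and signature
    are assumptions, not part of the original work.
    """
    if not prompts:
        raise ValueError("at least one attack prompt is required")
    if conjunction not in CONJUNCTIONS:
        raise ValueError(f"unsupported conjunction: {conjunction!r}")

    # Command directive: placed as a prefix before the first attack prompt.
    parts = [command, prompts[0]] if command else [prompts[0]]

    # Conjunction directive: placed as a prefix before each later prompt.
    # An empty conjunction leaves only the separating space.
    for prompt in prompts[1:]:
        if conjunction:
            parts.append(conjunction)
        parts.append(prompt)

    return " ".join(parts)
```

With the hypothetical prompts `["no pedestrians", "green light"]` and `conjunction="WITH"`, the helper returns `ANSWER: no pedestrians WITH green light`. Passing `conjunction=""` gives the empty-space variant, `ANSWER: no pedestrians green light`.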


:::info Authors:

(1) Nhat Chung, CFAR and IHPC, A*STAR, Singapore and VNU-HCM, Vietnam;

(2) Sensen Gao, CFAR and IHPC, A*STAR, Singapore and Nankai University, China;

(3) Tuan-Anh Vu, CFAR and IHPC, A*STAR, Singapore and HKUST, HKSAR;

(4) Jie Zhang, Nanyang Technological University, Singapore;

(5) Aishan Liu, Beihang University, China;

(6) Yun Lin, Shanghai Jiao Tong University, China;

(7) Jin Song Dong, National University of Singapore, Singapore;

(8) Qing Guo, CFAR and IHPC, A*STAR, Singapore and National University of Singapore, Singapore.

:::


:::info This paper is available on arXiv under the CC BY 4.0 DEED license.

:::

