Bandits for Online Calibration: An Application to Content Moderation on Social Media Platforms
Authors: Vashist Avadhanula, Omar Abdul Baki, Hamsa Bastani, Osbert Bastani, Caner Gocmen, Daniel Haimovich, Darren Hwang, Dima Karamshuk, Thomas Leeper, Jiayuan Ma, Gregory Macnamara, Jake Mullett, Christopher Palow, Sung Park, Varun S Rajagopal, Kevin Schaeffer, Parikshit Shah, Deeksha Sinha, Nicolas Stier-Moses, Peng Xu
Abstract:
We describe the current content moderation strategy employed by Meta to remove policy-violating content from its platforms. Meta relies on both handcrafted and learned risk models to flag potentially violating content for human review. Our approach aggregates these risk models into a single ranking score, calibrating them to prioritize more reliable risk models. A key challenge is that violation trends change over time, affecting which risk models are most reliable. Our system additionally handles production challenges such as changing risk models and novel risk models. We use a contextual bandit to update the calibration in response to such trends. Our approach increases Meta's top-line metric for measuring the effectiveness of its content moderation strategy by 13%.
Submitted 11 November, 2022;
originally announced November 2022.
Information Disclosure and Promotion Policy Design for Platforms
Authors: Yonatan Gur, Gregory Macnamara, Ilan Morgenstern, Daniela Saban
Abstract:
We consider a platform facilitating trade between sellers and buyers with the objective of maximizing consumer surplus. Even though in many such marketplaces prices are set by revenue-maximizing sellers, platforms can influence prices through (i) price-dependent promotion policies that can increase demand for a product by featuring it in a prominent position on the webpage and (ii) the information revealed to sellers about the value of being promoted. Identifying effective joint information design and promotion policies is a challenging dynamic problem as sellers can sequentially learn the promotion value from sales observations and update prices accordingly. We introduce the notion of confounding promotion policies, which are designed to prevent a Bayesian seller from learning the promotion value (at the expense of the short-run loss of diverting some consumers from the best product offering). Leveraging these policies, we characterize the maximum long-run average consumer surplus that is achievable through joint information design and promotion policies when the seller sets prices myopically. We then construct a Bayesian Nash equilibrium in which the seller's best response to the platform's optimal policy is to price myopically in every period. Moreover, the equilibrium we identify is platform-optimal within the class of horizon-maximin equilibria, in which strategies are not predicated on precise knowledge of the horizon length, and are designed to maximize payoff over the worst-case horizon. Our analysis allows one to identify practical long-run average optimal platform policies in a broad range of demand models.
Submitted 15 December, 2022; v1 submitted 20 November, 2019;
originally announced November 2019.