Article

Communication decisions in multi-agent cooperation: model and experiments

Authors:

Shlomo ZilbersteinAuthors Info & Claims

AGENTS '01: Proceedings of the fifth international conference on Autonomous agents

Pages 616 - 623

https://doi.org/10.1145/375735.376469

Published: 28 May 2001 Publication History

Abstract

In multi-agent cooperation, agents share a common goal, which is evaluated through a global utility function. However, an agent typically cannot observe the global state of an uncertain environment, and therefore they must communicate with each other in order to share the information needed for deciding which actions to take. We argue that, when communication incurs a cost (due to resource consumption, for example), whether to communicate or not also becomes a decision to make. Hence, communication decision becomes part of the overall agent decision problem. In order to explicitly address this problem, we present a multi-agent extension to Markov decision processes in which communication can be modeled as an explicit action that incurs a cost. This framework provides a foundation for a quantified study of agent coordination policies and provides both motivation and insight to the design of heuristic approaches. An example problem is studied under this framework. From this example we can see the impact communication policies have on the overall agent policies, and what implications we can find toward the design of agent coordination policies.

References

[1]

M. Aicardi, F. Davoli, and R. Minciardi. Decentralized optimal control of markov chains with a common past information set. IEEE Transactions on Automatic Control, AC-32:1028-1031, 1987.

[2]

D. S. Bernstein, S. Zilberstein, and N. Immerman. The complexity of decentralized control of markov decision processes. In Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI-2000), 2000.

Digital Library

[3]

C. Boutilier. Sequential optimality and coordination in multiagent systems. In Proceedings of the Sixteenth International Joint Conferences on Artificial Intelligence (IJCAI-99), July 1999.

Digital Library

[4]

P. J. Gmytrasiewicz and E. H. Durfee. Rational interaction in multiagent environments: Coordination. Autonomous Agents and Multi-Agent Systems Journal, 1999.

Digital Library

[5]

E. Hansen. Cost-effective sensing during plan execution. In Proceedings of the Twelth National Conference onArtificial Intelligence, 1994.

Digital Library

[6]

E. Hansen, A. Barto, and S. Zilberstein. Reinforcement learning for mixed open-loop and closed-loop control. In Proceedings of the Ninth Neural Information Processing Systems Conference, December 1996.

[7]

E. A. Hansen and S. Zilberstein. Monitoring the progress of anytime problem-solving. In Proceedings of the 13th National Conference onArtificial Intelligence, pages 1229-1234, 1996.

Digital Library

[8]

Y. C. Ho and T. S. Chang. Another look at the nonclassical information problem. IEEE Transactions on Automatic Control, AC-25:537-540, 1980.

[9]

K. Hsu and S. I. Marcus. Decentralized control of finite state markov processes. IEEE Transactions on Automatic Control, AC-27:426-431, 1982.

[10]

M. L. Littman. Markov games as a framework for multi-agent reinforcement learning. In Proc. 11th International Conf. on Machine Learning, pages 157-163, 1994.

[11]

G. O'Hare and N. Jennings, editors. Foundations of Distributed Artificial Intelligence. John Wiley, 1996.

Digital Library

[12]

C. H. Papadimitriou and J. N. Tsitsiklis. The complexity of markov decision processes. Mathematics of Operations Research, 12(3):441-450, 1987.

Digital Library

[13]

N. R. Sandell, P. Varaiya, M. Athans, and M. Safonov. Survey of decentralized control methods for large scale systems. IEEE Transactions on Automatic Control, AC-23:108-128, 1978.

[14]

J. N. Tsitsiklis and M. Athans. On the complexity of decentralized decision making and detection problems. IEEE Transactions on Automatic Control, AC-30:440-446, 1985.

[15]

H. S. Witsenhausen. A counterexample in stochastic optimum control. SIAM Journal on Control, 6(1):138-147, 1968.

[16]

T. Yoshikawa. Decomposition of dynamic team decision problems. IEEE Transactions on Automatic Control, AC-23:443-445, 1978.

Cited By

Mostaani AVu TChatzinotas SOttersten B(2024)Task-Effective Compression of Observations for the Centralized Control of a Multiagent System Over Bit-Budgeted ChannelsIEEE Internet of Things Journal10.1109/JIOT.2023.331255311:4(6131-6143)Online publication date: 15-Feb-2024
https://doi.org/10.1109/JIOT.2023.3312553
Cao ZWang ZXie SLiu AFan L(2024)Smart Help: Strategic Opponent Modeling for Proactive and Adaptive Robot Assistance in Households2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.01713(18091-18101)Online publication date: 16-Jun-2024
https://doi.org/10.1109/CVPR52733.2024.01713
Keren SWies DBernardini SElkind E(2023)Helpful information sharing for partially informed planning agentsProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/597(5377-5385)Online publication date: 19-Aug-2023
https://dl.acm.org/doi/10.24963/ijcai.2023/597
Show More Cited By

Index Terms

Communication decisions in multi-agent cooperation: model and experiments

Recommendations

Execution-time communication decisions for coordination of multi-agent teams
Congregation Formation in Multiagent Systems

We present congregating both as a metaphor for describing and modeling multiagent systems (MAS) and as a means for reducing coordination costs in large-scale MAS. When agents must search for other agents to interact with, congregations provide a way for ...
How communication can improve the performance of multi-agent systems
AGENTS '01: Proceedings of the fifth international conference on Autonomous agents

We analyze a general model of multi-agent communication in which all agents learn to communicate simultaneously to a message board. We show that the communicating multi-agent system is equivalent to a Mealy finite state machine whose states are ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

AGENTS '01: Proceedings of the fifth international conference on Autonomous agents

May 2001

662 pages

ISBN:158113326X

DOI:10.1145/375735

Chairmen:
Elisabeth André
DFKI, Germany
,
Sandip Sen
Univ. of Tulsa,
,
Claude Frasson
Univ. of Montreal, Montreal, P.Q., Canada
,
Jörg P. Müller
Seimens, Germany

Copyright © 2001 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 May 2001

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Article

Conference

AGENTS01

Sponsor:

AGENTS01: Autonomous Agents 2001

Quebec, Montreal, Canada

Acceptance Rates

AGENTS '01 Paper Acceptance Rate 66 of 248 submissions, 27%;

Overall Acceptance Rate 182 of 599 submissions, 30%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

125
Total Citations
View Citations
1,641
Total Downloads

Downloads (Last 12 months)94
Downloads (Last 6 weeks)8

Reflects downloads up to 23 Oct 2024

Other Metrics

View Author Metrics

Citations

Cited By

Mostaani AVu TChatzinotas SOttersten B(2024)Task-Effective Compression of Observations for the Centralized Control of a Multiagent System Over Bit-Budgeted ChannelsIEEE Internet of Things Journal10.1109/JIOT.2023.331255311:4(6131-6143)Online publication date: 15-Feb-2024
https://doi.org/10.1109/JIOT.2023.3312553
Cao ZWang ZXie SLiu AFan L(2024)Smart Help: Strategic Opponent Modeling for Proactive and Adaptive Robot Assistance in Households2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.01713(18091-18101)Online publication date: 16-Jun-2024
https://doi.org/10.1109/CVPR52733.2024.01713
Keren SWies DBernardini SElkind E(2023)Helpful information sharing for partially informed planning agentsProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/597(5377-5385)Online publication date: 19-Aug-2023
https://dl.acm.org/doi/10.24963/ijcai.2023/597
Pan JYoshikawa AYamamura M(2023)Cooperation: A Systematic Review of how to Enable Agent to Circumvent the Prisoner’s DilemmaSHS Web of Conferences10.1051/shsconf/202317803005178(03005)Online publication date: 23-Oct-2023
https://doi.org/10.1051/shsconf/202317803005
Quandt R(2022)AI in society: A theoryFrontiers in Physics10.3389/fphy.2022.94182410Online publication date: 6-Oct-2022
https://doi.org/10.3389/fphy.2022.941824
Mostaani AVu TChatzinotas SOttersten B(2022)Task-Oriented Data Compression for Multi-Agent Communications Over Bit-Budgeted ChannelsIEEE Open Journal of the Communications Society10.1109/OJCOMS.2022.32132133(1867-1886)Online publication date: 2022
https://doi.org/10.1109/OJCOMS.2022.3213213
Bossé ÉBarès MBarès MBossé É(2022)Actionable Knowledge for Efficient ActionsRelational Calculus for Actionable Knowledge10.1007/978-3-030-92430-0_5(217-279)Online publication date: 21-Jan-2022
https://doi.org/10.1007/978-3-030-92430-0_5
Liu WRan WNantogma SXu Y(2021)Adaptive Information Sharing with Ontological Relevance Computation for Decentralized Self-Organization SystemsEntropy10.3390/e2303034223:3(342)Online publication date: 14-Mar-2021
https://doi.org/10.3390/e23030342
Sun QYao YYi PZhou XYang G(2021)Work in Progress: Role-based Deep Reinforcement Learning with Information Sharing for Intelligent Unmanned Systems2021 IEEE 27th Real-Time and Embedded Technology and Applications Symposium (RTAS)10.1109/RTAS52030.2021.00059(489-492)Online publication date: May-2021
https://doi.org/10.1109/RTAS52030.2021.00059
Cheriyan JSavarimuthu BCranefield S(2021)Norm Violation in Online Communities – A Study of Stack Overflow CommentsCoordination, Organizations, Institutions, Norms, and Ethics for Governance of Multi-Agent Systems XIII10.1007/978-3-030-72376-7_2(20-34)Online publication date: 2-Apr-2021
https://doi.org/10.1007/978-3-030-72376-7_2
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents