skip to main content
Article

Communication decisions in multi-agent cooperation: model and experiments

Published: 28 May 2001 Publication History

Abstract

In multi-agent cooperation, agents share a common goal, which is evaluated through a global utility function. However, an agent typically cannot observe the global state of an uncertain environment, and therefore they must communicate with each other in order to share the information needed for deciding which actions to take. We argue that, when communication incurs a cost (due to resource consumption, for example), whether to communicate or not also becomes a decision to make. Hence, communication decision becomes part of the overall agent decision problem. In order to explicitly address this problem, we present a multi-agent extension to Markov decision processes in which communication can be modeled as an explicit action that incurs a cost. This framework provides a foundation for a quantified study of agent coordination policies and provides both motivation and insight to the design of heuristic approaches. An example problem is studied under this framework. From this example we can see the impact communication policies have on the overall agent policies, and what implications we can find toward the design of agent coordination policies.

References

[1]
M. Aicardi, F. Davoli, and R. Minciardi. Decentralized optimal control of markov chains with a common past information set. IEEE Transactions on Automatic Control, AC-32:1028-1031, 1987.
[2]
D. S. Bernstein, S. Zilberstein, and N. Immerman. The complexity of decentralized control of markov decision processes. In Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI-2000), 2000.
[3]
C. Boutilier. Sequential optimality and coordination in multiagent systems. In Proceedings of the Sixteenth International Joint Conferences on Artificial Intelligence (IJCAI-99), July 1999.
[4]
P. J. Gmytrasiewicz and E. H. Durfee. Rational interaction in multiagent environments: Coordination. Autonomous Agents and Multi-Agent Systems Journal, 1999.
[5]
E. Hansen. Cost-effective sensing during plan execution. In Proceedings of the Twelth National Conference onArtificial Intelligence, 1994.
[6]
E. Hansen, A. Barto, and S. Zilberstein. Reinforcement learning for mixed open-loop and closed-loop control. In Proceedings of the Ninth Neural Information Processing Systems Conference, December 1996.
[7]
E. A. Hansen and S. Zilberstein. Monitoring the progress of anytime problem-solving. In Proceedings of the 13th National Conference onArtificial Intelligence, pages 1229-1234, 1996.
[8]
Y. C. Ho and T. S. Chang. Another look at the nonclassical information problem. IEEE Transactions on Automatic Control, AC-25:537-540, 1980.
[9]
K. Hsu and S. I. Marcus. Decentralized control of finite state markov processes. IEEE Transactions on Automatic Control, AC-27:426-431, 1982.
[10]
M. L. Littman. Markov games as a framework for multi-agent reinforcement learning. In Proc. 11th International Conf. on Machine Learning, pages 157-163, 1994.
[11]
G. O'Hare and N. Jennings, editors. Foundations of Distributed Artificial Intelligence. John Wiley, 1996.
[12]
C. H. Papadimitriou and J. N. Tsitsiklis. The complexity of markov decision processes. Mathematics of Operations Research, 12(3):441-450, 1987.
[13]
N. R. Sandell, P. Varaiya, M. Athans, and M. Safonov. Survey of decentralized control methods for large scale systems. IEEE Transactions on Automatic Control, AC-23:108-128, 1978.
[14]
J. N. Tsitsiklis and M. Athans. On the complexity of decentralized decision making and detection problems. IEEE Transactions on Automatic Control, AC-30:440-446, 1985.
[15]
H. S. Witsenhausen. A counterexample in stochastic optimum control. SIAM Journal on Control, 6(1):138-147, 1968.
[16]
T. Yoshikawa. Decomposition of dynamic team decision problems. IEEE Transactions on Automatic Control, AC-23:443-445, 1978.

Cited By

View all
  • (2024)Task-Effective Compression of Observations for the Centralized Control of a Multiagent System Over Bit-Budgeted ChannelsIEEE Internet of Things Journal10.1109/JIOT.2023.331255311:4(6131-6143)Online publication date: 15-Feb-2024
  • (2024)Smart Help: Strategic Opponent Modeling for Proactive and Adaptive Robot Assistance in Households2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.01713(18091-18101)Online publication date: 16-Jun-2024
  • (2023)Helpful information sharing for partially informed planning agentsProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/597(5377-5385)Online publication date: 19-Aug-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
AGENTS '01: Proceedings of the fifth international conference on Autonomous agents
May 2001
662 pages
ISBN:158113326X
DOI:10.1145/375735
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 May 2001

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. MDP
  2. coordinating multiple agents
  3. multi-agent communication/collaboration

Qualifiers

  • Article

Conference

AGENTS01
AGENTS01: Autonomous Agents 2001
Quebec, Montreal, Canada

Acceptance Rates

AGENTS '01 Paper Acceptance Rate 66 of 248 submissions, 27%;
Overall Acceptance Rate 182 of 599 submissions, 30%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)94
  • Downloads (Last 6 weeks)8
Reflects downloads up to 23 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Task-Effective Compression of Observations for the Centralized Control of a Multiagent System Over Bit-Budgeted ChannelsIEEE Internet of Things Journal10.1109/JIOT.2023.331255311:4(6131-6143)Online publication date: 15-Feb-2024
  • (2024)Smart Help: Strategic Opponent Modeling for Proactive and Adaptive Robot Assistance in Households2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.01713(18091-18101)Online publication date: 16-Jun-2024
  • (2023)Helpful information sharing for partially informed planning agentsProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/597(5377-5385)Online publication date: 19-Aug-2023
  • (2023)Cooperation: A Systematic Review of how to Enable Agent to Circumvent the Prisoner’s DilemmaSHS Web of Conferences10.1051/shsconf/202317803005178(03005)Online publication date: 23-Oct-2023
  • (2022)AI in society: A theoryFrontiers in Physics10.3389/fphy.2022.94182410Online publication date: 6-Oct-2022
  • (2022)Task-Oriented Data Compression for Multi-Agent Communications Over Bit-Budgeted ChannelsIEEE Open Journal of the Communications Society10.1109/OJCOMS.2022.32132133(1867-1886)Online publication date: 2022
  • (2022)Actionable Knowledge for Efficient ActionsRelational Calculus for Actionable Knowledge10.1007/978-3-030-92430-0_5(217-279)Online publication date: 21-Jan-2022
  • (2021)Adaptive Information Sharing with Ontological Relevance Computation for Decentralized Self-Organization SystemsEntropy10.3390/e2303034223:3(342)Online publication date: 14-Mar-2021
  • (2021)Work in Progress: Role-based Deep Reinforcement Learning with Information Sharing for Intelligent Unmanned Systems2021 IEEE 27th Real-Time and Embedded Technology and Applications Symposium (RTAS)10.1109/RTAS52030.2021.00059(489-492)Online publication date: May-2021
  • (2021)Norm Violation in Online Communities – A Study of Stack Overflow CommentsCoordination, Organizations, Institutions, Norms, and Ethics for Governance of Multi-Agent Systems XIII10.1007/978-3-030-72376-7_2(20-34)Online publication date: 2-Apr-2021
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media