~

8d738f52 · Zidong Du · c26556f4 · 8d738f52 · 8d738f52
Commit 8d738f52 authored Sep 09, 2020 by Zidong Du
Hide whitespace changes
Inline Side-by-side

Showing with 126 additions and 107 deletions

AAAI2021/paper.tex
+11 -7

AAAI2021/tex/introduction.tex
+115 -100

No files found.
--- a/AAAI2021/paper.tex
+++ b/AAAI2021/paper.tex
@@ -7,6 +7,7 @@
 \usepackage{color}
 \newcommand{\rmk}[1]{\textcolor{red}{--[#1]--}}
 \newcommand{\note}[1]{\textcolor{red}{#1}}
+\usepackage{enumitem}

 \usepackage{aaai21}  % DO NOT CHANGE THIS
 \usepackage{times}  % DO NOT CHANGE THIS
@@ -166,13 +167,13 @@
 \maketitle

 \begin{abstract}
- The natural emergence of symbolic languages with high compositionality has
+ The emergence of symbolic languages with high compositionality has
  attracted extensive attention from a broad range of communities. Existing
  studies achieve high compositionality through \emph{deliberately handcrafted}
  inductions (e.g., small vocabulary sizes, carefully constructed distractors, 
  and ease-of-teaching) in multi-agent learning, which are unnatural. 
  Yet, few studies investigate the emergence of symbolic language with high
-  compositionality  \emph{``naturally''}, i.e., without any deliberately handcrafted
+  compositionality \emph{naturally}, i.e., without any deliberately handcrafted
  inductions. 
  
  In this paper, we are the first to successfully achieve high compositional symbolic
@@ -182,11 +183,14 @@
  inductions, we observe that the agent capacity plays a key role in
  compositionality. We further reveal and characterize the quantitative relationship
  between the agent capacity and the compositionality of symbolic language both
-  theoretically and experimentally. The theoretical analysis is built on the MSC
-  (Markov Series Channel) model for the language transmission process and a
-  novel mutual information-based metric for the compositionality. The
-  experiments are conducted on a listener-speaker referential game framework
-  with eliminated external environment factors. Both theoretical analysis and
+  theoretically and experimentally.
+  %The theoretical analysis is built on the MSC
+  %(Markov Series Channel) model for the language transmission process and a
+  %novel mutual information-based metric for the compositionality.
+  %The
+  %experiments are conducted on a listener-speaker referential game framework
+  %with eliminated external environment factors.
+  Both theoretical analysis and
  experimental results lead to a counter-intuitive conclusion that lower agent
  capacity facilitates the emergence of symbolic language with higher
  compositionality. \note{Based on our conclusion, we can generate higher

--- a/AAAI2021/tex/introduction.tex
+++ b/AAAI2021/tex/introduction.tex
 \section{Introduction}
 \label{sec:introduction}

-The emergence of human language has always been an important and controversial
+The emergence of symbolic language has always been an important and controversial
 issue. This problem attracts attentions from a broad range of communities,
 including philology~\cite{}, biology~\cite{}, and computer
 science~\cite{}. Especially in computer science, efforts in recent years try to explore
-the emergence of symbolic language in virtual environments, where agents are trained 
-to communicate with neural network based method, i.e, deep reinforcement
-learning~\cite{}. For example, \note{XXXX}
+the emergence of symbolic language in virtual, multi-agent environments, where
+agents are trained to communicate with neural network based method, i.e., deep
+reinforcement learning~\cite{}. For example, \note{XXXX}
 %Such works can be roughly classified into two categories,
 %referential game~\cite{} and multi-agent reinforcement learning (MARL)~\cite{}, based on
 %the environment setting.  
@@ -20,7 +20,7 @@ whether the meaning of a complex expression (e.g, phase), which is assembled out
 given set of simple components (e.g., symbols), can be determined by its
 constituent components and the rules that combines them~\cite{}.
 \note{For example, the expression "AAAI is a conference'' consists of two
-meaningful words ``AAAI'' and ``conference'', and a rule of definition (``is'').}
+meaningful words ``AAAI'' and ``conference'', and a rule for definition (``is'').}
 More recently, measuring the compositionality \note{xxxxx}.


@@ -41,43 +41,43 @@ More recently, measuring the compositionality \note{xxxxx}.
  \label{fig:symbols}
  \end{figure}

-Prior studies focus on achieving high compositionality of the emergent language
-through \emph{deliberately handcrafted} inductions unnaturally, e.g., small vocabulary
+Prior studies focus on achieving high compositionality
+through \emph{deliberately handcrafted} inductions, e.g., small vocabulary
 sizes~\cite{}, memoryless~\cite{}, carefully constructed rewards~\cite{}, and
 ease-of-teaching~\cite{}. \note{xxxxxxx}
-However, these unnatural inductions prevent us to better understand the mystery of
+However, these unnatural inductions prevent us from better understanding the mystery of
 the emergence of language and even intelligence among our pre-human ancestors.
-
-
-Yet, few works investigate the emergence of symbolic language with high
-compositionality \emph{naturally} (i.e., without \emph{deliberately
-  handcrafted} inductions). As a results, it is never clear whether \emph{natural}
+Yet, few works investigate the emergence of high compositional symbolic language
+\emph{naturally}, i.e., without \emph{deliberately
+  handcrafted} inductions.
+As a results, it is never clear whether \emph{natural}
 environment and agent are sufficient for compositionality. 
-In this work, we focus on generating high compositional symbolic language
-naturally without any ``human'' induction.
+
+In this work, we focus on the natural emergence of high compositional symbolic language
+naturally without any handcrafted induction.
 Initially, we thoroughly analyze the compositionality of emerged symbolic
 language after removing the \emph{deliberately handcrafted}
 inductions. Figure~\ref{fig:comp} reports the compositionality when train two
 agents in a listener-speaker referential game. It can be observed that \note{it
-  is challenging to achieve high compositionality without induction as xxxxxx}. Moreover,
-we observe that the agent capacity plays a key role in compositionality, see
-Figure xxx.
+  is challenging to achieve high compositionality without induction as
+  xxxxxx}. Moreover, we observe that the agent capacity plays a key role in
+compositionality, see Figure xxx.

-
-In this work, we reveal and characterize the quantitative relationship
+We reveal and characterize the quantitative relationship
 between the agent capacity and the compositionality of symbolic language both
 theoretically and experimentally.
 %theoretically
-Regarding the theoretical analysis, it is conduct based on the MSC
-(Markov Series Channel) model for the language transmission process and a
-novel mutual information-based metric for the compositionality.
-
+Regarding the theoretical analysis, we use the
+Markov Series Channel (MSC)~\cite{} to model the language transmission process and a
+novel mutual information-based metric to measure the compositionality quantitatively. 
 %experimentally
-Regarding the experimental verification, it is conducted on a listener-speaker referential game framework
-with eliminated unnatural inductions.
-
-================
-
+Regarding the experimental verification, it is conducted on a listener-speaker
+referential game framework with eliminated unnatural inductions.
+Both theoretical analysis and
+  experimental results lead to a counter-intuitive conclusion that lower agent
+  capacity facilitates the emergence of symbolic language with higher
+  compositionality.
+  

 %Prior studies focus on investigating how to affect the
 %compositionality of the emergent language. Researchers
@@ -90,76 +90,91 @@ with eliminated unnatural inductions.
 %like regrading the entropy as an item of additional rewards, may be too ideal to
 %exist in the real world. 

-In contrast to prior work, we investigate the compositionality of emergent
-language from a new perspective, i.e., the agent capacity. Different from
-previous work that only considers external environmental factors, we study the
-impact of agent internal capacity on the compositionality of emergent
-language. Specifically, we first analyze the correlation between agent capacity
-and compositionality theoretically, and propose a novel metric to evaluate
-compostionality quantitatively. Then, on the basis of the theoretical analysis
-and the metric proposed, we verify the relationship between agent capacity and
-compostionality experimentally. 
-
-
-Theoretically, on the basis of mutual information theory[8], we analyse the
-correlation between compostionality of the emergent language and complexity of
-the semantic information carried by a symbol. Such semantic information can be
-characterized in neural network-based agents and requires the certain capacity
-(i.e., the count of neural nodes in the hidden layer). Specifically, we use the
-MSC (Markov Series Channel)[9] to model the language transmission process and
-use the probability distribution of symbols and concepts to model policies of
-agents. After modelling, we use the mutual information matrix $MRI^B$ to
-quantitatively represent the semantic information, and each column of $MRI^B$
-correspond to information carried by one symbol. We find that each column of the
-matrix should be an one-hot vector for a perfectly compositional language, cause
-a symbol only transmit information of a certain concept exclusively. Therefore,
-the average similarity between the columns of $MRI^B$ and a one-hot vector is
-higher, indicating that the emergent language is more compostional (i.e., the
-compostionality is higher). We propose the metric \emph{MIS} to measure
-compositionality by calculating such average similarity
-quantitatively. Different from other metrics, such as \emph{topographic
-  similarity}[10] and \emph{posdis}[11], \emph{MIS} is a bilateral metric
-because it takes both listener and speaker's understanding of semantics into
-account. Moreover, \emph{MIS} comes lower indicates that the emergent language
-tends to delivery semantic information about more concepts in each symbol, so
-that the complexity of semantic information carried by one symbol tend to be
-higher. As a result, higher agent capacity is required to characterize the more
-complex semantic information when \emph{MIS} (i.e., compositionality) is lower. 
-
-Experimentally, we verify the relationship between agent capacity and
-compostionality. We build a listener-speaker referential game as experimental
-framework, and train agents of Stochastic Policy Gradient Algorithm[12] with the
-correctness of forecast output from the listener as the criterion (i.e.,
-reward). The criterion does not imply any environmental pressures on the
-agents. Therefore, we can study the impact of capacity on the compositionality
-without any environmental pressures’ affection. Moreover, to study the impact of
-capacity on the compositionality under a more ‘natural’ environment, the speaker
-and listener are disconnected models without sharing parameters. Our first
-experiment is to verify that agent need higher capacity to master an artificial
-language with lower compositionality under a scenario of language
-teaching. Specifically, we fabricate the speaker to output preassigned languages
-with different compostionality respectively, and train the listener to interpret
-the preassigned language. For all artificial language, we compare the accuracy
-curve during training process of the listener with different capacity, and show
-how capacity affect learning languages with different compostionality. Our
-second experiment is to verify that lower agent capacity would facilitate higher
-compostionality of the emergent language under a scenario of language
-inducing. Specifically, we training a speaker and a listener to create a
-communication protocol (i.e., emergent language), so that the listener can
-select the same object which is received by the speaker. By adjusting capacity
-and comparing the compositionality of emergent language, we show that the
-emergent language attend to have higher compositionality when agent capacity is
-restricted more stringently. As a result, these two experiments verify the
-negative correlation between agent capacity and compostionality both in language
-teaching and language inducing. 
-
-
-
-This paper makes the following contributions:
-
-We propose a ‘bilateral’ metric \emph{MIS}, which takes both listener and speaker's understanding of semantics into account. Compare to previous ‘unilateral’ metrics, \emph{MIS} can handle situations where the semantics of the listener and the speaker are not exactly the same (, we discuss the problem in next section). 
+%In contrast to prior work, we investigate the compositionality of emergent
+%language from a new perspective, i.e., the agent capacity. Different from
+%previous work that only considers external environmental factors, we study the
+%impact of agent internal capacity on the compositionality of emergent
+%language. Specifically, we first analyze the correlation between agent capacity
+%and compositionality theoretically, and propose a novel metric to evaluate
+%compostionality quantitatively. Then, on the basis of the theoretical analysis
+%and the metric proposed, we verify the relationship between agent capacity and
+%compostionality experimentally. 
+%
+%
+%Theoretically, on the basis of mutual information theory[8], we analyse the
+%correlation between compostionality of the emergent language and complexity of
+%the semantic information carried by a symbol. Such semantic information can be
+%characterized in neural network-based agents and requires the certain capacity
+%(i.e., the count of neural nodes in the hidden layer). Specifically, we use the
+%MSC (Markov Series Channel)[9] to model the language transmission process and
+%use the probability distribution of symbols and concepts to model policies of
+%agents. After modelling, we use the mutual information matrix $MRI^B$ to
+%quantitatively represent the semantic information, and each column of $MRI^B$
+%correspond to information carried by one symbol. We find that each column of the
+%matrix should be an one-hot vector for a perfectly compositional language, cause
+%a symbol only transmit information of a certain concept exclusively. Therefore,
+%the average similarity between the columns of $MRI^B$ and a one-hot vector is
+%higher, indicating that the emergent language is more compostional (i.e., the
+%compostionality is higher). We propose the metric \emph{MIS} to measure
+%compositionality by calculating such average similarity
+%quantitatively. Different from other metrics, such as \emph{topographic
+%  similarity}[10] and \emph{posdis}[11], \emph{MIS} is a bilateral metric
+%because it takes both listener and speaker's understanding of semantics into
+%account. Moreover, \emph{MIS} comes lower indicates that the emergent language
+%tends to delivery semantic information about more concepts in each symbol, so
+%that the complexity of semantic information carried by one symbol tend to be
+%higher. As a result, higher agent capacity is required to characterize the more
+%complex semantic information when \emph{MIS} (i.e., compositionality) is lower. 
+%
+%Experimentally, we verify the relationship between agent capacity and
+%compostionality. We build a listener-speaker referential game as experimental
+%framework, and train agents of Stochastic Policy Gradient Algorithm[12] with the
+%correctness of forecast output from the listener as the criterion (i.e.,
+%reward). The criterion does not imply any environmental pressures on the
+%agents. Therefore, we can study the impact of capacity on the compositionality
+%without any environmental pressures’ affection. Moreover, to study the impact of
+%capacity on the compositionality under a more ‘natural’ environment, the speaker
+%and listener are disconnected models without sharing parameters. Our first
+%experiment is to verify that agent need higher capacity to master an artificial
+%language with lower compositionality under a scenario of language
+%teaching. Specifically, we fabricate the speaker to output preassigned languages
+%with different compostionality respectively, and train the listener to interpret
+%the preassigned language. For all artificial language, we compare the accuracy
+%curve during training process of the listener with different capacity, and show
+%how capacity affect learning languages with different compostionality. Our
+%second experiment is to verify that lower agent capacity would facilitate higher
+%compostionality of the emergent language under a scenario of language
+%inducing. Specifically, we training a speaker and a listener to create a
+%communication protocol (i.e., emergent language), so that the listener can
+%select the same object which is received by the speaker. By adjusting capacity
+%and comparing the compositionality of emergent language, we show that the
+%emergent language attend to have higher compositionality when agent capacity is
+%restricted more stringently. As a result, these two experiments verify the
+%negative correlation between agent capacity and compostionality both in language
+%teaching and language inducing. 

-We analyse the relationship between compostionality and agent capacity theoretically.

-We verify the negative correlation between agent capacity and compostionality both in language teaching and language inducing.
-%\endsection
+%
+%This paper makes the following contributions:
+%
+%We propose a ‘bilateral’ metric \emph{MIS}, which takes both listener and speaker's understanding of semantics into account. Compare to previous ‘unilateral’ metrics, \emph{MIS} can handle situations where the semantics of the listener and the speaker are not exactly the same (, we discuss the problem in next section). 
+%
+%We analyse the relationship between compostionality and agent capacity theoretically.
+%
+%We verify the negative correlation between agent capacity and compostionality both in language teaching and language inducing.
+%%\endsection
+
+
+In this paper, we made the following contributions:
+\begin{itemize}[topsep=0pt,itemsep=0cm]
+\item We are the first to successfully achieve high compositional symbolic
+  language naturally, without any deliberately handcrafted inductions.
+\item We thoroughly analyze the compositionality of emerged symbolic language
+  after removing deliberately handcrafted inductions, and confirm that the agent
+  capacity acts as a key factor for compositionality.
+\item We experimentally verified the conclusion for relationship between agent
+  capacity. Both theoretical analysis and
+  experimental results lead to a counter-intuitive conclusion that lower agent
+  capacity facilitates the emergence of symbolic language with higher
+  compositionality.
+\end{itemize}