Commit f9368418 by Qi Guo
parents 0750d107 37c67ee2
...@@ -4,10 +4,9 @@ ...@@ -4,10 +4,9 @@
%\section{Agent Capacity vs. Compositionality} %\section{Agent Capacity vs. Compositionality}
%\label{ssec:exp} %\label{ssec:exp}
We examine the relationship between agent capacity and the compositionality of We exploit the relationship between agent capacity and the compositionality of
symbolic language that emerged in our natural referential game with various symbolic language that emerged in our natural referential game.
vocabulary size. For various configuration of
For each configuration of
vocabulary size, we train the speaker-listener agents to emerge symbolic vocabulary size, we train the speaker-listener agents to emerge symbolic
language when varying the agent capacities, i.e., hidden layer size language when varying the agent capacities, i.e., hidden layer size
($h_{size}$), from 6 to 100. ($h_{size}$), from 6 to 100.
...@@ -47,7 +46,8 @@ Taking vocabulary size $|V|=4$ as an example, symbolic languages with ...@@ -47,7 +46,8 @@ Taking vocabulary size $|V|=4$ as an example, symbolic languages with
compositionality $MIS>0.99$ take only around 10\% over all the emerged symbolic compositionality $MIS>0.99$ take only around 10\% over all the emerged symbolic
languages, when $h_{size}<20$; the ratio reduces to 0\%$\sim$5\% when $h_{size}$ languages, when $h_{size}<20$; the ratio reduces to 0\%$\sim$5\% when $h_{size}$
increases to 40; the ratio reduces around 3\% when $h_{size}$ goes beyond 40. increases to 40; the ratio reduces around 3\% when $h_{size}$ goes beyond 40.
Especially, when $h_size$ is large enough (e.g., $>40$), high compositional $MIS>0.9$ reports similar results.
Notably, when $h_{size}$ is large enough (e.g., $>40$), high compositional
symbolic language is hard to emerge in a natural referential game, for symbolic language is hard to emerge in a natural referential game, for
easy-to-emerge low compositional symbolic language is sufficient in scenarios of easy-to-emerge low compositional symbolic language is sufficient in scenarios of
referential game. referential game.
...@@ -105,12 +105,12 @@ Figure~\ref{fig:bench}. ...@@ -105,12 +105,12 @@ Figure~\ref{fig:bench}.
Figure~\ref{fig:exp3} reports the accuracy of Listener, i.e., ratio of the correctly Figure~\ref{fig:exp3} reports the accuracy of Listener, i.e., ratio of the correctly
predicted symbols spoken by Speaker ($t=\hat(t)$), which varies with the predicted symbols spoken by Speaker ($t=\hat(t)$), which varies with the
training iterations under different agent capacities. training iterations under different agent capacities.
Figure~\ref{fig:exp3} (a) shows that when $h_size$ equals to 1, the agent capacity is Figure~\ref{fig:exp3} (a) shows that when $h_{size}$ equals to 1, the agent capacity is
too low to handle languages. Figure~\ref{fig:exp3} (b) shows that when $h_size$ too low to handle languages. Figure~\ref{fig:exp3} (b) shows that when $h_{size}$
equals to 2, agent can only learn $LA$ whose compositionality (i.e. \emph{MIS}) equals to 2, agent can only learn $LA$ whose compositionality (i.e. \emph{MIS})
is highest in all three languages. Combing these two observations, we can infer that is highest in all three languages. Combing these two observations, we can infer that
language with lower compositionality requires higher agent capacity to ensure communicating language with lower compositionality requires higher agent capacity to ensure communicating
successfully (i.e., $h_size$). Figure~\ref{fig:exp3} (c) to (h) show that the successfully (i.e., $h_{size}$). Figure~\ref{fig:exp3} (c) to (h) show that the
higher agent capacity causes a faster training process for all three languages, but the higher agent capacity causes a faster training process for all three languages, but the
improvement for different languages is quite different. improvement for different languages is quite different.
It is obvious that language with lower compositionality also requires higher agent It is obvious that language with lower compositionality also requires higher agent
......
...@@ -13,14 +13,18 @@ reinforcement learning~\cite{}. ...@@ -13,14 +13,18 @@ reinforcement learning~\cite{}.
%the environment setting. %the environment setting.
The quality of emergent symbolic language is typically measured by its \emph{compositionality}. The quality of emergent symbolic language is typically measured by its
\emph{compositionality}.
Compositionality is a principle that determines Compositionality is a principle that determines
whether the meaning of a complex expression (e.g, phrase), which is assembled out of a whether the meaning of a complex expression (e.g, phrase), which is assembled out of a
given set of simple components (e.g., symbols), can be determined by its given set of simple components (e.g., symbols), can be determined by its
constituent components and the rules that combines them~\cite{}. constituent components and the rule combining them~\cite{}.
\note{For example, the expression "AAAI is a conference'' consists of two \note{For example, the expression "AAAI is a conference'' consists of two
meaningful words ``AAAI'' and ``conference'', and a rule for definition (``is''). meaningful words ``AAAI'' and ``conference'', and a rule for definition (``is'').
More recently, measuring the compositionality \note{xxxxx}.} Compositionality is considered to be a source of the productivity,
systematicity, and learnability of language, and the reason why a language with finite
vocabulary can express almost infinite concepts.}
%More recently, measuring the compositionality \note{xxxxx}.}
%It %It
...@@ -35,14 +39,14 @@ More recently, measuring the compositionality \note{xxxxx}.} ...@@ -35,14 +39,14 @@ More recently, measuring the compositionality \note{xxxxx}.}
% %
\begin{figure}[t] \begin{figure}[t]
\centering \centering
\includegraphics[width=0.9\columnwidth]{fig/occupy} \includegraphics[width=0.99\columnwidth]{fig/Figure1_motivation.pdf}
\caption{\rmk{compositionality.}} \caption{}
\label{fig:induction} \label{fig:induction}
\end{figure} \end{figure}
Prior studies focus on achieving high compositional symbolic language Prior studies focus on achieving high compositional symbolic language
through \emph{deliberately handcrafted} inductions, e.g., small vocabulary through \emph{deliberately handcrafted} inductions, e.g., small vocabulary
sizes~\cite{}, memoryless~\cite{}, carefully constructed rewards~\cite{}, and sizes~\cite{}, memoryless~\cite{}, addtional rewards~\cite{}, constructed loss functions~\cite{}, and
ease-of-teaching~\cite{}. \note{The possible intuition is that high compositional symbolic ease-of-teaching~\cite{}. \note{The possible intuition is that high compositional symbolic
language cannot emerge without induction in existing multi-agent environment.} language cannot emerge without induction in existing multi-agent environment.}
Figure~\ref{fig:induction} reports the compositionality when training two agents in the widely-used Figure~\ref{fig:induction} reports the compositionality when training two agents in the widely-used
...@@ -57,19 +61,25 @@ environment and agents are sufficient for achieving high compositionality. ...@@ -57,19 +61,25 @@ environment and agents are sufficient for achieving high compositionality.
In this paper, we are the first work to achieve high compositional In this paper, we are the first work to achieve high compositional
symbolic language without any deliberately handcrafted induction. The key observation symbolic language without any deliberately handcrafted induction. The key observation
is that the internal \emph{agent capacity} plays a crucial role in the compositionality is that the internal \emph{agent capacity} plays a crucial role in the
of symbolic language, compositionality of symbolic language,
by thoroughly analyzing the compositionality after removing the inductions in by %thoroughly
analyzing the compositionality after removing the inductions in
the most widely-used listener-speaker referential game framework. the most widely-used listener-speaker referential game framework.
Concretely, the relationship between the agent capacity and the compositionality Concretely, the relationship between the agent capacity and the compositionality
of symbolic language is characterized both theoretically and experimentally. of symbolic language is characterized, with a novel mutual information-based
metric for the compositionality.
%both theoretically and experimentally.
%theoretically %theoretically
Regarding the theoretical analysis, we use the Regarding the theoretical analysis, we propose
\note{Markov Series Channel (MSC)~\cite{} to model the language transmission process and a %use the \note{Markov Series Channel (MSC)~\cite{} to model the language
novel mutual information-based metric to measure the compositionality quantitatively}. % transmission process and}
a novel mutual information-based metric to measure the compositionality quantitatively.
%experimentally %experimentally
Regarding the experimental validation, two different dedicated experiments, i.e., Regarding the experimental validation, we exploit the relationship between agent
\note{XXX and XXX, are utilized for XXX}. capacity and the compositionality of symbolic language that emerged
\emph{naturally} in our experiments.
%two different dedicated experiments, i.e., \note{XXX and XXX, are utilized for XXX}.
%Regarding the experimental validation, it is conducted on a listener-speaker %Regarding the experimental validation, it is conducted on a listener-speaker
%referential game framework with eliminated unnatural inductions. %referential game framework with eliminated unnatural inductions.
Both the theoretical analysis and experimental results lead to a counter-intuitive Both the theoretical analysis and experimental results lead to a counter-intuitive
......
...@@ -45,11 +45,6 @@ $t={[0,0,1],[0,1,0]}$ would be equal to $\hat{t}=[0,0,0,0,0,1]$ if they both mea ...@@ -45,11 +45,6 @@ $t={[0,0,1],[0,1,0]}$ would be equal to $\hat{t}=[0,0,0,0,0,1]$ if they both mea
circle''. circle''.
\subsection{Agent architecture}
\label{ssec:agent}
\begin{figure*}[t] \begin{figure*}[t]
\centering \centering
\includegraphics[width=1.8\columnwidth]{fig/Figure3_The_architecture_of_agents.pdf} \includegraphics[width=1.8\columnwidth]{fig/Figure3_The_architecture_of_agents.pdf}
...@@ -57,6 +52,13 @@ circle''. ...@@ -57,6 +52,13 @@ circle''.
\label{fig:agents} \label{fig:agents}
\end{figure*} \end{figure*}
\subsection{Agent architecture}
\label{ssec:agent}
Figure~\ref{fig:agents} shows the architecture of the constructed agents, Figure~\ref{fig:agents} shows the architecture of the constructed agents,
including the Speaker $S$ and Listener $L$. including the Speaker $S$ and Listener $L$.
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment