Commit 52ace7e5 by Zidong Du

abstract

parent 239786c8
...@@ -167,19 +167,19 @@ ...@@ -167,19 +167,19 @@
\begin{abstract} \begin{abstract}
The natural emergence of symbolic languages with high compositionality has The natural emergence of symbolic languages with high compositionality has
attracted extensive attentions from a broad range of communities. Existing attracted extensive attention from a broad range of communities. Existing
studies achieve high compositionality through \emph{deliberately handcrafted} studies achieve high compositionality through \emph{deliberately handcrafted}
inductions (e.g., small vocabulary sizes, carefully constructed distractors, inductions (e.g., small vocabulary sizes, carefully constructed distractors,
and ease-of-teaching) in multi-agent learning, which are unnatural. and ease-of-teaching) in multi-agent learning, which are unnatural.
Yet, few studies investigate the emergence of symbolic language with high compositionality in Yet, few studies investigate the emergence of symbolic language with high
\emph{``natural''} environments, i.e., without any deliberately handcrafted compositionality \emph{``naturally''}, i.e., without any deliberately handcrafted
inductions. inductions.
In this paper, we are the first to successfully achieve high compositional symbolic In this paper, we are the first to successfully achieve high compositional symbolic
language in a purely \emph{natural} environment. language in a purely \emph{natural} manner.
Initially, by thoroughly investigating the compositionality of symbolic Initially, by thoroughly investigating the compositionality of emerged symbolic
language emerged after removing the \emph{deliberately handcrafted} language after removing the \emph{deliberately handcrafted}
inductions, we observe that the agent capacity plays the key role in inductions, we observe that the agent capacity plays a key role in
compositionality. We further reveal and characterize the quantitative relationship compositionality. We further reveal and characterize the quantitative relationship
between the agent capacity and the compositionality of symbolic language both between the agent capacity and the compositionality of symbolic language both
theoretically and experimentally. The theoretical analysis is built on the MSC theoretically and experimentally. The theoretical analysis is built on the MSC
...@@ -189,8 +189,8 @@ ...@@ -189,8 +189,8 @@
with eliminated external environment factors. Both theoretical analysis and with eliminated external environment factors. Both theoretical analysis and
experimental results lead to a counter-intuitive conclusion that lower agent experimental results lead to a counter-intuitive conclusion that lower agent
capacity facilitates the emergence of symbolic language with higher capacity facilitates the emergence of symbolic language with higher
compositionality. Based on our conclusion, we are able to generate higher compositionality. \note{Based on our conclusion, we can generate higher
compositional symbolic language with a high probability. compositional symbolic language with a high probability.}
% The natural emergence of symbolic languages with high compositionality has % The natural emergence of symbolic languages with high compositionality has
......
...@@ -13,14 +13,15 @@ learning~\cite{}. For example, \note{XXXX} ...@@ -13,14 +13,15 @@ learning~\cite{}. For example, \note{XXXX}
%the environment setting. %the environment setting.
To evaluate the emerged symbolic language, compositionality is widely used and Compositionality is widely used and
taken as an important metric. Roughly, compositionality is a principle that the taken as an important metric to evaluate the emerged symbolic language.
meaning of a complex expression (e.g, phase), which is assembled out of the Originally, compositionality is a principle that
given set of simple components (e.g., symbols), whether the meaning of a complex expression (e.g, phase), which is assembled out of the
is determined by its constituent components and the rules that combines them~\cite{}. given set of simple components (e.g., symbols), can be determined by its
For example, the expression "AAAI is a conference'' is consists of two constituent components and the rules that combines them~\cite{}.
meaningful words "'' \note{For example, the expression "AAAI is a conference'' consists of two
meaningful words ``AAAI'' and ``conference'', and a rule of definition (``is'').}
More recently, measuring the compositionality \note{xxxxx}.
%It %It
...@@ -33,26 +34,47 @@ meaningful words "'' ...@@ -33,26 +34,47 @@ meaningful words "''
%extract information from a single symbol. %extract information from a single symbol.
% %
% %
%\begin{figure}[t] \begin{figure}[t]
% \centering \centering
% \includegraphics[width=0.9\columnwidth]{fig/occupy} \includegraphics[width=0.9\columnwidth]{fig/occupy}
% \caption{(a): The correspondence between symbol sequences ($s_0$, $s_1$) and (shape, \caption{\rmk{compositionality.}}
%color) pairs in a perfectly compostional language. $s_0$, $s_1$ in {a, b, c}, shape \label{fig:symbols}
%in {circle, square} and color in {red, blue, green}; (b): The correspondence \end{figure}
%between symbol sequences ($s_0$, $s_1$) and (shape, color) pairs in a language with
%low compostionality.} Prior studies focus on achieving high compositionality of the emergent language
% \label{fig:symbols} through \emph{deliberately handcrafted} inductions unnaturally, e.g., small vocabulary
% \end{figure} sizes~\cite{}, memoryless~\cite{}, carefully constructed rewards~\cite{}, and
ease-of-teaching~\cite{}. \note{xxxxxxx}
Prior studies focus on investigating how to affect the compositionality of the However, these unnatural inductions prevent us to better understand the mystery of
emergent language. Researchers have found that various environmental pressures the emergence of language and even intelligence among our pre-human ancestors.
would affect compositionality, e.g., small vocabulary sizes[3], memoryless[4],
carefully constructed rewards[5] and ease-of-teaching[6]. However, these works
only consider \emph{nurture} [7] (i.e., environmental factors), rather than Yet, few works investigate the emergence of symbolic language with high
\emph{nature} (i.e., hereditary factors from agents), when inducing or exploring compositionality in \emph{naturally} (i.e., without \emph{deliberately
the emergent language without exception. Moreover, some environmental pressures, handcrafted} inductions). As a results, it is never clear whether \emph{natural}
like regrading the entropy as an item of additional rewards, may be too ideal to environment and agent are sufficient for compositionality.
exist in the real world. In this work, we focus on generating high compositional symbolic language
naturally without any ``human'' induction.
Initially, we thoroughly analyze the compositionality of emerged symbolic
language after removing the \emph{deliberately handcrafted}
inductions. Figure~\ref{fig:comp} reports the compositionality when train two
agents in a listener-speaker referential game. It can be observed that \note{xxxxxxxx}.
Thus, it is challenging to achieve high compositionality without induction.
================
%Prior studies focus on investigating how to affect the
%compositionality of the emergent language. Researchers
%have found that various environmental pressures would affect compositionality,
%e.g., small vocabulary sizes[3], memoryless[4],
%carefully constructed rewards[5] and ease-of-teaching[6]. However, these works
%only consider \emph{nurture} [7] (i.e., environmental factors), rather than
%\emph{nature} (i.e., hereditary factors from agents), when inducing or exploring
%the emergent language without exception. Moreover, some environmental pressures,
%like regrading the entropy as an item of additional rewards, may be too ideal to
%exist in the real world.
In contrast to prior work, we investigate the compositionality of emergent In contrast to prior work, we investigate the compositionality of emergent
language from a new perspective, i.e., the agent capacity. Different from language from a new perspective, i.e., the agent capacity. Different from
......
...@@ -166,7 +166,6 @@ works can be roughly classified into two categories, referential game and ...@@ -166,7 +166,6 @@ works can be roughly classified into two categories, referential game and
multi-agent reinforcement learning (MARL), based on the environment setting. multi-agent reinforcement learning (MARL), based on the environment setting.
However, previous works, no matter referential game related or multi-agent reinforcement However, previous works, no matter referential game related or multi-agent reinforcement
learning related, ignore the independence of agents in \note{training or learning related, ignore the independence of agents in \note{training or
inference.} Agents usually share one or more of the model parameters, loss functions,
observation of environments, and thusly can be taken as one huge brain with observation of environments, and thusly can be taken as one huge brain with
multiple connected sensors (agents). In other words, previous works did not multiple connected sensors (agents). In other words, previous works did not
really achieve the emergence of symbolic language among \emph{multiple} agents. really achieve the emergence of symbolic language among \emph{multiple} agents.
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment