Commit 86eb1a7e by Zidong Du

~

parent bfc96b16
...@@ -109,11 +109,11 @@ Figure~\ref{fig:exp3} (a) shows that when $h_{size}$ equals to 1, the agent capa ...@@ -109,11 +109,11 @@ Figure~\ref{fig:exp3} (a) shows that when $h_{size}$ equals to 1, the agent capa
too low to handle languages. Figure~\ref{fig:exp3} (b) shows that when $h_{size}$ too low to handle languages. Figure~\ref{fig:exp3} (b) shows that when $h_{size}$
equals to 2, agent can only learn $LA$ whose compositionality (i.e. \emph{MIS}) equals to 2, agent can only learn $LA$ whose compositionality (i.e. \emph{MIS})
is highest in all three languages. Combing these two observations, we can infer that is highest in all three languages. Combing these two observations, we can infer that
language with lower compositionality requires higher agent capacity to ensure communicating language with lower compositionality requires higher agent capacity to ensure
successfully (i.e., $h_{size}$). Figure~\ref{fig:exp3} (c) to (h) show that the communicating successfully (i.e., $h_{size}$).
Additionally, Figure~\ref{fig:exp3} (c)$\sim$(h) show that the
higher agent capacity causes a faster training process for all three languages, but the higher agent capacity causes a faster training process for all three languages, but the
improvement for different languages is quite different. improvement for different languages is quite different. It is obvious that language with lower compositionality also requires higher agent
It is obvious that language with lower compositionality also requires higher agent
capacity to train faster. capacity to train faster.
......
...@@ -40,9 +40,7 @@ from $V$. The Listener $L$ receives $s$ and output predicted result $\hat{t}$, ...@@ -40,9 +40,7 @@ from $V$. The Listener $L$ receives $s$ and output predicted result $\hat{t}$,
a single word (one-hot vector) selected from the Cartesian product of set two $V$s a single word (one-hot vector) selected from the Cartesian product of set two $V$s
($V\times V$), which representing all the meanings of two combined words from $V$. ($V\times V$), which representing all the meanings of two combined words from $V$.
Please note that since $t$ and $\hat{t}$ have different length, we say Please note that since $t$ and $\hat{t}$ have different length, we say
$t=\hat{t}$ if $t$ expresses the same meaning as $\hat{t}$, e.g., $t=\hat{t}$ if $t$ expresses the same meaning as $\hat{t}$, e.g., ``red circle''.
$t={[0,0,1],[0,1,0]}$ would be equal to $\hat{t}=[0,0,0,0,0,1]$ if they both mean ``red
circle''.
\begin{figure*}[t] \begin{figure*}[t]
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment