Hello,
apparently, the recently introduced mxnet.text API introduces test failures
https://github.com/apache/incubator-mxnet/pull/8763. It would be great if
the two following issues could be investigated:
http://jenkins.mxnet-ci.amazon-ml.com/blue/rest/organizations/jenkins/pipelines/incubator-mxnet/branches/master/runs/175/nodes/336/steps/629/log/?start=0
test_text.test_glove ... FAIL
Traceback (most recent call last):
File "C:\Anaconda3\envs\py2\lib\site-packages\nose\case.py", line 197, in
runTest
self.test(*self.arg)
File
"C:\jenkins_slave\workspace\ut-python-cpu\tests\python\unittest\test_text.py",
line 125, in test_glove
'glove', pretrained_file_name='glove.6B.50d.txt')
File
"C:\jenkins_slave\workspace\ut-python-cpu\pkg_vc14_cpu\python\mxnet\text\embedding.py",
line 371, in create
return create_text_embedding(embedding_name, **kwargs)
File
"C:\jenkins_slave\workspace\ut-python-cpu\pkg_vc14_cpu\python\mxnet\registry.py",
line 163, in create
return registry[name](*args, **kwargs)
File
"C:\jenkins_slave\workspace\ut-python-cpu\pkg_vc14_cpu\python\mxnet\text\embedding.py",
line 538, in __init__
self._load_embedding(pretrained_file_path, ' ', init_unknown_vec)
File
"C:\jenkins_slave\workspace\ut-python-cpu\pkg_vc14_cpu\python\mxnet\text\embedding.py",
line 201, in _load_embedding
% (line_num, token, len(elems), vec_len)
AssertionError: At line 321803 of the pre-trained token embedding file: the
dimension of token nonslip is 7 but the dimension of previous tokens is 50.
Dimensions of all the tokens must be the same.
-------------------- >> begin captured logging << --------------------
root: INFO: Loading pre-trained token embedding vectors from
C:\Windows\system32\config\systemprofile\.mxnet\embeddings\glove\glove.6B.50d.txt
--------------------- >> end captured logging << ---------------------
Also, we got a skipped test:
test_text.test_fasttext ...
C:\jenkins_slave\workspace\ut-python-cpu\pkg_vc14_cpu\python\mxnet\text\embedding.py:188:
UserWarning: At line 1 of the pre-trained text embedding file: token 111051
with 1-dimensional vector [300.0] is likely a header and is skipped.
'skipped.' % (line_num, token, elems))
Thank you
-Marco