Hello,

Apparently, the recently introduced mxnet.text API
(https://github.com/apache/incubator-mxnet/pull/8763) causes test failures.
It would be great if the following two issues could be investigated:
http://jenkins.mxnet-ci.amazon-ml.com/blue/rest/organizations/jenkins/pipelines/incubator-mxnet/branches/master/runs/175/nodes/336/steps/629/log/?start=0

test_text.test_glove ... FAIL
Traceback (most recent call last):
  File "C:\Anaconda3\envs\py2\lib\site-packages\nose\case.py", line 197, in runTest
    self.test(*self.arg)
  File "C:\jenkins_slave\workspace\ut-python-cpu\tests\python\unittest\test_text.py", line 125, in test_glove
    'glove', pretrained_file_name='glove.6B.50d.txt')
  File "C:\jenkins_slave\workspace\ut-python-cpu\pkg_vc14_cpu\python\mxnet\text\embedding.py", line 371, in create
    return create_text_embedding(embedding_name, **kwargs)
  File "C:\jenkins_slave\workspace\ut-python-cpu\pkg_vc14_cpu\python\mxnet\registry.py", line 163, in create
    return registry[name](*args, **kwargs)
  File "C:\jenkins_slave\workspace\ut-python-cpu\pkg_vc14_cpu\python\mxnet\text\embedding.py", line 538, in __init__
    self._load_embedding(pretrained_file_path, ' ', init_unknown_vec)
  File "C:\jenkins_slave\workspace\ut-python-cpu\pkg_vc14_cpu\python\mxnet\text\embedding.py", line 201, in _load_embedding
    % (line_num, token, len(elems), vec_len)
AssertionError: At line 321803 of the pre-trained token embedding file: the dimension of token nonslip is 7 but the dimension of previous tokens is 50. Dimensions of all the tokens must be the same.

-------------------- >> begin captured logging << --------------------
root: INFO: Loading pre-trained token embedding vectors from C:\Windows\system32\config\systemprofile\.mxnet\embeddings\glove\glove.6B.50d.txt
--------------------- >> end captured logging << ---------------------
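
One way to narrow this down might be to scan the cached embedding file for rows whose vector length differs from the rest. The log only shows the mismatch, so whether the file is truncated or corrupt is an assumption. The sketch below is a standalone check, not part of mxnet.text; the function name find_inconsistent_rows is hypothetical, and it assumes space-delimited rows of the form "token v1 v2 ...".

```python
import io


def find_inconsistent_rows(lines, delimiter=" "):
    """Return (line_num, token, dim) for every row whose vector length
    differs from that of the first data row."""
    expected_dim = None
    bad_rows = []
    for line_num, line in enumerate(lines, start=1):
        elems = line.rstrip().split(delimiter)
        token, vec = elems[0], elems[1:]
        if len(vec) <= 1:
            # Probably a header such as "<vocab_size> <dim>" or an empty
            # line; skip it instead of treating it as a data row.
            continue
        if expected_dim is None:
            expected_dim = len(vec)
        elif len(vec) != expected_dim:
            bad_rows.append((line_num, token, len(vec)))
    return bad_rows


# Tiny in-memory stand-in for an embedding file (made-up values).
sample = io.StringIO("2 3\nthe 0.1 0.2 0.3\nnonslip 0.4 0.5\n")
print(find_inconsistent_rows(sample))
```

Running the same function over an open file handle for glove.6B.50d.txt on the CI slave should list the offending rows, if any.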


Also, test_fasttext emits a warning about a skipped line:
test_text.test_fasttext ...
C:\jenkins_slave\workspace\ut-python-cpu\pkg_vc14_cpu\python\mxnet\text\embedding.py:188: UserWarning: At line 1 of the pre-trained text embedding file: token 111051 with 1-dimensional vector [300.0] is likely a header and is skipped.
  'skipped.' % (line_num, token, elems))



Thank you

-Marco
