Issues: kimiyoung/transformer-xl
Issues list
How to train transformer-xl for new datasets (Specifically Hindi)
#152 opened May 28, 2024 by SandyPanda-MLDL
Why do you pass query, key, and value through the same fc_layer in transformer_xl model?
#151 opened Nov 2, 2023 by wonjunchoi-arc
can you provide an example program running with Python script?
#134 opened Mar 30, 2021 by zane-star-bot
Question: why is relative positional encoding computed with length M vs. L+M in the paper ?
#132 opened Mar 18, 2021 by gdoras
Possibly Incorrect Calculation of Perplexity in Pytorch Implementation
#131 opened Mar 18, 2021 by shaan97
RuntimeError: CUDA error: CUBLAS_STATUS_ALLOC_FAILED when calling `cublasCreate(handle)`
#125 opened Dec 7, 2020 by demdecuong
Can someone please tell me on what dataset was transformer-XL pre-trained on?
#124 opened Nov 18, 2020 by pathak-aman