Author: Fantashit

Posted in Uncategorized

0%% GPU usage when using `hyperparameter_search`

## Environment info transformers version: 4.4.0.dev0 Platform: Linux-4.19.112+-x86_64-with-Ubuntu-18.04-bionic Python version: 3.6.9 PyTorch version (GPU?): 1.7.0+cu101 (True) Tensorflow version (GPU?): 2.4.1 (True) Using GPU in script?:…

Continue Reading
Posted in Uncategorized

Model Parallelism for Bert Models

Hi, I’m trying to implement Model parallelism for BERT models by splitting and assigning layers across GPUs. I took DeBERTa as an example for this….

Continue Reading
Posted in Uncategorized

Model not training beyond 1st epoch

Environment info transformers version: 4.4.0.dev0 Platform: Linux-4.19.112+-x86_64-with-Ubuntu-18.04-bionic Python version: 3.6.9 PyTorch version (GPU?): 1.7.0+cu101 (True) Tensorflow version (GPU?): 2.4.1 (True) Using GPU in script?: Yes…

Continue Reading
Posted in Uncategorized

Converting fairseq NMT to transformers misses model weight

Hi there, question about fairseq NMT model (FSMT) conversion. I tried to convert my own fairseq-nmt model (transformer_wmt_en_de) based on this conversion script. However, decoder.embed_out…

Continue Reading
Posted in Uncategorized

AutoTokenizer from pretrained BERT throws TypeError when encoding certain input

Environment info transformers version: 4.3.2 Platform: Arch Linux Python version: 3.9.1 PyTorch version (GPU?): 1.7.1, no Tensorflow version (GPU?): Not installed Using GPU in script?:…

Continue Reading
Posted in Uncategorized

Let NaNs pass through in OrdinalEncoder

We should allow NaNs to pass-through in OrdinalEncoder. One reason is for supporting categorical features with NaNs in the HistGradientBoosting estimators: For the native categorical…

Continue Reading
Posted in Uncategorized

Remove matplotlib warnings from examples executed when generating the documentation

There is a warning generated in our documentation for the 2 following examples: https://scikit-learn.org/stable/auto_examples/inspection/plot_permutation_importance.html#sphx-glr-auto-examples-inspection-plot-permutation-importance-py https://scikit-learn.org/stable/auto_examples/inspection/plot_permutation_importance_multicollinear.html#sphx-glr-auto-examples-inspection-plot-permutation-importance-multicollinear-py The warning is linked with the decoration of the yticks:…

Continue Reading
Posted in Uncategorized

Birch should be called BIRCH

C.f. the original paper. Zhang, T.; Ramakrishnan, R.; Livny, M. (1996). “BIRCH: an efficient data clustering method for very large databases”. Proceedings of the 1996…

Continue Reading
Posted in Uncategorized

write_videofile() appears fundamentally broken

Moviepy function write_videofile() works with static filename parameters, (i.e. “this_is_my_file.mp4”) but does not work as intended when variables are passed into the filename parameter (i.e….

Continue Reading
Posted in Uncategorized

Data movie was created

Love the library – been using it for a lot of utilities. Is it possible to use Moviepy to get the video information like when/where…

Continue Reading