Hi and thanks in advance for any help and advice!
I am trying to fine-tune a transformer model from the huggingface library and do some pre-processing using spacy. When I run the script, I get the following warning, which seems to be related to the Open MPI configuration. I can run the code despite the warning, but I’m wondering whether this impacts performance… Does anyone have an idea how to solve this issue?
--------------------------------------------------------------------------
WARNING: No preset parameters were found for the device that Open MPI
detected:
Local host: (I omit this info here)
Device name: mlx5_0
Device vendor ID: (I omit this info here)
Device vendor part ID: (I omit this info here)
Default device parameters will be used, which may result in lower
performance. You can edit any of the files specified by the
btl_openib_device_param_files MCA parameter to set values for your
device.
NOTE: You can turn off this warning by setting the MCA parameter
btl_openib_warn_no_device_params_found to 0.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
No OpenFabrics connection schemes reported that they were able to be
used on a specific port. As such, the openib BTL (OpenFabrics
support) will be disabled for this port.
Local host: (I omit this info here)
Local device: mlx5_0
Local port: 1
CPCs attempted: udcm
--------------------------------------------------------------------------
2023-05-12 14:59:25.306866: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:981] successful
NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA
node zero
2023-05-12 14:59:25.307798: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:981] successful
NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA
node zero
2023-05-12 14:59:25.307949: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:981] successful
NUMA node read from SysFS had negative value (-1), but there must be at least one NUMA node, so returning NUMA
node zero