Google Summer of Code: Week 2
Hello, all.
This is my third blog post in this category (Google Summer of Code 2018).
The second week is over now. Here are the important points from it.
1. 70GB model for Mandarin
So the right way to do the installation should be:
- Step 1: add the following line to the /etc/hosts file
- Step 2: run the command in the directory /mnt/rds/redhen/gallina/models
We fixed all the bugs and successfully downloaded the 70GB model for Mandarin on CWRU HPC on May 22nd.
2. Use Singularity to run Python files
I've figured out how to use Singularity to run Python files with the help of Prof. Steen and Prof. Michael. Here are the steps:
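As a rough sketch of the general pattern, running a Python file inside a container comes down to a `singularity exec <image> python <script>` call. The helper below only builds that command line. The function name, the image name `paddle.simg`, and the script name `train.py` are hypothetical placeholders, not the files we actually used.

```python
import shlex


def build_singularity_command(image, script, script_args=()):
    """Return the argv list for running a Python script inside a Singularity image."""
    # `singularity exec <image> <command...>` runs a command inside the container.
    return ["singularity", "exec", image, "python", script, *script_args]


if __name__ == "__main__":
    # Hypothetical image and script names, for illustration only.
    cmd = build_singularity_command("paddle.simg", "train.py", ["--epochs", "1"])
    print(" ".join(shlex.quote(part) for part in cmd))
    # To actually run it (requires Singularity on the host):
    # import subprocess; subprocess.run(cmd, check=True)
```

Building the command as a list rather than a single string avoids shell quoting problems when script arguments contain spaces.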
3. DeepSpeech2 on PaddlePaddle
My project is based on the PaddlePaddle framework. Getting started with PaddlePaddle on the HPC caused some problems.
3.1 First try: PaddlePaddle Docker appliance on the CWRU HPC.
Initially, the team planned to try the PaddlePaddle Docker appliance on the CWRU HPC. Prof. Peter gave it a try, but it failed because the makers of the Docker image put files in /root, which is not accessible in Singularity unless you have root rights on the host machine. The details are shown below.
Then we can run it:
From inside the container, we get the error message:
3.2 Second try: Install PaddlePaddle from source
Since the Docker approach failed, we switched to installing PaddlePaddle from source on the HPC, with Singularity support.
What we'd like to see is PaddlePaddle and DeepSpeech2 inside a Singularity container, so that we can deploy it on different accounts and different HPCs. The method we'd like to use is Singularity Hub. Let's see if we can make the PaddlePaddle/DeepSpeech2 installation work in a reproducible manner on Red Hen Lab's GitHub account using Singularity recipes on Singularity Hub. The goal is to have a system where we can upgrade the image simply by modifying the recipe, make use of the automated build system, and deploy at will in multiple locations.
For instance, your recipe can specify "From: ubuntu:latest". If we maintain these recipes on Red Hen Lab's GitHub account, we can easily keep them updated, maintain different versions for different purposes, and pull them in as needed for our HPC pipelines.
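To make the recipe idea concrete, here is a minimal sketch that renders a Singularity recipe as text. The `Bootstrap: docker` and `From:` header lines and the `%post` section follow the Singularity recipe format. The function name and the example `%post` commands are assumptions for illustration only, not our actual PaddlePaddle recipe.

```python
def render_recipe(base_image="ubuntu:latest", post_commands=()):
    """Render a minimal Singularity recipe that bootstraps from a Docker image."""
    lines = ["Bootstrap: docker", "From: " + base_image]
    if post_commands:
        # Commands under %post run once at build time, inside the new image.
        lines += ["", "%post"]
        lines.extend("    " + command for command in post_commands)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical build steps; a real recipe would list the PaddlePaddle dependencies.
    print(render_recipe(post_commands=["apt-get update", "apt-get install -y python-pip"]))
```

Changing the base image or a build step then means editing one small text file and letting the automated build produce a new image.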
We are still working on it and hope to succeed soon!
Reference:
- Singularity Hub: a likely useful service for creating, maintaining, and distributing Singularity images
4. Conclusion
This week was great, but I contributed only a little because of an English exam. Next week (the third week) will be more about building Singularity images. The third week ends on June 3. Till then, happy coding and cheers. See you later.