GSoC Weekly Check-In #9 (August 02)

Published: 08/04/2021

What did I do this week?
  • Made a pull request to Hub’s main branch. Abhinav and Dyllan gave me some feedback, and I’ve made adjustments based on it.
  • Wrote test cases and fixed comments
  • Dyllan proposed switching my current approach of storing hashes in a meta file called “hashlist.json” to a “linked tensor”. Every time a sample is appended to a tensor, its corresponding linked tensors also get a sample appended. This will allow us to treat hashes as a separate tensor (plus, get additional tensor features). I’ve created a basic framework but will need to run it by members of the team.
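
To make the linked tensor idea concrete, here is a minimal sketch of how I picture it. This is not Hub's actual API: the Tensor class, the link method, and the sample_hash helper are all hypothetical names made up for illustration. It also uses hashlib from the standard library instead of mmh3 so that it runs without extra dependencies.

```python
import hashlib


class Tensor:
    """Toy append-only tensor (hypothetical, not Hub's real Tensor class)."""

    def __init__(self, name):
        self.name = name
        self.samples = []
        self._links = []  # (linked tensor, transform) pairs

    def link(self, other, transform):
        """Register `other` as a linked tensor fed by `transform(sample)`."""
        self._links.append((other, transform))

    def append(self, sample):
        # Store the sample, then append a derived sample to every linked tensor.
        self.samples.append(sample)
        for linked, transform in self._links:
            linked.append(transform(sample))

    def __len__(self):
        return len(self.samples)


def sample_hash(sample: bytes) -> str:
    # Stand-in for murmurhash3: a short 64-bit BLAKE2b digest from the stdlib.
    return hashlib.blake2b(sample, digest_size=8).hexdigest()


images = Tensor("images")
hashes = Tensor("images_hash")
images.link(hashes, sample_hash)

# Appending to `images` also appends one hash per sample to `hashes`.
images.append(b"cat")
images.append(b"dog")
images.append(b"cat")
```

Because the hashes live in an ordinary tensor, the two tensors always have the same length, and duplicate samples show up as equal entries in the hash tensor.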

What will I do next week?
  • Changing the architecture will take most of my time this week. I need to implement linked tensors and sort out their details. One advantage is that I won’t need to adapt the hashes to transforms and version control, as the hashes will be treated as a tensor.

Did I get stuck anywhere?
This was my first ever PR to Hub, so I had to learn about the style guide, code coverage tools, CI tests, etc. I had some trouble running the CI tests after making my PR. One of the libraries I was using, mmh3 (murmurhash3), hadn’t been listed in the requirements.txt file.