What did I do this week?
This week, I had my mid GSoC presentation where I talked about my progress so far contributing to Hub. I also started working with the Hub 2.0 codebase by familiarising myself with the chunking + compressing mechanism.
What will I do next week?
This week, I’ll start working on a fork of the Hub main branch. I will generate hashes when a dataset is being uploaded and will be using those hashes to compare with existing datasets. This will be an ‘opt-in’ mechanism i.e the user will have to choose if they want to generate hashes or not. After that, the eventual goal is to inform the user if a similar dataset already exists to prevent reupload.
Did I get stuck anywhere?
No.
This week, I had my mid GSoC presentation where I talked about my progress so far contributing to Hub. I also started working with the Hub 2.0 codebase by familiarising myself with the chunking + compressing mechanism.
What will I do next week?
This week, I’ll start working on a fork of the Hub main branch. I will generate hashes when a dataset is being uploaded and will be using those hashes to compare with existing datasets. This will be an ‘opt-in’ mechanism i.e the user will have to choose if they want to generate hashes or not. After that, the eventual goal is to inform the user if a similar dataset already exists to prevent reupload.
Did I get stuck anywhere?
No.