programmer290399's Blog

Weekly Blog Post #12 [Aug. 23, 2021]

programmer290399
Published: 08/23/2021

What did you do this week?

  • This week I streamlined the SimpleModel class, which derives from the Model base class to which I recently added archive support.
  • I also made multiple minor changes [1,2] as requested by my mentor.
  • Other than that, I updated all the models to reduce code duplication and support archive storage.
  • While updating the models, I noticed problems saving and loading trained models for the spaCy model. I discovered that there was an issue in the archive creation code.
  • I quickly fixed the problem in a subsequent PR.
  • I also added a tutorial on how to save/load models as archives.
  • Last but not least, I added test cases to cover dffml.df.archive.
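
To illustrate the idea of saving and loading a model as an archive, here is a minimal sketch that uses only Python's standard tarfile module. It is not DFFML's archive code or API: the function names save_archive and load_archive, and the idea that a model is just a directory of files, are assumptions made for illustration.

```python
import tarfile
from pathlib import Path


def save_archive(model_dir: Path, archive_path: Path) -> None:
    """Pack everything under model_dir into a gzip-compressed tar archive.

    Hypothetical helper for illustration, not part of DFFML.
    """
    with tarfile.open(archive_path, "w:gz") as tar:
        # arcname="." stores member paths relative to model_dir
        tar.add(model_dir, arcname=".")


def load_archive(archive_path: Path, target_dir: Path) -> None:
    """Unpack an archive created by save_archive into target_dir.

    Only use this on archives you trust.
    """
    target_dir.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path, "r:gz") as tar:
        tar.extractall(target_dir)
```

Saving a directory with save_archive and then calling load_archive on the result should reproduce the same files in the target directory, which is the round trip a save/load test would check.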

What is coming up next?

  • This week marks the end of my GSoC'21 journey. I will continue contributing to DFFML in the future.

Did you get stuck anywhere?

  • Not really. I was stuck for some time figuring out why the spaCy model was not working, but I was able to spot the issue and fix it pretty quickly.

Weekly Check-In #11 [Aug. 16, 2021]

programmer290399
Published: 08/16/2021

What did you do this week?

  • At the beginning of this week, I received some feedback from my mentor regarding the flow and a few other things [1,2].
  • I worked on the changes requested by my mentor and fixed various bugs [1,2,3,4,5].
  • Other than that, I also fixed some test cases that were failing after the changes I made last week [1,2].

What is coming up next?

  • Testing and implementing/fixing archive storage support in all 11 models listed in this PR.
  • Other than that, I have noticed that some models have repetitive code blocks, and I will try to eliminate any such duplication.

Did you get stuck anywhere?

  • Nope. This week I spent most of my time reading the model code and analyzing where and what changes would be needed.

Weekly Blog Post #10 [Aug. 9, 2021]

programmer290399
Published: 08/11/2021

What did you do this week?

What is coming up next?

  • I will be working on the flow linked above and streamlining various other parts of the code to work properly with the archive code.
  • I will be reading through the code for various models to understand and plan where and what changes are needed to properly support archive storage.
  • Other than that, I will be fixing any bugs that might have been introduced in my recent commits.

Did you get stuck anywhere?

  • I got stuck when I was trying to condense the code for archive dataflow creation, but with some time, thought and basic math, I was able to significantly reduce the code's verbosity while improving its readability.

Weekly Check-In #9 [Aug. 2, 2021]

programmer290399
Published: 08/02/2021

What did you do this week?

  • This week I got two of my PRs merged after a long time:
    • ci : lint : commits : Adding ci job to validate commit message format #1076
    • util:log: Added log_time decorator #1101
  • However, merging PR #1076 introduced an issue in master: the commit lint command kept running forever and was killed by GitHub Actions for exceeding the run time limit.
  • I fixed this issue quickly in another PR, which simply skipped the linting test on the master branch.
  • Other than that, I continued working on what is possibly the final PR for my GSoC project, based on the inputs [1,2,3] received from my mentor.
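
For readers unfamiliar with commit message linting, here is a minimal sketch of what such a format check could look like. The rule it enforces (one or more colon-terminated prefixes followed by a description, as in the PR titles above) is an assumption inferred from those titles, not the rule implemented in PR #1076, and the names SUBJECT_PATTERN and is_valid_subject are hypothetical.

```python
import re

# Hypothetical rule, not DFFML's actual CI check: one or more
# colon-terminated prefixes, then a non-empty description.
SUBJECT_PATTERN = re.compile(r"^(?:[\w./-]+\s*:\s*)+\S.*$")


def is_valid_subject(message: str) -> bool:
    """Check only the first line (the subject) of a commit message."""
    lines = message.splitlines()
    subject = lines[0] if lines else ""
    return SUBJECT_PATTERN.match(subject) is not None
```

Under this assumed rule, "util:log: Added log_time decorator" passes, while a subject with no prefix, such as "Fixed stuff", is rejected.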

What is coming up next?

  • As per the feedback received from my mentor on the archive support part, I have to make some changes, which I will pick up in the coming week.
  • Also, the archive dataflow creation code is currently very verbose, and I will try to improve it.

Did you get stuck anywhere?

  • Not really. I have been thinking a lot about how the verbosity of the dataflow creation code can be reduced, how the various blocks of code that are pretty similar can be removed, and how the overall mechanism can be made cleaner and more Pythonic.

Weekly Blog Post #8 [July 26, 2021]

programmer290399
Published: 07/27/2021

What did you do this week?

  • This week I worked on two PRs:
    1. high_level: Move code into a directory and splitting out into files #1172
    2. WIP: model : Add Support for Archive Storage of Models #1174
  • The first one was related to an issue I faced earlier, and the second PR depended on the first one.
  • Before making the first PR, I opened an issue to get feedback from my mentor on the planned implementation.
  • After the first PR was merged, I continued working on the second one.
  • I faced some issues in both of these PRs, which I discuss later in this post.
  • Other than that, I rebased my other two PRs, which are ready to merge with all relevant CI tests passing:
    1. util:log: Added log_time decorator #1101
    2. WIP : ci : lint : commits : Adding ci job to validate commit message format #1076
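
As an illustration of what a timing decorator like log_time can do, here is a minimal sketch. It is not the implementation from PR #1101: the signature (a decorator factory taking a message), the logger name, and the log format are assumptions.

```python
import functools
import logging
import time

# Hypothetical logger name chosen for this sketch
LOGGER = logging.getLogger("log_time_sketch")


def log_time(message):
    """Decorator factory: log how long the wrapped function took."""

    def decorator(func):
        @functools.wraps(func)  # keep the wrapped function's name and docstring
        def wrapper(*args, **kwargs):
            start = time.perf_counter()
            try:
                return func(*args, **kwargs)
            finally:
                # Runs whether func returned or raised
                elapsed = time.perf_counter() - start
                LOGGER.debug("%s took %.4f seconds", message, elapsed)

        return wrapper

    return decorator


@log_time("Training")
def train(n):
    """Stand-in for an expensive operation."""
    return sum(range(n))
```

Calling train(10) returns 45 as usual, and a debug-level record with the elapsed time is emitted if logging is configured to show debug messages.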

What is coming up next?

  • I will be working on completing all the tasks in what is possibly my final PR, which fixes issue #662.
  • This will take a long time, as I have to make changes in all the models and update their respective tests and docs.
  • Other than that, I will be fixing a logging issue. I still have to think about how it can be fixed.

Did you get stuck anywhere?

  • Yes. Starting with the high_level splitting PR, I was stuck on a failing docstring test. I don't know how it was working before I made this change. After a ton of debugging, I gave up on finding the root cause and solved it with a pretty trivial fix.
  • Other than that, I was not really stuck, but I spent quite a lot of time thinking about how the tar support should be implemented in the Model class. I have pushed a rough implementation to get feedback from my mentor(s).