gavishpoddar's Blog

Part of the journey is the end | Final Report

gavishpoddar
Published: 08/24/2021

Hi, community

Part of the journey is the end. It is time for me to work on my final work report for final evaluation of Google Summer of Code 2021. This week, I will devote my time mainly to write my final report.

Final Work Submission Report

Hello Everyone! My name is Gavish Poddar and I'm excited to tell you about my GSoC journey. For the past couple of months, I have been working on an awesome project dateparser. The dateparser aims to parse datetime from a string.

My GSoC journey would not have been successful without the guidance of my mentors Marc Hernández, Konstantin Lopuhin and Kishan Mehta.

What I Have Learned?

The whole GSoC journey was full of learning thanks to my mentors. I learned how to find good open source dependencies to include in our project. I tried my hands on improving code coverage and writing tests for the code. I learned how to optimize code and the need for extensive research before feature addition.

What I Have Contributed?

As mentioned in my proposal I worked on the implementation of the Optional Language Detection for dateparser and fixing as many issues as possible in the search_dates function of the dateparser.

Optional Language Detection

PR - Optional Language Detection

Implemented optional language detection to improve language detection. This allows to plug in any language detection library with the dateparser. Out of the box, dateparser supports two libraries fasttext and langdetect. The optional language detection works with both parse and search_dates. This PR also introduces a new setting DEFAULT_LANGUAGES which is used if no language is detected by default language detection and the optional language detection.

Reimplimenting search_dates (extended goal)

PR - Reimplimenting search_dates

A reimplemented and simplified search_dates improves the results and fixes many issues. The entire search_dates is newly implemented and would be easier to maintain. This PR introduces a new feature search_first_date which returns the first date in the given string. This PR also fixes around 13 issues.

Other search_dates improvements


Adding support for date-related objects last decade, next decade, etc in search_date. This PR fixes 1 issue.

PR - Improvements in locale:translate_search fixes


Adding support search_date period separator support. Date string like 23.12.2000 can be parsed. This PR fixes 5 issues.

PR - search_date period separator support


Other Important Details

As part of our GSoC project, Python Software Foundation requires us to post a weekly blog where we usually post about what we have done in the week and what is coming up next. We can also write about any blockages or issues we are facing. I have also written my weekly blogs so if you want to know weekly details of my project you can refer them here.

Weekly Blogs

Future Work and Final Note

The project is very actively maintained the new search_dates and my contributions would improve the library. The main goal of the proposal is achived with the implimentation of the optional language detection and the PR is mergeable. I plan to keep working on the project and contribute as much as I can. Contribute to the search_dates function of the library (dateparser) would be my primary goal.

It was overall a wonderful experience and I learned a lot.

I would like to thank Google, Python Software Foundation and Zyte for providing me with the opportunity and my mentors Marc Hernández, Konstantin Lopuhin and Kishan Mehta.

Thank you for reading!

View Blog Post

GSoC Weekly Check-In #6

gavishpoddar
Published: 08/17/2021

Hi, community

1. What did you do this week?

This week was a tots of coding and finalising the PR's and reviews from mentors.

This week I also created 1 new PR which solves 5 issues and extends support of search_dates function.

2. What is coming up next?

This week I look forward to solve issues from code reviews and wrap up all the workings during the GSoC.

3. Did you get stuck anywhere?

Nothing major, it was a great week with lots of coding.

Thank you for reading!

View Blog Post

GSoC Blog Post #5

gavishpoddar
Published: 08/10/2021

Hi, community

1. What did you do this week?

In this week I worked towards the completion and the PR is now mergeable we did a lots of micro improvements on the PR and a lots of reviews, suggestions and discussions. The PR with reimplementation of the search_dates is also complete.

2. What is coming up next?

In this week I will wrap up the Optional Language Detection PR and prepare for the final submission.

3. Did you get stuck anywhere?

No.

Thank you for reading!

View Blog Post

GSoC Weekly Check-In #5

gavishpoddar
Published: 08/02/2021

Hi, community

1. What did you do this week?

This week we did a lots for code review and testing the language detection PR and improvising the search_dates function along with adding few new functions to the new search_dates. In the language detect function we tested and practicality checked most of the lines of code to make sure that they may not cause other issues. This week I also created 3 micro PR's .

2. What is coming up next?

In this week I look forward to complete the implementation of the new search_dates and take reviews form my mentors for improvements so that both the major PR's could be approved within the GSoC timeline.

3. Did you get stuck anywhere?

I didn't got stuck with any thing but this week was a lots of coding and feedback from my mentors that took and lot of time. Few tests cases are still failing but I expect to fix them within this week.

Thank you for reading!

View Blog Post

GSoC Blog Post #4

gavishpoddar
Published: 07/26/2021

Hi, community

1. What did you do this week?

I work on search_dates this week included creation of many files new supporting functions, created previous tests work with new implementation.

Debugged various issues, checked all settings. And creating docs and tests for language detection.

2. What is coming up next?

In the this week, I will try to complete the implementation of the language detection PR and complete the implementation of the new search_dates.

3. Did you get stuck anywhere?

I got stuck with a tox test (python segmentation fault) now its fixed.

Thank you for reading!

View Blog Post
DJDT

Versions

Time

Settings from gsoc.settings

Headers

Request

SQL queries from 1 connection

Static files (2312 found, 3 used)

Templates (28 rendered)

Cache calls from 1 backend

Signals

Log messages