So I took this week off to prepare for college tests. Meanwhile, I have been working on license extraction from copyright strings. I found a package called scancode toolkit, which has a sub-module called licensedcode that can help with this objective. But since it's in an early production phase, I am coming across a lot of bugs: for example, the scancode library does not work on py3.7 or py3.8, and its api module is also buggy, so I am looking into it. I hope I will finish this by the end of this week. :P