We won’t be using Ipython in this example, as it kept crashing (& sometimes taking down Firefox). You will find the actual emails are in MIME format. Once you download the files, spend some time looking at their structure, and how they are arranged. On Windows, you’ll need 7zip to unzip them. Since the collapse was only 15 years ago(its 2016 now), I guess every reading this has heard of Enron, a company that was the top biggest company in the world one day, and bankrupt the next.ĭownload the emails from here. In this first video, I given an introduction to Enron, and the email corpus. The question always is: Where do we even start? Normally, emails are very sensitive, and rarely released to the public, but because of the shocking nature of Enron’s collapse, everything was released to the public.īecause it is so large, it makes analysis complicated. Almost half a million files spread over 2.5 GB. ![]() The Enron Email Corpus is one of the biggest email data sources in the world. Build a Spam Filter using the Enron Corpus ![]() Introduction to NLP and Sentiment AnalysisĦ.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |