Search results
Results From The WOW.Com Content Network
Ling-Spam Dataset Corpus containing both legitimate and spam emails. Four version of the corpus involving whether or not a lemmatiser or stop-list was enabled. 2,412 Ham 481 Spam Text Classification 2000 [38] [39] Androutsopoulos, J. et al. SMS Spam Collection Dataset Collected SMS spam messages. None. 5,574 Text Classification 2011 [40] [41]
Kaggle is a data science competition platform and online community for data scientists and machine learning practitioners under Google LLC.Kaggle enables users to find and publish datasets, explore and build models in a web-based data science environment, work with other data scientists and machine learning engineers, and enter competitions to solve data science challenges.
Bogofilter examines tokens in the message body and header, and refers to wordlists stored by BerkeleyDB, SQLite or QDBM to calculate a probability score that a new message is spam. Bogofilter provides processing for plain text and HTML and supports reading multi-part MIME message including base64, quoted-printable , and uuencoded text or HTML.
For these approximations to make sense, the set of learned messages needs to be big and representative enough. It is also advisable that the learned set of messages conforms to the 50% hypothesis about repartition between spam and ham, i.e. that the datasets of spam and ham are of same size. [8]
The Enron Corpus is a database of over 600,000 emails generated by 158 employees [1] of the Enron Corporation in the years leading up to the company's collapse in December 2001. The corpus was generated from Enron email servers by the Federal Energy Regulatory Commission (FERC) during its subsequent investigation. [ 2 ]
An email box folder filled with spam messages.. Email spam, also referred to as junk email, spam mail, or simply spam, is unsolicited messages sent in bulk by email ().The name comes from a Monty Python sketch in which the name of the canned pork product Spam is ubiquitous, unavoidable, and repetitive. [1]
Apache SpamAssassin is a Perl-based application (Mail::SpamAssassin in CPAN) which is usually used to filter all incoming mail for one or several users.It can be run as a standalone application or as a subprogram of another application (such as a Milter, SA-Exim, Exiscan, MailScanner, MIMEDefang, Amavis) or as a client (spamc) that communicates with a daemon (spamd).
QSO logger for Emacs with a customizable dynamic form for rapid data entry into an ADIF file. Suitable for general logging or contesting, it can be customized to use almost any combination of fields in the ADIF 3.1.4 specification. Ham Radio Deluxe: Proprietary Windows