Aloaha PDF2TXT API

The Aloaha PDF2TXT API is included in all Aloaha PDF tools.

Please find below a code sniplet to demonstrate the usage:


Dim pdf
Dim pdffile
Dim pageno

Set pdf = CreateObject("aloahapdf.edit")

pdffile="c:\test.pdf"
pageno=1

If pdf.load_pdf_to_mem(CStr(pdffile)) = true Then
          MsgBox pdf.get_page_text(CLng(pageno),0)
End If

Set pdf = nothing


PDF SPAM - the latest trend!

Recent research shows that between 60% and 90% of emails received nowadays is considered as spam. The Spam headache for administrators is constantly increasing because spammers are always at least one step ahead of anti-spam software vendors. Spammers registered a considerable success with image spam because the anti-spam software industry failed to come out with proper counter-measures to stop image spam. 

Based on their success with image spam spammers improved their methods in June 2007 with a new technique that is even more problematic than image spam. Instead of embedding the image within the email itself, they ‘repackaged’ it within an attachment using one of the most common file formats in use today – a PDF file. 

Currently that means that spammers are not only one step ahead but at least two steps ahead. The reason is that anti-spam vendors do not understand PDF technology and introduce useless filters such as blocking emails which contain a PDF smaller than X KB. How can someone seriously consider this an efficient filter?
Those recently introduced “PDF Filter” do harm organisations more than the PDF SPAM itself. Most businesses today transfer essential documents using the PDF format and cannot afford even one missed document. 

To address the PDF spam threat, administrators need to press anti-spam vendors to introduce proper PDF analysis or just drop useless filters. 

With the Aloaha PDF SDK it is very easy and fast to analyse a PDF document. For example the PDF can be converted to ASCII to find out if it is an image only PDF. PDF SPAM contains currently never more than one page and one PDF Layer. Even the media size of PDF SPAMs usually is a custom format and not A4, Letter etc.
In IIS/Exchange it is extremely easy to write a so called event sink/script sink which could be used to analyse emails and drop image SPAM. 

Please contact aloaha@wrocklage.de in case you need assistance for the implementation of your own script sinks. We can also offer you high performance C++ Event sinks. 




Please ask...

Although we really tried hard, there are always questions left open.

Please send us your question.
We would be glad to answer it.


Home / Knowledge Base / PDF SDK / PDF2TXT
create PDF Software

PDF2TXT

Any questions?
Aloaha Software - PDF SDK / Freeware APIs / PDF2TXT / PDF SPAM / PDF Image SPAM
ProductsKnowledge BaseShopSupportPress
Deutsche VersionEnglish version