h4k.com

2008-04-20

WordPress Automatic Update Plugin

Filed under: Web Development — admin @

Updating the popular WordPress blog software to the newest version can be automated using the WordPress Automatic Update plugin. The plugin takes care of everything including creating backups of the databases and even puts up an upgrade in progress notification on the front page. As with any software it is important to apply the relevant security updates on a timely basis so to help prevent the site from being attacked or used to unknowingly distribute exploits to visitors of the site. It seems that this type of update automation should be built in to the WordPress software by default but it is not so this is the plugin should be installed.

http://wordpress.org/extend/plugins/wordpress-automatic-upgrade/


2008-01-13

Firebug Web Development and Analysis Firefox Extension

Filed under: Firefox, Web Development — admin @

Firebug is a handy web development or research tool that can be used to analyze each individual component of a web page. From the home page:

Firebug integrates with Firefox to put a wealth of development tools at your fingertips while you browse. You can edit, debug, and monitor CSS, HTML, and JavaScript live in any web page.

https://addons.mozilla.org/en-US/firefox/addon/1843


2007-12-09

Hiding Email Addresses from Spammers in a Split Image

Filed under: Spam, Web Development — admin @

Posting your email address on the Internet unfortunately invites spammers to your email box. Some people recommend putting your email address into an image to prevent spammers from easily getting it. It is possible for spammers to use Optical Character Recognition (OCR) applications to translate those addresses into textual spam lists. This can be defeated by splitting the email image into multiple pieces.

First, create an image containing your email address in an image manipulation application such as these image manipulation applications. Divide the image into a couple separate images; three should be fine. When posting your email address put up multiple image links next to each other. The image will look like a single word but a spam bot would only get a third of the email address out of each image. For example:

<img src=”http://h4k.com/images/emx2.jpg” /><img src=”http://h4k.com/images/erq5.jpg” /><img src=”http://h4k.com/images/est8.jpg” />

The html code should be on a single line in the html file. See the contact page for an example.


2007-12-03

Methods of Restricting Web Site Caching

Filed under: Privacy, Web Development — admin @

Restricting web site caching from protocol-following Internet spidering applications can be accomplished by adding a robots.txt file to a web site and inserting metadata into the HTML code of each web page. Robots.txt is a web standard to instruct spiders and bots to what they can and cannot access. Not all bots follow the robots.txt standards (see here) but many do so it is still a good idea to have a robots.txt file.

For a robot.txt file that blocks all spidering add the following lines to a blank plain text file:

User-Agent: *
Disallow: /

Save the file as “robot.txt”, then upload to the top directory of the web site. It needs to be accessible at the top level of the web site. For example, the h4k.com robots.txt file can be found here: http://h4k.com/robots.txt

The metadata tags below need to inserted on each web page that needs the caching to be restricted. If the web site runs Wordpress software it can automatically insert the metadata tags into all HTML files by editing the header php code. Log in to the Wordpress console then go to Presentations –> Theme Editor –> and select “Header” under the “theme files” listing. Directly underneath the area where it says:

<title><?php bloginfo(‘name’); ?><?php wp_title(); ?></title>

insert these metadata tags:

<META name=”ROBOTS” content=”NONE”>
<META http-equiv=”CACHE-CONTROL” content=”NO-CACHE”>
<META http-equiv=”EXPIRES” content=”0″>

* Note: you will need to currently replace the quotes with newly typed ones. The web page is not displaying the proper characters for the quotes (yet).

This also instructs web spiders to not look at anything, instructs the web browser not to cache the web page, and has the web page set to expire immediately.

Alternatively, you may want web crawlers and search engines to be able to see your web site but not make a copy of the content. To set up a web page to allow web spiders and search engines to index the information but not make a cached copy of the web page the ROBOTS metatag should be set to “NOARCHIVE”.

<META name=”ROBOTS” content=”NOARCHIVE”>

Lots of good information about metatags can be found here.

It is also important to have a well constructed robots.txt file to prevent the leech bots from wasting bandwidth on your site and preventing them from making copies of your site. Wikipedia.org has a well documented robots.txt file that the h4k.com robots.txt file is based off of.





h4k.com