Skip to main content
Home
Rampancy
Death by intelligence.
  • home
  • info
  • timeline
  • articles
  • sheetmusic
  • images
  • movies
  • podcast
  • schedule
 

Narcogen@Twitch
Mastodon
Bluesky

Features

  • Articles
  • Bungie Humor
  • FAQs
  • Linktree
  • Rampancy Let's Plays
  • What is Rampancy?

Bungie Let's Plays

  • Destiny
  • Marathon
  • Myth

User login

  • Reset password

Social Media

Mastodon
bluesky

  1. Home »
  2. Take This Cron Job And Shove It

Tue, 02/10/2004 - 23:18 by narcogen

Take This Cron Job And Shove It -

Editor's Note: This is a little off-topic for Rampancy proper, and the full article will only be of interest if you're a fansite admin familiar with Apache and PHP. The upshot of it all, for Rampancy, is that the site should be performing now better than it ever has, and in the near future it should be much easier to navigate. Gone will be the days of linking to Rampancy articles by ID number; all common pages will have a name.

But if you're interested in how this longed-for feature finally got activated, as well as why the site's been such a dog this week, read on.

Earlier this week, I began migrating all of the sites hosted by Synfibers that were using the PHP-based CMS called Drupal from version 4.2 (and one site that was hosted on a modified 4.0 installation) to the latest and greatest edition, 4.3.2.

For the most part, the process was pretty painless. Unlike some previous upgrades, there were very few times I had to manually alter a site's database to get the new code tree to work, and most of the time these were mentioned explicitly by the upgrade script. The first day I did one site, and let it sit overnight. When everything seemed to be fine, I did five more sites the next day.

That's when everything seemed to slow down to a crawl, about every fifteen minutes or so. I thought it was the scheduled cron job that updates the database with XML newsfeeds and so on; I figured some of the newsfeed URLs were bad or had malformed XML, so I started weeding out the feeds, removing sites that had invalid XML or sites that didn't seem to have active newsfeeds anymore.

Then Verio tech support emails me, warning of dangerously high CPU usage in some Apache webserver threads, and suggests that it might be due to Googlebot. I thought at first it was again the cron jobs and that he just wasn't familiar with my usage of Drupal and the need for these cron scripts to be run frequently. However, as it turns out he did, and he had already dismissed the cron jobs as a potential cause because the times when they ran didn't seem to match up with the times when these high-CPU-usage Apache processes were spawning.

So that meant that either the search bots were the culprit, or somebody was DDOSing Synfibers. That seemed unlikely, so I started changing some things. I used robots.txt to exclude cron.php from their view, figuring that perhaps the bots were hitting that page and causing multiple cron script instances, which might bog things down. I took a look at top on the server and noticed some Apache processes going as high as 95% CPU utilization, in effect bringing the server to its knees. [Synfibers]

category: 
offtopic
topic: 
site
tech

Comments

Anonymous's picture

Anonymous (not verified)

Unfinished?

Fri, 02/11/2005 - 02:26
Did you only post half of this article, you kinda just stopped and never tied everything together. 8-/
narcogen's picture

narcogen

Follow the link

Fri, 02/11/2005 - 03:47

This is just a blogged preview from the full story at Synfibers.com.

http://synfibers.com/mod_rewrite


----

Rampant for over five years.


Rampant for over se7en years.



Anonymous comments are moderated. Please do not post duplicate comments. Halo, Halo content and Halo characters are property of Microsoft. Bungie, Marathon and Destiny are property of Bungie LLC. Myth is property of Take Two. Other trademark names are property of their respective owners. Copyrighted materials used under fair use.Original contents © Copyright Synthetic Fibers 2000-2024 except where otherwise noted.This site uses cookies to store session information on your device.