10 Simple, Elegant, and Free Wordpress Themes
1. stylized
2. curiouslygreen
3. standardissue
4. nonzero
5. emporium
6. splendid
7. unembellished
8. reference
9. shallowgrunge
10. plainoffice
FreeBSD 8.0 – Code Freeze – Whats cooking? Why FreeBSD? Why are you using linux?
FreeBSD 8.0 – Code freeze [www.evilcoder.org]
In continued preparation to the FreeBSD 8.0 Release, the tree had been frozen by re@. This means that development had been slowed down a lot because everything needs to be reviewed by the Release Engineers first. This step will make sure that the system will get as stable as possible before a fork will be done to RELENG_8. Stay tuned!
What's cooking for FreeBSD 8? Plenty. Jails v2, Xen dom-U support, ULE 3.0, and DTrace will all likely make it into the release, not to mention all the other new features, stability and code improvements. What else is cooking in FreeBSD 8? Here is a very long post to pump you up.
FreeBSD - What's cooking for FreeBSD 8? [ivoras.sharanet.org]
Dizzy? Pull yourself together. From Wikipedia [en.wikipedia.org]:
As of 2009, FreeBSD 8.0 is the "bleeding edge" development version, called -CURRENT in FreeBSD development terminology. It will feature superpages [en.wikipedia.org], Xen [en.wikipedia.org] DomU support, network stack virtualization, stack-smashing protection [en.wikipedia.org], TTY layer rewrite, much improved ZFS support and, new USB stack, and rewritten NFS client/server introducing NFSv4. Inclusion of improved device mmap() extensions will allow the technical implementation of a 64-bit Nvidia display driver for the amd64 platform. FreeBSD 8.0 is planned to be released in the 3rd quarter of 2009.
Yahoo! has been using it for years. I have been using it for years. Why FreeBSD? IBM tells you why in 2005.
Why FreeBSD - A quick tour of the BSD alternative [www.ibm.com]
FreeBSD tells you why.
About FreeBSD's Internetworking [www.freebsd.org]
FreeBSD made you a slideshow about 7.0 explaining why.
Introducing FreeBSD 7.0 [people.freebsd.org]
Right now, you might be feeling anxious. You've been using some linux distro with a name you can't even pronounce or spell properly and a confusing package management system. You thought it was the best. Better than Windows right? You don't know anymore. It doesn't make sense. You're not sure if it ever did make sense. The room starts spinning. You realize this must have been a joke. A Penguin? Some guy named Linus? Ubuntu? Why didn't you see the clues? Is this the real life?
Why are you using linux?
How to install Lighttpd, PHP 5, and MySQL 5.1 on FreeBSD 7.2
Installing MySQL
Before we start, make sure your ports are updated.
portsnap fetch
portsnap update
Right. Now we're ready.
cd /usr/ports/databases/mysql51-server
make install clean
This might take a while. Do something productive.
When that finishes we'll install the database, and make sure user and group mysql is the owner of the database folder.
/usr/local/bin/mysql_install_db
chown -R mysql /var/db/mysql/
chgrp -R mysql /var/db/mysql/
Next, we'll set MySQL to start at FreeBSD boot.
echo 'mysql_enable="YES"' >> /etc/rc.conf
Finally, we'll start MySQL and change the root password to keep out the hackers.
/usr/local/etc/rc.d/mysql-server start
/usr/local/bin/mysqladmin -u root password newpassword
If you did everything right, MySQL should now be running.
Installing Lighttpd
Assuming our ports are still up to date after that MySQL compile, we'll start the Lighttpd install.
cd /usr/ports/www/lighttpd
make install clean
When that's finished, we'll create the document root folder.
mkdir -p /usr/local/www/data
Now, create the access and error log files. Those files are:
/var/log/lighttpd.access.log
/var/log/lighttpd.error.log
Next, we'll give Lighttpd ownership of it's log files.
chown www:www /var/log/lighttpd.access.log
chown www:www /var/log/lighttpd.error.log
Now it's time to open up the Lighttpd configuration file. Great.
vi /usr/local/etc/lighttpd.conf
Uncomment the line "mod_fastcgi", in "server.modules".
Append the following code in the fastcgi module section.
fastcgi.server = ( ".php" =>
( "localhost" =>
(
"socket" => "/tmp/php-fastcgi.socket",
"bin-path" => "/usr/local/bin/php-cgi"
)
)
)
Finally, we'll set Lighttpd to start with FreeBSD.
echo 'lighttpd_enable="YES"' >> /etc/rc.conf
Before we start Lighttpd, Lets install PHP.
Installing PHP
Installing PHP is very easy. Ready? Ok.
cd /usr/ports/lang/php5
make install clean
Now we'll install some PHP extensions. Make sure you check the mysql and or mysqli boxes to be able to use MySQL in PHP.
cd /usr/ports/lang/php5-extensions
make config install clean
Next, copy php.ini-recommended to php.ini.
cp /usr/local/etc/php.ini-recommended /usr/local/etc/php.ini
With PHP installed, we can now start Lighttpd.
/usr/local/etc/rc.d/lighttpd start
It should work now, unless I missed a few steps. Create a phpinfo script in /usr/local/www/data and load it in your browser to test!
How to install MySQL 5.1 on FreeBSD 7.2
I've had trouble in the past with installing MySQL on FreeBSD. This is the best method I've used that works.
Before we start, make sure your ports are updated.
portsnap fetch
portsnap update
Right. Now we're ready.
cd /usr/ports/databases/mysql51-server
make install clean
This might take a while. Do something productive.
When that finishes we'll install the database, and make sure user and group mysql is the owner of the database folder.
/usr/local/bin/mysql_install_db
chown -R mysql /var/db/mysql/
chgrp -R mysql /var/db/mysql/
Next, we'll set MySQL to start at FreeBSD boot.
echo 'mysql_enable="YES"' >> /etc/rc.conf
Finally, we'll start MySQL and change the root password to keep out the hackers.
/usr/local/etc/rc.d/mysql-server start
/usr/local/bin/mysqladmin -u root password newpassword
Full code:
cd /usr/ports/databases/mysql51-server
make install clean
/usr/local/bin/mysql_install_db
chown -R mysql /var/db/mysql/
chgrp -R mysql /var/db/mysql/
echo 'mysql_enable="YES"' >> /etc/rc.conf
/usr/local/etc/rc.d/mysql-server start
/usr/local/bin/mysqladmin -u root password newpassword
Create a PHP web crawler or scraper in 5 minutes using php-CURL, and php-DOM
This post is in response to the articles floating around the internet named, for the most part, "Create a PHP web crawler or scraper in 5 minutes." Here we use PHP DOM and some more complex methods to create "infinitely extendable web crawler in under 5 minutes." Actually it depends on how fast you can read and copy the code.
The Framework
We start with a blank DOM_Crawler class:
class DOM_Crawler { }
Next, we'll create the methods to fetch, prepare, load into DOM, and collect links.
class DOM_Crawler { protected $markup = ''; protected $dom = null; protected $base_url = null; public function __construct ($url) { } protected function _load_markup ($url) { } protected function _prepare_markup ($content, $encod='') { } protected function _init_dom () { } public function get ($type) { } protected function _get_links () { } protected function _base_url ($url) { } }
Fetching Content
As in the other examples, the constructor will accept a URL then pass it on to the method "_load_markup()." Php-curl is used here instead of file_get_contents() as file_get_contents() is deprecated in favor of using the CURL libraries. "_prepare_markup()" is then used to fix encoding issues before loading into Php-DOM. "_prepare_markup()" is explained in the comments of PHP: DOMDocument::loadHTML - Manual. "init_dom()" will initialize Php-DOM.
public function __construct ($url) { $this->base_url = $this->_base_url($url); $this->markup = $this->_load_markup($url); $this->markup = $this->_prepare_markup($this->markup); $this->_init_dom(); } protected function _load_markup ($url) { $ch = curl_init(); $timeout = 10; curl_setopt ($ch, CURLOPT_URL, $url); curl_setopt ($ch, CURLOPT_RETURNTRANSFER, 1); curl_setopt ($ch, CURLOPT_CONNECTTIMEOUT, $timeout); $contents = curl_exec($ch); curl_close($ch); return $contents; } protected function _prepare_markup ($content, $encod='') { mb_detect_order("ASCII,UTF-8,ISO-8859-1,windows-1252,iso-8859-15"); if (!empty($content)) { if (empty($encod)) { $encod = mb_detect_encoding($content); } $headpos = mb_strpos($content,'<head>'); if ($headpos === false) { $headpos = mb_strpos($content,'<HEAD>'); } if ($headpos !== false) { $headpos += 6; $content = mb_substr($content, 0, $headpos) . '<meta http-equiv="Content-Type" content="text/html; charset=' . $encod . '">' . mb_substr($content, $headpos); } $content = mb_convert_encoding($content, 'HTML-ENTITIES', $encod); return $content; } } protected function _init_dom () { $this->dom = new DomDocument; $this->dom->loadHTML($this->markup); $this->dom->normalizeDocument(); }
Crawling the Content for Data
From the other examples: "Our get() method will accept a $type string which essentially will simply be used to invoke another method actually doing the processing. As you can see below we construct the method name as a string, then make sure it is available so now developers can utilize this simply by invoking $crawl->get('images');."
The "_get_links()" will start by fetching all tags named 'a', then it uses DOMElement::getAttribute to return the actual URL from the "href" attribute. If any link lacks the "base" or "scheme, user, password, host, port," we'll fix it by adding $this->base_url. '#' links are simply not included in the array.
public function get ($type) { $method = "_get_{$type}"; if (method_exists($this, $method)) { return call_user_method($method, $this); } } protected function _get_links () { if (!empty($this->markup)) { $anchors = $this->dom->getElementsByTagName('a'); foreach ($anchors as $anchor) { $href = $anchor->getAttribute('href'); if (mb_substr($href, 0, 1) == '/') { $links[] = $this->base_url . $href; } else if (mb_substr($href, 0, 1) != '#') { $links[] = $href; } } return $links; } return false; } protected function _base_url ($url) { $parsed_url = parse_url($url); $base_url = $parsed_url['scheme'] . '://'; if (!empty($parsed_url['user'])) { $base_url .= $parsed_url['user']; if (!empty($parsed_url['pass'])) { $base_url .= ':' . $parsed_url['pass']; } $base_url .= '@'; } $base_url .= $parsed_url['host']; if (!empty($parsed_url['port'])) { $base_url .= ':' . $parsed_url['port']; } return $base_url; }
Final Web Crawler Code
class DOM_Crawler { protected $markup = ''; protected $dom = null; protected $base_url = null; public function __construct ($url) { $this->base_url = $this->_base_url($url); $this->markup = $this->_load_markup($url); $this->markup = $this->_prepare_markup($this->markup); $this->_init_dom(); } protected function _load_markup ($url) { $ch = curl_init(); $timeout = 10; curl_setopt ($ch, CURLOPT_URL, $url); curl_setopt ($ch, CURLOPT_RETURNTRANSFER, 1); curl_setopt ($ch, CURLOPT_CONNECTTIMEOUT, $timeout); $contents = curl_exec($ch); curl_close($ch); return $contents; } protected function _prepare_markup ($content, $encod='') { mb_detect_order("ASCII,UTF-8,ISO-8859-1,windows-1252,iso-8859-15"); if (!empty($content)) { if (empty($encod)) { $encod = mb_detect_encoding($content); } $headpos = mb_strpos($content,'<head>'); if ($headpos === false) { $headpos = mb_strpos($content,'<HEAD>'); } if ($headpos !== false) { $headpos += 6; $content = mb_substr($content, 0, $headpos) . '<meta http-equiv="Content-Type" content="text/html; charset=' . $encod . '">' . mb_substr($content, $headpos); } $content = mb_convert_encoding($content, 'HTML-ENTITIES', $encod); return $content; } } protected function _init_dom () { $this->dom = new DomDocument; $this->dom->loadHTML($this->markup); $this->dom->normalizeDocument(); } public function get ($type) { $method = "_get_{$type}"; if (method_exists($this, $method)) { return call_user_method($method, $this); } } protected function _get_links () { if (!empty($this->markup)) { $anchors = $this->dom->getElementsByTagName('a'); foreach ($anchors as $anchor) { $href = $anchor->getAttribute('href'); if (mb_substr($href, 0, 1) == '/') { $links[] = $this->base_url . $href; } else if (mb_substr($href, 0, 1) != '#') { $links[] = $href; } } return $links; } return false; } protected function _base_url ($url) { $parsed_url = parse_url($url); $base_url = $parsed_url['scheme'] . '://'; if (!empty($parsed_url['user'])) { $base_url .= $parsed_url['user']; if (!empty($parsed_url['pass'])) { $base_url .= ':' . $parsed_url['pass']; } $base_url .= '@'; } $base_url .= $parsed_url['host']; if (!empty($parsed_url['port'])) { $base_url .= ':' . $parsed_url['port']; } return $base_url; } }
Usage
$links[0] = 'http://www.reddit.com/'; for ($i = 0; count($links) < 100; $i++) { $crawler = new DOM_Crawler($links[$i]); $links = _merge_unique($links, $crawler->get('links')); } print_r($links); function _merge_unique ($arr, $arr2) { foreach ($arr2 as $key => $value) { if (!in_array($value, $arr)) { $arr[] = $value; } } return $arr; }
Returns:
Array ( [0] => http://www.reddit.com/ [1] => http://www.reddit.com/r/funny/ [2] => http://www.reddit.com/r/gaming/ [3] => http://www.reddit.com/r/pics/ [4] => http://www.reddit.com/r/politics/ [5] => http://www.reddit.com/r/programming/ [6] => http://www.reddit.com/r/reddit.com/ [7] => http://www.reddit.com/r/science/ [8] => http://www.reddit.com/r/technology/ [9] => http://www.reddit.com/r/worldnews/ [10] => http://www.reddit.com/r/WTF/ ..... [201] => http://www.reddit.com/help/faq [202] => http://www.reddit.com/help/reddiquette [203] => http://www.reddit.com/bookmarklets/ [204] => http://www.reddit.com/buttons/ [205] => http://www.reddit.com/code/ [206] => http://www.reddit.com/socialite/ [207] => http://www.reddit.com/widget/ [208] => http://www.reddit.com/iphone/ [209] => http://www.reddit.com/blog/ [210] => http://justin.tv/reddit [211] => http://www.reddit.com/ad_inq/ [212] => http://www.reddit.tv [213] => http://www.redditall.com [214] => http://www.baconbuzz.com [215] => http://reddit.destructoid.com [216] => http://www.thecutelist.com [217] => http://reddit.independent.co.uk [218] => http://www.redditgadgetguide.com [219] => http://www.weheartgossip.com [220] => http://www.idealistnews.com [221] => http://www.wired.com [222] => http://www.arstechnica.com [223] => http://www.style.com [224] => http://www.epicurious.com [225] => http://www.concierge.com [226] => http://reddit.com/help/useragreement [227] => http://reddit.com/help/privacypolicy [228] => http://www.reddit.com/feedback )










