Recipe for Tweakers.net based on built in from Kovid Goyal

#!/usr/bin/env python2
# vim:fileencoding=UTF-8:ts=4:sw=4:sta:et:sts=4:ai
from __future__ import with_statement

''' Changelog
2012-04-27 DrMerry:
Added cover picture
removed some extra tags
'''

__license__ = 'GPL v3'
__copyright__ = '2009, Kovid Goyal <kovid@kovidgoyal.net>'
__docformat__ = 'restructuredtext en'

import re
from calibre.web.feeds.news import BasicNewsRecipe

class Tweakers(BasicNewsRecipe):
title = u'Tweakers.net'
__author__ = 'Kovid Goyal'
language = 'nl'
oldest_article = 4
max_articles_per_feed = 40
cover_url = 'http://tweakers.net/ext/launch/g/logo.gif'

keep_only_tags = [dict(name='div', attrs={'class': 'columnwrapper news'}),
dict(name='div', attrs={'class': 'article'})
]

remove_tags = [dict(name='div', attrs={'class': 'reacties'}),
{'id': ['utracker', 'socialButtons', 'b_ac']},
{'class': ['sidebar', 'advertorial']},
{'class': re.compile('nextPrevious')},
]
no_stylesheets = True
filter_regexps = [r'ads\.doubleclick\.net', r'ad\.doubleclick\.net']

feeds = [(u'Tweakers.net', u'http://tweakers.net/feeds/nieuws.xml')]

def preprocess_html(self, soup):
for a in soup.findAll('a', href=True, rel=True):
if a['rel'].startswith('imageview'):
a['src'] = a['href']
del a['href']
a.name = 'img'
for x in a.findAll(True):
x.extract()
return soup

def postprocess_html(self, soup, first):
for base in soup.findAll('base'):
base.extract()
return soup

Recipe for Tweakers.net based on built in from Kovid Goyal

Trending Articles

Nalgonda District Police Office Mobile Numbers List in Telangana State

Rajasthan Board 10th Result 2016 Roll No wise & Name Wise

Teen Shot In Miami Drive-By Dies From Injuries

Practice Sheet of Right form of verbs for HSC Students

VMOU RSCIT Result 2017, RSCIT Result VMOU rkcl.vmou.ac.in Name Wise

Moondru Mudichu 02-03-2017 – Polimer tv Serial

O'CONNELL MICHAEL F. 11/29/197...

[同期の失敗] について

Could Not Find the Application that Created this file

Edna Murto, 90, longtime resident of Ely, dies

GTA 5 PPSSPP Zip File Download For Android Mediafire 382 MB

Mp3 Download: Mandoza - Godoba

Arrow Flash 2 – Sinhala Dubbed – Episode 17 – 28th February 2016

Download: Bicko Bicko ft Rich Bizzy & Crew G- Wanfulanganya (Prod by: Bicko...

Arrest logs for Wednesday, March 20, 2019

Bureau of Internal Revenue: Regional Offices (Directory)

SEAGCD2 - Editorial

Not right!

Best Suvichar in Hindi |बेस्ट सुविचार |शुभ विचार हिंदी में

EXERCISE