Commit Graph

83 Commits

Author SHA1 Message Date
0b4eff93e2 Exclude repositories from ftp browser
There is no need to browse into the repositories on the main website.
It's still possible to browse there directly on the ftp servers of course,
for those that need to debug a repository install and things like that.
2013-10-18 17:59:52 +02:00
0446a3ef68 Create pkill command that doesn't kill itself 2013-09-04 20:12:06 +02:00
c4a72a3317 Update command should run the kill as sudo 2013-08-18 19:40:04 +02:00
5834e5efda bring the example code for the mediawiki plugin more in line with what
we actually use in production, this mostly consists of the "new style" support
for raw data passthrough, but does not fix the actual problem (which is dealing
with url "double" encoding/decoding - or rather the loss of information about
that) yet...
Will fix that and sync up exactly with the production code for MW 1.19 soon...
2013-08-13 23:56:32 +02:00
7dc9e105f9 Ensure db connection is closed before script exits
This is required in django 1.4, but was closed automatically back
in 1.2 when it was created.
2013-07-01 22:49:46 +02:00
78de94d17c Update community authentication to pass an arbitrary datablock instead of url
This makes it possible to pass URLs that will fail when they end up being double
escaped in some cases, since they contain non-url-safe characters. Instead, they'd
be base64-encoded, and thus safe.

Also update the django community auth provider to do just this, including encrypting
the data with the site secret key to make sure it can't be changed/injected by
tricking the user to go directly to the wrong URL.
2013-06-20 15:16:47 +02:00
e5385a5222 Update for new version of feedparser library 2013-06-16 21:54:09 +02:00
be9b0c25ef fix a minor typo 2013-05-26 19:59:03 -04:00
cc9d8e0e01 change example rewrite rules to something that also catches the "nocache"
versions of the mw pages
2013-05-26 19:58:09 -04:00
976549931b remove the code to add default user_options for the upcoming MW 1.19 upgrade
user preferences in MW pos 1.18 got moved to a seperate table and only stuff
that is _NOT_ default should be stored there. Despite what the documentation
says actually having data left in user_options is harmful and will break
random functionality like preference handling
2013-05-26 13:31:00 -04:00
232c78aaa6 Add header as required by the w3c validator these days 2013-03-07 21:43:06 +01:00
a1e9de73e6 Add new book "PostgreSQL: Up and Running" 2013-01-01 18:38:02 +01:00
33ed40343b Add support for varnish purging based on expressions
Previously we would only purge based on URLs, but some of the upcoming
new work requires arbitrary expression purging.

NOTE! Require the creation of the new SQL procecure in the database,
either from varnish.sql or varnish_local.sql depending on if it's prod
or dev.
2012-10-03 12:48:47 +02:00
45f53d5d03 When loading the current version of docs, also purge /current/ 2012-09-24 12:38:49 +02:00
d8169e0f05 Add small tool to run w3c validator on local pages before deployment 2012-09-10 14:02:19 +02:00
f940801e3f Add generic PHP code for decryption/parsing
In general, you're better off looking at the mediawiki plugin which
is a bit more complete..
2012-06-03 12:29:02 +02:00
2a13d4a48c Add community auth v2 provider for mediawiki 2012-06-03 12:20:40 +02:00
7970aaf38c Add redmine community auth plugin
Written by Alex Shulgin (ash at commandprompt.com)
2012-05-25 13:10:58 +02:00
cc915eb973 Oops - don't put the email in the name field, use the name instead 2012-05-23 16:05:33 +02:00
c3c882c1d4 Add script to migrate a single account from the old system to the new
Required in order to be able to do password recovery on accounts, since
we only migrate users when they actually log in...
2012-05-23 16:02:18 +02:00
47b839c472 Close db connection in docloader
To make Stefan happy...
2012-05-18 10:53:14 -04:00
f9486b54f1 Don't generate http requests with two Host: headers
This would cause http 400 requests at the server, unsurprisingly.
2012-02-07 13:05:41 +01:00
03794e3157 Remove dead code 2012-02-07 13:04:19 +01:00
1cf436ac4f Weigh hits based on score when given in sitemap
In particular, this is used to prioritise hits in the documentation,
and in the current version of the documentation over older versions.
2012-02-07 12:02:59 +01:00
9c8cfba180 Probably a good idea to use the same column name everywhere... 2012-01-30 10:43:47 +01:00
f448173b18 Bring back the groupid in list-of-lists for search
Seems we were actually using this :-)
2012-01-30 10:40:02 +01:00
36d1b1b6a4 Need to bring back the lastcrawl table for nagios monitoring 2012-01-29 14:00:55 +01:00
8501359ef0 Don't confuse total number of pages with total number of hits 2012-01-29 13:28:33 +01:00
e6e19af354 Forgot to qualify the suburl column in one location... 2012-01-29 13:28:21 +01:00
ae384e0f79 Don't break on months that have exactly one message in the database before we start 2012-01-27 18:30:31 +01:00
4f4f13d74e Fix typo that caused exception when reporting an exception 2012-01-27 18:30:22 +01:00
621e55f532 Update message counts when finished crawling lists 2012-01-22 21:58:03 +01:00
81fedf6def Make it possible to crawl more than one list at a time, without doing all 2012-01-22 12:06:32 +01:00
b8a2015be2 New set of web search crawlers and infrastructure
Replaces the old search code with something that's not quite as much
spaghetti (e.g. not evolved over too much time), and more stable (actual
error handling instead of random crashes)

Crawlers are now also multithreaded to deal with higher latency to some
sites.
2012-01-21 15:27:06 +01:00
69e4a9cc50 Add new frontend server to the pgq config. 2012-01-06 10:39:04 +00:00
4e3dc6c795 Add sample login.html for django admin logins 2011-12-28 15:03:55 +01:00
88d0b6148d Implement single sign-out for community login 2.0
In order to provide a consistent user experience, we must sign the
user out from the main website if the community site provides a logout
button - else that button will appear not to work...
2011-12-27 19:35:51 +01:00
4f23ee8c52 Add sample perl code to decrypt authentication tokens
Code from Claes Jakobsson
2011-12-22 19:25:33 +01:00
5ef06f359d Add sample django community authentication consumer 2011-12-22 19:25:33 +01:00
6db6283401 Include a timestamp in the authentication token
This way we can expire a token after e.g. 10 or 30 seconds, making
it impossible to do a replay attack later.
2011-12-22 19:25:32 +01:00
1f78460779 Implement community authentication 2.0
This system relies on http redirects and signing in to the main website
instead of using cross-internet pgsql connections and signing in individually
to each website.
2011-12-22 19:25:32 +01:00
dcc7c4474d Do a sleep before reloading the website, to avoid interfering with automirror
This should hopefully get rid of transient errors caused by automirror hitting
the site during reload, before our script has a change to pull the local site.
2011-12-18 17:05:09 +01:00
209e19fa86 Make two quick requests to the website after reload
Sometimes we get a http 503 error from lighttpd if we hit the system
right after reload - make sure that we hit these errors from
the update script instead of leaking it to the end user or our
mirror script.
2011-12-11 14:20:33 +01:00
e80c739046 Make sure the host header is set to www.postgresql.org 2011-12-06 20:10:19 +01:00
758a721405 Allow for the ftpsite pickle to be uploaded via https
Previously this had to be rsynced outside of the website. By allowing the
upload here, and automatically purging the data from varnish, we will reach
"almost instant" updates of the ftp site structure on the web.
2011-12-06 19:13:32 +01:00
1ea8ecd12a Include docs load timestamp in developer docs
This makes it possible to figure out when the docs were actually
loaded, since developer docs don't carry a version number. This is
actually going to be the docs *load* timestamp, and not build timestamp,
but they should be close enough together that it shouldn't matter.

Fixes #108
2011-12-04 15:14:54 +01:00
585a0e2efe Purge the correct URL when loading developer docs
They're /docs/devel/, not /docs/0/ - that's just used to cheat the
version system  being a decimal field...
2011-12-04 15:01:43 +01:00
22d121f643 Purge related URLs from varnish when a new item is fetched from RSS
Fixes #95
2011-12-04 14:34:52 +01:00
71903d7f0c Parse files in the ftp tree as utf8, so as not to get random encoding errors
Any characters that aren't utf-8 will get replaced with the unicode
replacement character, instead of throwing an exception.

Fixes #106
2011-11-28 21:14:40 +01:00
4c7c2f34a1 Remove more references to wwwmaster 2011-11-27 17:52:36 +01:00