Paywalls: something fresh is on the air (write this: shareable paygate)

A fresh concept is evolving around the paywalls. In the last few days journalists are talking about how bad is the practice of penalize the most loyal readers.

As Jeff Jarvis pointed out in Why not a reverse meter?, “media are valuing our readers/users/customers opposite how we should, rewarding the freeriders and taxing — and perhaps turning away — the valuable users“.

Paywall was some sort of dubious “innovation” one year ago and still is controversial. It is faulty, to say the least. Reversing it seems a good idea, but how? It can be more difficult than it seems. There are no good, practicable answers yet. “So never mind the idea of the reverse meter, but retain the lesson of it: Value should be encouraged, not taxed” (Jarvis again, my bold).

At GigaOM Mathew Ingram titled: Don’t penalize loyal users with paywalls, reward them. In the article, Ingram exposes the ideas of Raju Narisetti, Washington Post managing editor, who talks about a “positive meter” and said in a presentation: “Don’t penalize engaged readers of websites with a pay wall: reward your active users“. (Slides at the end, thanks to Narisetti and Ingram.)

Ingram ends with this pertinent interrogation: “The fact that anyone actually pays those monthly fees is a tribute to the brand loyalty that readers have developed to places like the New York Times. So why not try to come up with ways of turning that loyalty into a benefit instead of a penalty?

So, something fresh is on the air. On Twitter, @SaDuros was interested on the subject, just like me (example tweet). I mentioned a good way of valuing loyalty of best kind — that from paying subscribers. Thomas Baekal was a model. He calls it a paygate. The basics: the subscriber can share with his/her network the full article he paid for.

Baekal wrote about it more than a year ago. As a matter of fact that article convinced me to subscribe his Plus section. The subscription gives me the privilege to share it with you, full text: Forget The Paywalls Build Shareable Paygates.

Almost one year living with The New York Times porous paywall, I think journalism industry is ready to the paywall 2.0: discovering ways to reward the paying readers. Rewarding best commenters with some sort of clout or karma (or a simple points system) is elementary. Shareable paygates are interesting and promising. Between the two, there is a new territory to explore.

The digital turning point: social media, work, newspapers, privacy and democracy

“A decade of studies by the USC Annenberg Center for the Digital Future creates a portrait of the American user of the Internet reaping the benefits of online activity, while at the same time paying a tremendous price in the form of time, privacy, and well-being” (quote from USC Annenberg | Is America at a digital turning point?).

The article identifies nine major issues. Most of them applies not only to America, but also in Europe and the rest of the e-mature economies, those with wigh levels of IT for a long time. Notably the following six:

  • Social media explodes – but most content has no credibility.
  • The desktop PC is dead; long live the tablet.
  • Work is increasingly a 24/7 experience.
  • Most print newspapers will be gone in five years.
  • Our privacy is lost.
  • The Internet’s role in the political process is still a question.

New York Times makes the right transition: from print to technology

Janet Robinson is going to step down at the end of the month, announced The New York Times.

Chairman Arthur Sulzberger Jr. will serve as CEO for a time, while NYT company look “to the technology sector for a new chief executive as its businesses shift to online formats”, writes Amy Chozick (Media Decoder: Janet Robinson, Chief Executive of Times Co., to Step Down).

I think this is the right move. Media companies need to embrace technology not merely as a means to an end (be it print or web), but as core factor.

Information is going to be more and more data. Gathering, processing, presenting and distributing it needs more and more technological skills and technology understanding at all levels of a media organization, from management to editors.

Being the first, NYT will gain an advantage over their competitors. Again.

Varnish: time, resources and money saver. A good friend for journalists and bloggers

Let’s call him “Jim”. Jim had this problem: his blog was so successful that he had to upgrade its server. For a few months, it worked. But soon the problem repeated: hundreds of page-views for second, MySQL and Apache so busy that the server was like melting.

One simple update on Facebook — and the load rockets to 40, even 50. WordPress cache plug-ins and tweaks on the limit. Nothing else to do, but upgrade the hardware again?

Jim doesn’t like the idea. Ads are paying good, but raise the operational costs will cut the profit.

Solution: an HTTP cache/proxy/accelerator as front end, saving Apache+MySQL efforts.

I had heard about Varnish before, but at that time it looked to complex to my simple problems — and Jim’s problems as well. But as I was planning a new topics mash-up for a new client — a Belgian television website — I decided to take a second look; if it delivers as they promise, this HTTP accelerator can save a lot of money.

We tested it and here’s the conclusion: For Jim, using Varnish resulted on a day/night difference. Now, more than 90% of the page requests don’t event reach Apache + WordPress caching + MySQL. Server load went down to the 6-10 range and now he can engage more readers promoting his articles on Facebook.

And how did we do?

I read about it for two days. Everything, from blog analysis to documentation to examples and stories of implementation. Google was my dedicated, helping friend.

At the third day, action. It was so simple and easy. It’s more difficult to describe :)

Varnish has good install guides — replace them I will not, look here for what you need, if you need.

1. Preparation

I lost a few hours trying to understand how Varnish treats domains. You can serve any site with a Varnish front end. Some projects will perform better with a Varnish autonomous server. But the majority can share the same machine bot for the proxy/accelerator and the web server.

The problem comes with the default HTTP port. One can have Varnish and Apache (or nginx) sharing the port with some tricks. Documentation points to examples of different ports for Apache. Mut why complicate? I made it simple: Apache listens to port 80 on one IP address, Varnish listens on another IP address. As long as you have at least one spare IP address…

This solution has side benefits. First, you don’t have to change anything in your httpd conf files. One simple DNS modification is enough. For my case, and Jim’s, the simpler, the better. We both operate several domains on the same server and messing with httpd confs is always a headache.

2. Varnish and Plesk
I use Plesk no more, but Jim does. After much reading and testing, the best thing to do was edit httpd.conf and add a less broaden Listen directive.

We commented this line…

# Listen 80

… and add one line per each of Jim’s server IPs …

Listen your-ip-1:80
Listen your-ip-2:80
..
Listen your-ip-8:80

… Except one: the IP address we reserve for Varnish

It’s known that Plesk rewrites Apache conf files for each domain. But apparently it doesn’t touch httpd.conf — at least, not in Jim’s case (Plesk 9.5.2).

3. Configuration
Varnish has two configurations. First, you tell how the daemon should operate. That can be done in the command line or using the varnish daemon configuration file. Read the documentation for exhaustive coverage. The next is just a working example: mine.

DAEMON_OPTS="-a 127.0.0.1:80 \
-f /etc/varnish/default.vcl \
-s malloc,3G"

First line points out the IP address Apache is listening. Depending on your configuration, it can be 127.0.0.1, localhost or a real IP.
Second line indicates the second configuration file — a more complex one.
Third line tells Varnish to cache pages in memory, using no more than 3 GB (using memory is generally better for performance than using disk, but you can use disk also).

Now, the VCL configuration (VCL 3.0 is described here).

There aren’t many examples out there of VCL for WordPress installations. And some of them didn’t work for me. Cookies are the problem. After some reading and testing, I came to this VCL configuration, a mix of solutions from others with a touch (like req.grace):

backend origin {
.host = "127.0.0.1";
.port = "80";
}

sub vcl_recv {
set req.backend = origin; # using our unique backend
set req.grace = 2h; # MAGIC: Varnish will serve your pages for 2 hours EVEN if Apache stops.
# set standard proxied ip header for getting original remote address
set req.http.X-Forwarded-For = client.ip;

# logged in users must always pass
if( req.url ~ "^/wp-(login|admin)" || req.http.Cookie ~ "wordpress_logged_in_" ) {
return (pass);
}

# I have a community section I don't want to cache
if( req.url ~ "^/community/" ) {
return (pass);
}

# don't cache search results
if( req.url ~ "\?s=" ){
return (pass);
}

# always pass through posted requests and those with basic auth
if ( req.request == "POST" || req.http.Authorization ) {
return (pass);
}

# else ok to fetch a cached page
unset req.http.Cookie;
return (lookup);
}

sub vcl_fetch {
set beresp.grace = 2h;

# remove some headers we never want to see
unset beresp.http.Server;
unset beresp.http.X-Powered-By;

# only allow cookies to be set if we're in admin or community area
if( beresp.http.Set-Cookie && req.url !~ "/wp-(login|admin)" && req.url !~"/community" ) {
unset beresp.http.Set-Cookie;
}

# don't cache response to posted requests or those with basic auth
if ( req.request == "POST" || req.http.Authorization ) {
return (hit_for_pass);
}

# only cache status ok
if ( beresp.status != 200 ) {
return (hit_for_pass);
}

# don't cache search results
if( req.url ~ "\?s=" ){
return (hit_for_pass);
}

# else ok to cache the response
set beresp.ttl = 300s;
return (deliver);
}

Do you have working examples to share?

I’ll be back

No. I’m not just quoting Mr. Schwarzenegger. I will be back to posting here.

I’ve been so busy programming. A web application and a web service occupied me all the time in recent months.

But worth it. A group of media in my country, Portugal, is making good use of my newsroom intelligence service (more on this shortly). And the web app “Topics” proved its worth and I am now to internationalize it (a French television will join the portuguese newspapers Jornal de Negócios and Correio da Manhã, which already use the web app Topics).

I intend to share some experiences. The most recent is the use of Varnish as a HTTP accelerator. It rocks — and it will save you big money in IT resources.

A choice of useful links to every (wannabe) journalist programmer

Jack Lail is News Director of Innovation for the Knoxville News Sentinel. Amongst other random mumblings, he wrote about Some data on database journalism. Because he was speaking to a journalism class about Computer-Assisted Reporting, Computational Journalism and Open Government Data Advocacy efforts and needed some material.

Foto credit: www.sxc.hu. Modified.

Go there an read the full post. Honoring his 3.0 CC license, I republish below a choice of his links, useful to every (wannabee) journalist programmer.

Computer-Assisted Reporting:

Tools

The new era of digital news publication (answers the question: what is a journalist programmer?)

In the new era of digital news publication does still apply the journalism jargon developed along two centuries of printing press?

In a word: no.

In an article published a week ago at ClickZ, Heidi Cohen synthesizes the 3 Cs of digital news consumption:

What makes a platform relatively desirable to a news consumer? Here are the three Cs of what they want from news:

  • Customized. It’s tailored based on individual needs, interests, location, political views, and other factors.
  • Curated. It’s selected by a combination of professional news editors and one’s social graph. This serves as a lens for which information is viewed and from what perspective.
  • Contributory. It’s enhanced and modified by the addition of opinions and sharing of information through various forms of social media (e.g., Facebook, Twitter, blogs, and comments).

Jeff Jarvis wrote that a while ago (in 2008), but almost anybody in the newsrooms was reading: The building block of journalism is no longer the article.

Divide the journalistic product, whatever it will be, no longer in articles, “breaking news”, respectful “opinions” from well established opinion-makers. Digital news publication are divided in three main areas, or keywords if you prefer. They are Firehose, Trend and Memory.

  • Firehose
    All the headlines, real time. Breaking news fit here, tips fit here, news fit here, new products, new events, new everything. Think Twitter, Facebook and news aggregators. You must publish first for the firehose. Your firehose (newspaper/blog feed, Twitter timeline) as well as the firehose defined by the consumer (customization, on Cohen’s description).
  • Trend
    What are people talking about, what is interesting? Generic or niche, what are *my* people discussing, being impressed by, affected by? The headlines or events “chewed” and considered, be it by conversations or explained in posts all over the web. Curating happens here. So does contributory journalism (a.k.a. citizen journalism).
  • Memory
    No understanding and education exist without memory. The memory is the data. The headlines that were trends in the past. Whenever it deserves a better explanation, the occurrence (what made the news) will be compared with the memory bank.

A journalist programmer is a journalist trained with informatics tools and skilled in detecting, editing and publishing trends, and knows which memory databases to query, and how to query them, every time she/he is called to explain or help understand what happened.

Harry Potter’s Daily Prophet is not good enough: present you Text 2.0

Speaking about all that media frenzy with iPad, how can reading evolve on digital devices? Do you think (as I do) in video instead of pictures like in The Daily Prophet read by Harry Potter?

OK, but think again. Think in a “page” where “sounds, images and other types of ambient events can be triggered appropriately according to the currently read position“, according to the movement of your eyes, according to the questions you ask the “book” or “newspaper” you’re reading.

Think Text 2.0. As they put it: “Text 2.0 is interactive, but without the need of explicit control. By a mere reading detection the system recognizes the current point of attention and enables to create an atmosphere full of sentiments, for example by using sounds of wind and water, emotional music or background themes“.

Using market-available eye-trackers along with HTML, CSS and JavaScript, they created a reading enhancement technology called Text 2.0.

From h+ mag’s Does This Headline Know You’re Reading It? (which was slashdotted):
This is not simply a case of using infrared light, a camera, and eye movement to move a cursor and click buttons: Text 2.0 infers user intentions and enhances the reading experience in far more complex ways. Reading certain words, phrases, or names can trigger the appearance of footnotes, translations, definitions, biographies, even sound effects or animations. Ask how a word is pronounced and you get a verbal answer. If you begin skimming the text, it fades out the less important words. If you glance away, a bookmark automatically appears, pointing to where you stopped reading

See the video below, you’ll understand. This guys are sustained by the German Research Center for Artificial Intelligence.

Andrew Keen use to say a lot of things

Is Innovation Fair? Andrew Keen Says No – titles Dana Oshiro a pertinent article on ReadWriteWeb.

Andrew Keen use to say a lot of things. I usually disagree with Andrew Keen. But this is also true: I *always* read Andrew Keen. Because Andre Keen makes me think.

You can’t get nuggets of truth in 30 seconds on Twitter…Skepticism requires deep thinking. We have an increasing nihilism when it comes to traditional authority and yet few of the new authorities are doing the reading or groundwork. …When we simply assume that all traditional structures are wrong, we risk the populism of a Sarah Palin…

Well, maybe Keen & Others looked for “truth” in Twitter, but not everybody did. I didn’t. And I’m not sure skepticism requires deep thinking. I can be skeptical by reaction, and reactions may occur (they generally do!) in a fraction of time. In a tweet.

Nevertheless, Keen made a point. And this is worth reading:

“Innovation doesn’t lead to justice and fairness. I’d argue there is a more dramatic inequality now then there ever was during the industrial revolution. We have fetishized change, but we are unfettered. If anything, the new media is less transparent and less accountable…I don’t have a problem with Twitter or new media, my problem is that digital utopians have dressed up their ideology to sound like democracy…Google has become the master of seeming like an altruistic and public company and yet laughing all the way to the bank.”

So, go there and read it.

Social media infographics (slideshow)

Oscar Berg loves “great infographics and so do I. He collected several social media-related infographics and come up with this nice slideshow, just look: