2010-02-23

Google’s impact on enterprise content management

Without a doubt Google has had a huge impact on the enterprise perspective on content management (ECM).

The pluses and negatives were highlighted by two blog posts yesterday:

On the plus side, John Mancini of AIIM listed three, "fundamental assumptions about information management that affect the ECM industry," in his "Googlization of Content" post:

  1. "Ease of use. The simple search box has become the central metaphor for how difficult we think it ought to be to find information, regardless of whether we are in the consumer world or behind the firewall. This has changed the expectations of how we expect ECM solutions to work and how difficult they are to learn.
  2. Most everything they do is free...
  3. They have changed how we think about the "cloud." Google has changed the nature of how we think about applications and how we think about where we store the information created by those applications. Sure, there are all sorts of security and retention and reliability issues to consider..."

On the negative side, Alan Pelz-Sharpe made a post today in CMS Watch titled, "Google – unsuitable for the enterprise". Alan introduced his piece by saying:

"For years now Google has played fast and loose with information confidentiality and privacy issues. As if further proof were needed, the PR disaster that is Buzz should be enough to firmly conclude that Google is not suitable for enterprise use-cases." He went on to say, "It is inconceivable that enterprise-focused vendors... would ever contemplate the reckless move that Google undertook in deliberately exposing customers' private information to all and sundry with Buzz."

Google is a hugely successful company, and they are extremely profitable. However, they are not a software company. Fundamentally they are an advertising placement company and everything they do is motivated by maximizing advertising revenue, whether directly or indirectly. 99% of their revenue comes from advertising that pays for every cool project they do and every service they offer.

While Google services to consumers have no monetary charge, they are not free:

  • You agree to accept the presentation of advertisements when you use Google products and services; most people believe these to be easily ignored despite the evidence of their effectiveness.
  • More importantly, you agree to offer provide information about your interests, friends, browsing and search habits as payment-in-kind. Mostly people sort of know this, but don't think about it. If you ask them whether they are concerned that Google has a record of every search they have ever performed, they start to get uncomfortable. I expect most of us have searched on terms, which taken out of context, would take a lot to 'explain.'

While most consumers in democracies are currently cavalier about issues of their own privacy, enterprises most certainly are not. Indeed, the need for careful management of intellectual property, agreements, revenue analyses and a host of other enterprise activities captured in content is precisely why they buy ECM systems.

The furor over Buzz points out that Google did things first and foremost to further its own corporate goals, which clash with those of other enterprises.

In contrast, Google's goals require it to align with user needs, especially for good interfaces. An easy-to-use interface encourages and sustains use. That ought to be obvious to everyone, but when the effects of the interface on usage are easily measureable and directly tied to revenue (as in the case of Google Search), it becomes blatantly and immediately evident. In contrast, the development of an interface for an enterprise software product may take place months or even years before the product is released. Even if detailed usability research is done with test users, and in-depth beta programs are employed, the quality and immediacy of the feedback is less.

Besides easy interfaces, enterprise content management users expect 'Google-like' search, and are disappointed. There are generally two reasons for this:

  • Search results have to be further processed to determine if a user can be presented with each 'hit' based on their permissions
    • Typically 70-90% of the total computational time for enterprise search is taken up by permission checking
  • Enterprises don't invest as much in search infrastructure as they should if the rapid delivery of search results was seen as critical

The second point is probably more important than people admit. In my experience significant computational resources are not allocated to Search by IT departments. I suspect that they look at average resource utilization, not peak performance and the time to deliver results to users. To deliver the typical half second or less response that Google considers to be essential, hundreds of servers may be involved. I am not aware of any Enterprise that allocates even the same order of magnitude of resources to content searching, so inevitably users experience dramatically slower response times.

In summary, the alignment of optimal user experiences with Google's need to place advertisements has advanced the standards of user interfaces and provided many 'free' services, but the clash of Google's corporate goals with the goals of other corporations has shown that the enterprise content has value that is not likely to be traded.

Syndicated at http://conversations.opentext.com/

2010-02-07

The ‘Second Coming’ of Renditions - Video


Long time ECM veterans will remember the concept of document rendition – a transformed alternative. I think we'll see renditions again.

A rendition is essentially another form of a specific version of a document. There are two common types of renditions based on format and content:

  1. The same information content as the original document, but a different file format
  • For example, a spreadsheet file can be renditioned as a PDF
  1. The same file format as the original document, but different content
  • For example, a MS PowerPoint Document written in English can have a rendition that is also a PowerPoint file, but whose content has been translated into French

Renditions for limited bandwidth in the 90's

In the 1990's, one of the common use cases was to deal with the limited bandwidth available at the time. It often took a long time to download and open a document just to see if it contained what you were looking for. Accordingly, Open Text Livelink automatically made HTML renditions of many common formats such as MS Word that were much smaller files and so could be downloaded much faster for quick review.

I remember presenting the use case to customers: "If you want to look quickly at a file without opening the full thing..." Back then bandwidth was so limited it made sense. Now it seldom does, although there are specific use-cases like renditions that contain added content like secured signatures that still have value.

Bandwidth issues are back

Bandwidth is becoming limiting again – not for 'simple' text documents, but for rich media files such as videos. In fact bandwidth issues are so acute that the shape of the Internet has changed radically in the last few years. The explosive growth of video sharing has lead to the rise of Content Delivery or Distribution Networks (CDN) such as Akamai Technologies, Limelight Networks, CDNetworks and Amazon CloudFront to enable effective distribution.

Akamai recently claimed they handle around 20% or the Internet traffic by volume – most of this traffic is rich media which must be delivered very quickly as users expect pages to load extremely quickly even if they contain a video. A recent Forrester report says the expected threshold to load has become two seconds.

For video files to be useful to end users they have to start to play almost instantly. This is usually achieved by:


  • Locating a copy in close network proximity to the end user
    • CDNs use many distributed sites around the 'edge of the Cloud' to ensure that is at least one site close to an end user preloaded with files that are expected to be required
  • Reducing the size of the video through transcoding and compression
  • Streaming – starting to play before all of the content is received
The increasing use of mobile devices with narrow and unstable bandwidth connections, and different format requirements creates further hurdles to serving users rapidly.

Enterprise needs

So what about the enterprise or corporate user? Trained by the web, he/she expects to click on a link and have a video start playing within two seconds. But most internal ECM systems (e.g. for document management) are designed to download a complete file before it is available to the end user.

A story – Here's a scenario I experienced recently. A Finance department prepared a new expense form. To show staff how to use it they prepared a five minute video. The trouble was that their WMV format video was over 300MB. For most staff in a global company, especially remote staff, downloading a 300MB file to view it is just not practical. What Finance needed was to be able to upload the video, and have the system take care of making a rendition that was transcoded and compressed, made stream-able and hosted on a CDN.

There are just too many manual steps and too many options for most newcomers to video creation. Systems should take care of most of those steps. And one excellent way to execute several steps is to have the ECM system create a rendition of a deposited video that contains embed code to start a player and stream video from a CDN. The consumer users can then simply click on the object name in their ECM system and a streamed video starts to play almost instantly – as they have come to expect with sites such as YouTube.

So renditions have a place in the new enterprise again to deal with bandwidth limitations!


Syndicated at http://conversations.opentext.com/

2010-01-14

Really looking forward to Virtual Content World - other ways to be 'virtual'

You've probably heard about the first Open Text Virtual Content World (www.opentext.com/virtualcw) on Tuesday 19 January 2010. Hopefully you can attend. I'll certainly be there in a virtual sense. It's not too late to register, and if you attended Content World 2008 you'll have received a code promotional code for free registration.

For those who can't attend, there is an even 'more virtual' and dare I say free option – watch the many postings on twitter and Facebook.

The twitter hashtag is #otvcw.

The volume of tweets will really pickup on Tuesday if the Content World 2009 experience is any guide, not just from the 'official' event twitter account (@OTContentWorld) but of course from other OT staff like me, and most importantly, customers.

It should be a great event. There as certainly been a lot of organizational activity. Colleagues have told me this virtual event has been as much work as an in-person one.

'See' you there!

Twitter: @MartinSS

Syndicated at http://conversations.opentext.com/

2009-12-18

Customer Community Success Metrics for 2009

Customer communities are all the rage nowadays, but it is not always clear what works and indeed how to measure success.

As 2009 draws to a close we have been reviewing how Open Text customer communities have been doing.

Background: For those not familiar with Open Text, we are a vendor of enterprise-class software to manage digital files (called content). The term enterprise indicates that we sell to organizations not consumers. We have relatively few customer organizations, but they are often typically some of the biggest organizations in business and government. We estimate that at least 1 in 3 Internet users visit sites that run our software! The software we use for our own communities is the same as we sell.

The Sites

For historic reasons, we run three primary community sites (requiring membership) in addition to our typical corporate websites. The community sites are:


  • Open Text Knowledge Centre (KC)
    • Primarily for system administrators of the software we sell


  • Open Text Developer Network (OTDN) which is housed on the KC
    • Primarily for developers using Open Text APIs


  • Open Text Online Communities
    • Primarily for business champions and power users

Site Metrics

  1. The Knowledge Centre is by far the oldest community, dating back to 1996! As you'd expect, it has the most members and the most ongoing activity. Every day approximately 4,000 users access the site, and between 150,000-200,000 documents downloads are performed every month!
  2. OTDN just completed its first full year during which just over 3,200 unique users participated over the past year
  3. Online Communities got started in its present form in 2005. This last year 10,600 members collectively visited 118,000 times over the year
These numbers only measure direct participation. As you might expect, many community members participate through email-mediated discussions.

Convergence

Multiple systems have traditionally meant that there are multiple, disconnected silos of information. As a result, users don't know where to look and administrators have to duplicate critical content between systems.

A better approach is to deploy a single, 'enterprise library' of digital files (content) which contains all of the files, but just one active copy of each. The three sites above will soon converge to use the same enterprise library, which will also be used by our corporate website that is open to the general public.

One single repository can make user navigation harder unless the most relevant content is presented and organized in a fashion that best meets the needs of each type of user (i.e. persona). Communities of users with similar interests or jobs are one approach to organizing content, but of course there are others, including personalization based on the activities and preferences of specific users.

Measuring 2010 success

These communities will continue to develop, but the latest social networking approaches provide new ways to surface important content. As we deploy more social networking approaches during 2010 we'll have a solid base of community metrics from 2009 to judge progress. As you might expect, activities on external sites like twitter, YouTube and facebook are becoming increasingly important.


Syndicated at http://conversations.opentext.com/

2009-11-17

The biggest changes sneak up on you

Content management (ECM) systems can track everything that a user does. Usually this capability is seen in the context of compliance – you can answer the 'who did what' and 'when did they do it' questions. You can also track changes in what users did over time. And so it is that a colleague was able to track how my behaviour has been changing without me noticing it by reporting on how many documents I deposit.

  • A bit of background: I use a number of Open Text Content Server systems. One of these, nicknamed Ollie, is used to support to support content-centric business processes within Open Text. I have authored many documents, mostly in MS Office formats over the course of my nine years with the company.
The 'aha' moment: So when my colleague made a social networking post that he had found that I had deposited almost 700 documents in Ollie, I wasn't surprise. I was surprised though when he pointed out that I hadn't added any documents in the last month! Zero! None!

This of course got me to think: "What had I been doing?"

He asked if I'd mostly moved to social networking-style tools. But no, I've been using collaborative tools of one form or another fairly consistently, and indeed heavily, over the last decade. What I realized as that I have almost entirely shifted to using wikis in place of documents.

  • A bit more explanation is in order. Content Server (formerly Livelink) is a full-featured ECM system. You can add documents of any type, including of course MS Office files. You can also directly author in wikis. On the collaborative/social networking side you can also post to a range of collaborative tools such as forums, discussions, news channels, blogs, etc., and with more recent additions instant messages, status posts, etc. So a user has a range of content and social tools in the same system to use – they can select whatever they feel most suited to the business task at hand. Given these choices you can then track user preference changes over time by analyzing audited events.
On further reflection it shouldn't have been surprising. Once I used to have the Word and PowerPoint applications open all the time. I would typically send documents to colleagues as email attachments or via links to copies in Ollie.

Now I create wiki pages and then rely on automated notifications and RSS for others to learn about them, and of course push awareness by targeted emails. I very seldom open Word to author content, and when I do I get frustrated because all of the embedded code makes it hard for me to reuse the content (unless I force Word to the blog posting mode as I'm doing now).

That's another thing – I repurpose content to multiple channels much more than I used to. I don't simply author 'free-standing' documents and then deposit and email them. I often use the same content in several blogs and/or wikis.

And now I'm starting to create short videos where once I'd have authored a document...

None of this is surprising in an abstract sense. Pundits have been saying that there are huge changes underway and as someone who works in a company at the forefront of how content is managed in organizations, I've been aware of it and promoted it. I just hadn't realized how much my own behaviour has changed; otherwise I wouldn't have been surprised that I didn't deposit a single document in Ollie last month!


Syndicated at http://conversations.opentext.com/