An ABBsolutlYY great day with ABBYY USA!

This past Saturday, January 25th, was the ABBYY USA Winter Holiday party held at Wente Vineyards in Livermore, California.  Founded 130 years ago, Wente Vineyards is the country’s oldest, continuously operated family-owned winery.  The Vineyard itself is absolutely gorgeous with other excellent attractions such as a Bar & Grill, full service restaurant, wine tasting tours and, my personal favorite, a stunning 18-hole Golf Course!


The party itself was scheduled to start that evening at 6pm, with a pre-dinner wine tasting tour starting at 5pm, so a couple of ABBYY golf addicts and I were quick to put two and two together, and a morning golf tournament was born.  A grand total of fourteen players joined the tournament, and we played a best-ball format.  My group consisted of two long hitters and myself.

Hole #10 was particularly noteworthy.  Our strategy was for me to tee off first on each hole because I could generally put the ball ‘in play’ although I wasn’t terribly long.  Then the next two guys could be more aggressive and try to put us in a better position.  It worked out well, especially on the 281-yard tenth hole.  I teed off and put the ball ‘in play’ about 180 yards out, leaving roughly 100 yards to the hole, so this gave us three additional chances to improve.  The next guy in our group, Steve, hit a very nice shot that traveled roughly 210 yards, which left us only about 70 yards, so this was a slight improvement.  That left us two shots with which to be as aggressive as we liked.  Our next guy, Ruben, took the third shot and drove it out of bounds, but we had one more chance.  Ruben also took our fourth shot.

We all agreed that he would ‘grip-it and rip-it’ and he did just that as we had little to lose.  His shot ended up only about 10 feet from the hole making his tee shot roughly 270 yards and directly on target!!!  It was nearly a hole-in-one on a par 4 hole.  It was an amazing shot by Ruben and one to remember for a lifetime!

After a great morning of golf with the group, which went a bit longer than expected, I had less than an hour to rush back to the hotel room I had rented, shower and return for the wine tasting.  So Marc, who was staying at the same hotel, and I hauled it to the hotel to check in and get ready.  Since Marc wasn’t planning on attending the wine tasting, he was nice enough to lend me his car, which was much appreciated.  I made the wine tasting event just as it was starting at 5pm, so a bit of relaxation or a short nap was, obviously, not in the plans.  The wine tasting consisted of a guided tour where the history of the Vineyard and the Wente family was explained, which was interesting.  Then they took us into the “cave” (their terminology), which was actually a large storeroom/warehouse holding all the barrels of fermenting wine, where the process was explained in detail.  It was a great experience.

Lastly, after the wine tasting tour, it was time for the main event: the Winter Party/Dinner.  There was all the usual chit-chat and conversation you might expect.  At about 7pm we all sat down for dinner, which was absolutely delicious.  Drawings for prizes were held throughout the evening and many people won wonderful items but, unfortunately, I was not among the winners.  Nevertheless, I got around to saying hello to nearly everyone, which I sincerely enjoyed.  I also had the chance to meet many of my co-workers’ significant others and/or family members for the first time, which was great as well.


Overall it was a busy, but wonderful day/afternoon and evening.  Thanks to ABBYY USA for sponsoring such a terrific event and especially thanks to all the organizers!

Increase ECM Automation Processes With Higher Resolution Scanning

Source: Business Solutions Magazine


Written by: Kevin Neal, product manager – production scanners, Fujitsu Computer Products of America

When we talk about software automation, it’s safe to say that we truly live in remarkable times. Automation, as it will be referred to in this article, can be defined as allowing a computer to accomplish tasks that traditionally took human intervention and/or action to complete. The rapid adoption of automation via software is driven by several basic technical factors, including high-powered, affordable CPUs (more cycles and lines of code executed per second), drastic increases in memory capacity in conjunction with reduced prices, as well as the ever-evolving intelligence within software packages. The computing resources behind all of the advancements are helping to reduce costs, improve efficiencies, and assist with compliance and regulation.

Software automation is becoming more pervasive among ECM (enterprise content management) and document scanning solutions. The virtue of implementing ECM solutions has historically been cost reduction, which could have meant decreased headcount or reallocating employee resources to other business units. It may also have meant reducing tangible costs such as mailing and shipping charges, eliminating expensive fax transmissions, or freeing up physical space by removing cabinets and file drawers.

Because of computing advancements, businesses and organizations are no longer asking whether ECM systems are truly viable. Instead, they are asking more pointed questions about how large the return on investment is and how quickly they will realize it. In fact, according to Gartner, Inc., the worldwide ECM software market is expected to grow more than 12% per year through 2010, from $2.6 billion in 2006 to more than $4.2 billion in 2010. These days, it’s more about which hardware, software, and services best fit the needs rather than whether or not to put a solution in place.

With most of the pain points of the DIP (document image processing), DIM (document image management), and/or ECM solutions behind us, we now have an opportunity to do more remarkable automation tasks with software. But the success or failure of the entire system is closely tied to the ‘on-ramp’ of electronic document automation and your document scanner, in particular. In the next few paragraphs, I’ll examine several important software automation solutions from some of the premier forms processing and capture software companies in the industry.

High Resolution Maximizes Recognition Results (Contributed by ABBYY)
When scanning for OCR (optical character recognition) or data capture, start with an excellent quality original. This may be the single most important consideration to achieve optimal results for recognition and capture, as well as for the purposes of long-term preservation. In fact, using a high-quality image takes on increasing importance as more users depend on electronic documents to take the place of paper-based originals because of the searchability and cost savings. On the downside, once scanned, the paper document is often no longer available — so it is important to retain maximum quality from the outset.

Today, 300 dpi (dots per inch) color remains the gold standard for scanning. However, high-quality grayscale is an option when color is not achievable (since color scanning often results in 32-bit files). Whenever possible, maintain color images. Color provides additional depth, which enhances the ability of recognition software to gather additional information about the scanned document in order to maximize accuracy. In short, consider quality first when scanning for recognition and archiving.

Classification Of Forms (Contributed by ReadSoft)
Organizations are turning to one portal for all incoming documents — no matter if they arrive on paper or in electronic form. Technology is available to automatically sort incoming documents and classify them according to case. This enables the simple inputting of all incoming mail into a scanner (without any separator sheets) and lets the computer sort the documents. If documents arrive in electronic form, they are also easily incorporated into the flow. By digitizing paper documents through high resolution scanning, users can easily search and retrieve all incoming mail. What will this do for an organization? Efficiency increases when each and every document is distributed correctly. Fast access to status reports and audit trails gives users better control over information flow. In addition, a smooth integration with back end systems such as customer management applications, databases, and archives boosts the performance of IT systems. The overall result of high resolution scanning is automated classification and sorting — less need for document preparation, one portal for all incoming documents, (paper and electronic), electronic distribution to authorized staff, and control of information flows.

300 dpi — Friend Not Foe For Automated Document And Data Capture (contributed by AnyDoc Software, Inc.)
The idea that scanning documents at 300 dpi will create backlogs and bottlenecks within automated document and data capture solutions is an outdated myth. In fact, within many solutions, product settings default to 300 dpi to maximize character recognition, with little or no adverse impact on processing or transmission speed or storage capabilities and with a great positive impact on recognition accuracy. And, when processing healthcare forms such as explanation of benefits (EOB), Health Care Financing Administration (HCFA) and Uniform Bill (UB04) forms, which are known for their notoriously small fonts and extremely high character density per page, proper resolution is critical. At a 300 dpi setting, recognition engines are optimized and file size is still very manageable. Because the average size of a 300 dpi 8.5” x 11” bi-tonal TIFF image is 40 KB, approximately 3,000,000 document images can be stored on a standard 120 GB hard drive.
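The storage arithmetic above can be checked with a short sketch. It assumes decimal units (1 KB = 1,000 bytes and 1 GB = 1,000,000,000 bytes), which is what makes the 3,000,000 figure come out exactly; the constant and function names are illustrative only.

```python
# Page geometry and file-size figures come from the paragraph above.
PAGE_WIDTH_IN = 8.5
PAGE_HEIGHT_IN = 11
DPI = 300
AVG_FILE_BYTES = 40 * 1000       # 40 KB, decimal units (assumption)
DRIVE_BYTES = 120 * 1000 ** 3    # 120 GB, decimal units (assumption)


def uncompressed_bitonal_bytes(width_in, height_in, dpi):
    """Raw size of a 1-bit-per-pixel page before any compression."""
    pixels = round(width_in * dpi) * round(height_in * dpi)
    return pixels // 8


raw_bytes = uncompressed_bitonal_bytes(PAGE_WIDTH_IN, PAGE_HEIGHT_IN, DPI)
images_per_drive = DRIVE_BYTES // AVG_FILE_BYTES
compression_ratio = raw_bytes / AVG_FILE_BYTES

print(f"Uncompressed page: {raw_bytes:,} bytes")
print(f"Images per drive:  {images_per_drive:,}")
print(f"Implied compression ratio: about {compression_ratio:.0f}:1")
```

The uncompressed page is a little over 1 MB, so a 40 KB average implies that bi-tonal compression is doing most of the work; actual file sizes will vary with how dense each page is.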

In decades past, files competed for space that was limited and expensive, but no more. Now, a 40 KB file travels on today’s fast networks at what can be conversationally considered to be the speed of light. A lower scanning resolution can negatively impact data recognition, which is not offset by the saving of space — no longer the limited commodity it once was.

And, some of the better document processing packages will process at 300 dpi, but output at a lower resolution (e.g., 200 dpi), giving you the best of both worlds. Scanning at a higher resolution can dramatically improve data recognition, decrease the need for human intervention, and increase the efficiency of all downstream applications without negatively impacting electronic transmission or storage space.

More dots per inch (dpi) for increased automation
So, maybe now you’re thinking: “Of course I want everything automated and I’ll scan everything at 300 dots per inch, in color, or both.” Well, not so fast. First, we must consider the risks versus the rewards of this type of decision, as we address in an upcoming article entitled “Trends Towards Higher Resolution Scanning.”

To quote Gartner, “The quality, performance, and ease of use of software products will improve.” This will help drive adoption; however, an inefficient document capture solution that results from settling for anything less than the fullest software automation should be unacceptable these days, considering the pros and cons of higher resolution scanning.

In a day and age where no two ECM solutions are built alike, and organizations have choices for software automation components, it’s important to implement the best-of-breed solutions that garner optimal automation results. Whether it is OCR, ICR (intelligent character recognition), forms processing, separation, classification, unstructured forms, bar code recognition, etc., each step in the automation workflow depends directly on the step before it, and it all starts with document scanning. As more desktop scanners are deployed throughout organizations, there is certain to be an ever-increasing demand for ease of use and automation. Give your ECM solution the best chance for automation success and don’t underestimate the trend towards higher resolution scanning.

For more information on topics covered in this article or more information in general please visit:

Fujitsu –


AnyDoc Software –

ReadSoft –

Kevin Neal, product manager – production scanners, with Fujitsu Computer Products of America has been involved in the document scanning/enterprise content management industry for over 18 years. He has held various customer service, sales and management positions for many hardware and software products during his career. In addition, he has years of experience installing, configuring, and troubleshooting networking components as a consultant and network administrator. Currently he handles product management responsibilities for Fujitsu’s complete line of production scanners.

Demystifying Forms Processing and Data Capture

Forms Processing is a proven technology that allows organizations of all sizes to improve efficiency and decrease operational costs.  There are many case studies available online to support these facts.  When implemented properly, the cost of a Forms Processing solution can easily be justified with a tangible 12-18 month return on investment.  With such overwhelming evidence of decreased operational costs and drastically improved efficiency, a logical question is why every business in the world isn’t using this wonderful technology.  Traditionally, only large organizations with dedicated technical staff and humongous IT budgets could consider implementing a sophisticated Data Capture solution, but times are changing.  It no longer has to take years to realize the benefits of Forms Processing that were once available only to Fortune 1000 type companies.  In this blog post I hope to dispel the myth that this useful technology is only available to Enterprise organizations.

While the concept of automatically extracting information from a hard copy document is not new, what is new is a different method of implementation.  Specifically, the “cloud” offers an intriguing opportunity for Data Capture.  Why?  First, Data Capture is a very CPU-intensive process and the cloud offers unmatched processing power within gigantic data centers.  Second, sharing resources and ‘renting’ a cloud service such as ‘Cloud Capture’ reduces the barrier to entry.  The upfront cost of implementing Data Capture no longer has to be an issue: it can now be an Operating Expense rather than a Capital Expenditure.

I have written previously about the “No Folder Zone” and in this blog post I will elaborate on how to avoid using Folders as a cop-out for a truly effective Information Capture solution.  In a traditional on-premise software installation, once the Forms Processing system is installed, tuned and tested, it is ready for deployment.  This is the point where the Document Capture system Crosses the Chasm and the organization can now truly benefit from the 80% investment and turn this effort into 80% benefit.

The basics of Forms Processing are quite simple and straightforward.  The idea is to create a template overlay of the form from which you wish to extract information.  As seen in the photo to the left, you would basically draw zones over the image where you can capture typed text (Optical Character Recognition, or OCR), handwritten text (Intelligent Character Recognition, or ICR) or even check boxes (Optical Mark Recognition, or OMR).  Once the template is created, the next time the system encounters this type of form these fields will be captured automatically, eliminating manual data entry.
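As a rough illustration of the template idea, the sketch below models a form template as a list of named zones and reads one check box from a tiny made-up bitmap. Everything here (the `Zone` class, the field names, the 25% darkness threshold) is a hypothetical stand-in rather than any vendor's actual format, and real OCR/ICR would hand the cropped zone to a recognition engine instead of counting pixels.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Zone:
    """One rectangle drawn over the form image (hypothetical structure)."""
    name: str
    kind: str      # "OCR", "ICR" or "OMR"
    left: int
    top: int
    width: int
    height: int


def crop(image, zone):
    """Return the part of a 2D pixel grid covered by a zone."""
    rows = image[zone.top:zone.top + zone.height]
    return [row[zone.left:zone.left + zone.width] for row in rows]


def read_checkbox(image, zone, threshold=0.25):
    """OMR: treat the box as checked when enough pixels are dark (1)."""
    pixels = [p for row in crop(image, zone) for p in row]
    return sum(pixels) / len(pixels) >= threshold


# The template is defined once and reused for every form of this type.
TEMPLATE = [
    Zone("last_name", "OCR", left=0, top=0, width=4, height=2),
    Zone("agree", "OMR", left=4, top=0, width=2, height=2),
]

# A toy 6x3 bi-tonal "scan": 1 is a dark pixel, 0 is white.
page = [
    [0, 0, 0, 0, 1, 1],
    [0, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
]

agree_zone = TEMPLATE[1]
checked = read_checkbox(page, agree_zone)
print(f"{agree_zone.name}: {'checked' if checked else 'not checked'}")
```

The point of the design is that the zones live in data, not code, so supporting a new form type means drawing a new template rather than writing new software.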

One of the most important objectives of any data capture system should be the quality of the information being captured versus just the pure speed of the system.  The accuracy of information captured is based on many factors including original document quality, image enhancement or scan resolution, but a critical step is to validate, or verify, any questionable data BEFORE it enters your information system.  There are many effective methods for capturing highly accurate data, including logic such as requiring a Social Security Number field to contain only numbers, never letters, so that the number “5” would not be incorrectly recognized as the letter “S”.  In a perfect world you would hope for no verification at all, but this is simply not reasonable all the time.  A good rule of thumb is that 2% verification is acceptable, which means 98% of the work is done for you quickly and automatically.  This translates into major efficiency gains.
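The field-level logic described above might look something like the following sketch. The look-alike table, the `NNN-NN-NNNN` pattern and the 0.90 confidence cut-off are assumptions chosen for illustration; a real capture product would expose its own rule syntax and confidence scores.

```python
import re

# Letters that recognition engines commonly confuse with digits.
# This table is an illustrative assumption, not an exhaustive list.
LOOKALIKES = str.maketrans({"S": "5", "O": "0", "I": "1", "l": "1", "B": "8"})

SSN_PATTERN = re.compile(r"\d{3}-\d{2}-\d{4}")


def validate_ssn(raw_text, confidence, min_confidence=0.90):
    """Return (value, needs_review) for a recognized SSN field.

    Because the field may only contain digits, look-alike letters are
    converted to digits. The field is still routed to a human verifier
    if the result is malformed or the engine reported low confidence.
    """
    corrected = raw_text.strip().translate(LOOKALIKES)
    well_formed = SSN_PATTERN.fullmatch(corrected) is not None
    needs_review = (not well_formed) or confidence < min_confidence
    return corrected, needs_review


for text, conf in [("12S-4S-678O", 0.97), ("123-45-67", 0.99)]:
    value, review = validate_ssn(text, conf)
    print(f"{text!r} -> {value!r}, needs review: {review}")
```

Only the fields flagged for review reach an operator, which is how a system can aim for something like the 2% verification rate mentioned above.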

A key misconception about Data Capture, or Forms Processing, is that the integration into back-end systems needs to be complicated or costly.  While this could be true, the fact of the matter is that all electronic information systems rely on some flavor of a database.  And basically a database is composed of a bunch of tables with fields.  In the context of Forms Processing, think about a table of Document Types.  In the Document Types table you have the various types of documents you wish to capture, and the Fields are the index values you wish to extract from an image.  So the real magic is “matching” the extracted index values to the fields in the database.  I think the term “Field Mapping” most accurately describes this integration of Data Capture technology with Electronic Information Systems.  Fortunately, new trends in open connectivity such as Web Services and Content Management Interoperability Services (CMIS) are making the connectivity between Capture and Storage much more affordable and less time-consuming than ever.
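To make “Field Mapping” concrete, here is a minimal sketch that uses Python's built-in `sqlite3` module as a stand-in for whatever database sits behind the information system. The table name, column names and capture field names are all hypothetical.

```python
import sqlite3

# Hypothetical mapping from capture field names to database columns.
FIELD_MAP = {
    "Invoice Number": "invoice_no",
    "Vendor": "vendor_name",
    "Total": "total_amount",
}


def store_document(conn, captured):
    """Insert one captured document, keeping only the mapped fields."""
    row = {FIELD_MAP[name]: value
           for name, value in captured.items() if name in FIELD_MAP}
    columns = ", ".join(row)
    placeholders = ", ".join("?" for _ in row)
    # Column names come from the trusted FIELD_MAP constant above;
    # captured values are always passed as bound parameters.
    conn.execute(
        f"INSERT INTO invoices ({columns}) VALUES ({placeholders})",
        list(row.values()),
    )


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE invoices (invoice_no TEXT, vendor_name TEXT, total_amount REAL)"
)

store_document(conn, {
    "Invoice Number": "INV-1001",
    "Vendor": "Acme",
    "Total": 249.5,
    "Page Count": 3,   # not in FIELD_MAP, so it is ignored
})

stored = conn.execute(
    "SELECT invoice_no, vendor_name, total_amount FROM invoices"
).fetchall()
print(stored)
```

The mapping itself is just a small lookup table, which is the sense in which the integration does not have to be complicated.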

As I mentioned earlier in this blog post, all applications have some flavor of a database to store information.  It’s just a fact of how things operate, and if you really think about it, all we have to do is match Data Capture fields with database fields to make a fully integrated Data Capture solution.  Oftentimes we get wrapped around the axle on the technical details, but when we simplify integration to its lowest common denominator, we can truly dispel the myth that Forms Processing is too complicated or expensive for everyone to utilize.

Now that I’ve covered the basics of Forms Processing and illustrated the fact that interoperability can be achieved rather easily in certain cases, I hope that we can move out of the stone ages of manual data entry and realize a truly efficient organization with Automatic Data Capture.

AIIM has just published a whole suite of educational videos on a collection of interesting topics, including one on Information Capture.

Microsoft SharePoint – FAQs


1. What are the benefits of utilizing SharePoint for document imaging/ECM?

One of the main benefits of utilizing SharePoint for document imaging/ECM is the limited learning curve for both users and systems administrators. With Microsoft operating systems and office applications being the primary graphical user interfaces most people use in their organizations, it makes sense that the commonality between SharePoint and an application such as Outlook gives users a comfort level that typically does not require complicated training. This decreased learning curve leads to quicker adoption of the technology, allowing organizations to focus on building out SharePoint sites for actual use, which leads to tangible enhanced productivity. This is important because sometimes an organization can get burdened with months of installation, configuration and training before a system ever goes “live”, which is not only time-consuming but also costly, and it leaves a bad impression that affects the ultimate success or failure of the system. Quickly demonstrating enhanced productivity through user adoption of a familiar graphical user interface within a departmental process such as invoice processing, for example, leads stakeholders within organizations to sponsor additional departmental process improvement projects or even complete enterprise roll-outs of document imaging/ECM systems.


2. What is one of the most common misconceptions about scanning into SharePoint?

The idea that scanning into SharePoint is difficult or expensive seems to be a common misconception. There are more options than ever to scan a document into SharePoint and we think breaking down the high-level techniques for scanning and applying them to how organizations typically may scan into SharePoint is important.

There are three basic ways to get a scanned image with corresponding metadata, or search terms, into SharePoint. The first scanning option is Manual Indexing, where users scan a document, then connect to SharePoint where a SharePoint Document Type has some associated metadata. The user types in the metadata for this particular scanned document, then simply uploads the document directly into SharePoint. The second scanning option for SharePoint is Automatic Indexing. This is a more automated, but also more costly, option that is typically used to process higher volumes of documents. With the Automatic Indexing option, information from the scanned pages such as bar code values or printed characters such as invoice numbers, social security numbers or other data can be automatically extracted and sent directly into SharePoint. Lastly, a network scanning approach can involve either Manual Indexing or Automatic Indexing; however, the important point about this method of scanning into SharePoint is its appeal from an ease-of-use standpoint for users and from a device management perspective for network administrators. Network scanners typically are dedicated-use devices where scanning into ECM systems such as SharePoint is their sole purpose; therefore, making scanning easy was a priority in their design. Features such as bright, colorful touch screens make image preview simple and easy. Integrated hardware keyboards make indexing documents quick and efficient as well. So, as we have illustrated, there are several high-level methods for scanning into SharePoint and the right method really depends on your organizational requirements.


3. How do you add ‘scanning to SharePoint’ functionality to a SharePoint server? And, is it expensive?

Adding ‘Scan to SharePoint’ functionality is surprisingly simple. The wonderful thing about adding document scanning capabilities to SharePoint is that it involves no additional software installed on the server itself. There is optional third-party Imaging software that can be installed on the server to optimize performance, improve scalability and enhance search, but this is not a requirement to scan documents.

Simply install the Fujitsu scanning software application on a workstation. Then once the Document Libraries have been created in SharePoint with the corresponding metadata, or search terms, all that needs to be done is to connect to the SharePoint site and supply login credentials. After this simple configuration is completed users will never have to configure the software again. When new Document Types are added to SharePoint, or if metadata fields change, then the user will dynamically see these changes without ever having to change the scanning application.

The expense to add ‘scanning to SharePoint’ can literally be as inexpensive or expensive as an organization’s scanning volumes and/or requirements dictate. Many scanner hardware vendors provide some simple options for scanning to SharePoint in-the-box with the scanner, so the expense is just the scanner itself and not additional software. However, if an organization requires a higher level of automation to do sophisticated data extraction, such as automatic document recognition and document separation, followed by capturing the data and automatically releasing it to SharePoint, this could be a more expensive proposition. It’s important to remember that this expense can be easily justified by reduced human labor. Other examples of the payoff include the ability to take advantage of more pre-pay discounts on invoices, or better customer service through immediate access to information.

Example of simply connecting to SharePoint once. Supply some basic information once then all SharePoint updates are instantly available dynamically and visible in your document scanning software application.

4. How important is document capture software compatibility with SharePoint?

It can be, but the truth is that the capabilities are limited. Also, it is important to note that no matter what version of SharePoint you have (Microsoft Windows SharePoint Services 3.0, Microsoft Office SharePoint Server 2007 or SharePoint 2010) the software must be configured before it is usable. Microsoft Windows SharePoint Services 3.0 is the free component of SharePoint that is included with Windows Server 2003 or it is a free download for customers who have Windows Server 2008.

Also, it is important to note that while SharePoint has its strengths, just like any other product, it also has its weaknesses. We have seen many situations where SharePoint is being used in conjunction with other complementary document management systems. The right solution truly depends on an organization’s business requirements and all options should be thoroughly investigated.


5. What capabilities are needed to help end users have a better ‘Scan to SharePoint’ experience?

There are a few capabilities that are needed to help end users have a better experience when scanning documents to SharePoint. First, image enhancement is an absolute must. Anyone that has ever scanned documents must have felt the pain of having to rescan documents for various reasons. Maybe the image quality was poor? Maybe the page was scanned upside down? Maybe only the front side was scanned on a double-sided document? These are just a few examples of situations where the user would have to stop their process to rescan these documents, which not only is a waste of time but also is costly in lost productivity terms. Image enhancement technology, which can dynamically adjust for perfect image quality and perform automatic tasks such as automatic page orientation, intelligent blank page removal, automatic color detection, automatic cropping and automatic deskew, is key to helping users have a pleasant experience. If scanning documents is a chore then users will resist using technology that is difficult to use.

Secondly, the method of capture is another critical consideration. There are several methods for capturing documents into SharePoint. Some of the common approaches are manual indexing, automatic indexing and network scanning. Manual indexing is ideal for ad hoc or low-volume scanning. With manual indexing the scanning application captures an image, then the image is presented and the user types the metadata into the fields configured on the SharePoint Server. This approach is the most cost-effective, yet still includes the important step of capturing metadata to be associated with the scanned images. Alternatively, automatic indexing is ideal for large volumes and/or when the documents have some sort of fixed content structure. For example, the Census 2010 forms have a fixed structure where a particular field such as Social Security Number is always in the same place on the document. It’s easy to design document scanning templates that can automatically and quickly extract this information and place both the scanned image and the associated metadata directly into SharePoint. Lastly, the network scanning approach is one of the newest methods of capturing scanned images into SharePoint. The benefits of network scanning are typically ease of use, with simple touch screen operation for the user, and ease of deployment and ongoing maintenance for the administrator. A network scanner can be configured to use either the manual indexing or automatic indexing approach as described above. So, as you can see, there are several methods for capturing scanned documents into SharePoint and the right approach, or combination of approaches, really depends on an organization’s requirements.

Example of Manual Indexing into SharePoint.


6. How are hardware vendors addressing the vigorous adoption of SharePoint?

Scanner hardware vendors are clearly trying to address the vigorous adoption of SharePoint by including some level of SharePoint integration in-the-box or even embedded into devices such as network scanners. Scanning to SharePoint has to be easy to set up and easy to use. Oftentimes SharePoint is deployed as a document management system where this may be the system administrator’s first experience with this type of software. To help reduce the burden on system administrators, many scanner hardware vendors offer simple solutions for configuring and using the scanning software. This allows the system administrator to focus their time on learning the server-side functionality, such as creating Document Libraries, creating Columns for metadata or establishing document workflow.

Example of embedded SharePoint connectivity using the Fujitsu network scanner.

7. What makes scanning to SharePoint different than scanning to any other content management repository or platform on the market today?

The user experience of scanning to SharePoint is not unlike that of other content management repositories or platforms available on the market today. Most scanning applications can connect directly to a repository and show index fields based on document types. Also, most scanning applications can utilize either the manual indexing or automatic indexing techniques described in question 5 above. With SharePoint, the main appeal is the ability for users to manage the overall SharePoint experience. SharePoint offers users the ability to create their own ‘sites’ without the involvement of the Information Systems department. This is basically the equivalent of your own web site where you can store all your electronic content, including scanned images. Within these sites, users can create a custom page using different ‘web parts.’ For example, a user can have a news feed in the top-left portion of the page, a business intelligence chart of daily sales activity in the bottom-left, a spreadsheet of current stock prices in the top-right, and finally, a web part with point-and-click access directly to scanned images in the bottom-right. Some people might refer to this as a ‘dashboard’ specifically tailored to the information and applications a user feels are most relevant to them.


8. What trends are hardware vendors and solutions providers seeing in terms of SharePoint customization?

There are two main trends that hardware vendors and solution providers are seeing in terms of SharePoint customization. Both of these trends involve careful planning of the SharePoint system. We recommend that you do not rush to simply begin scanning and importing high volumes of documents into SharePoint without a well thought-out strategy. First, the ability to more effectively manage SharePoint is a big trend. Some organizations that have migrated to SharePoint from simple shared network drives have found that, while they achieved the intended reduction in paperwork, they now have nothing more than another electronic mess of content. There are several SharePoint Solution Providers that are successful in helping organizations get a better handle on their SharePoint system even after it has been deployed. Secondly, and somewhat related to the manageability of SharePoint, is the importance of metadata and a well thought-out document taxonomy. Metadata refers to the key search words used to retrieve documents stored in SharePoint. If an organization is not capturing the right, or accurate, metadata on associated documents then it could mean a complete failure to gain any meaningful benefit from a SharePoint system. A taxonomy provides a formal structure for information, based on the individual needs of a business. Categorization tools automate the placement of content (document images, email, text documents, i.e., all electronic content) for future retrieval based on the taxonomy. Users can also manually categorize documents. Categorization is a critical step to ensure that content is properly stored.
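A very simple categorization tool can be sketched as a keyword lookup against the taxonomy. The category paths and keywords below are invented for illustration, and commercial categorization tools generally use far more sophisticated techniques than counting keyword hits.

```python
# Hypothetical taxonomy: category path -> keywords that suggest it.
TAXONOMY = {
    "Finance/Invoices": {"invoice", "remit", "amount due"},
    "HR/Applications": {"resume", "applicant", "position"},
    "Legal/Contracts": {"agreement", "party", "hereby"},
}


def categorize(text, taxonomy=TAXONOMY, default="Unclassified"):
    """Pick the category whose keywords appear most often in the text.

    Content that matches no keyword at all is returned as the default
    so that a person can categorize it manually.
    """
    lowered = text.lower()
    scores = {
        category: sum(keyword in lowered for keyword in keywords)
        for category, keywords in taxonomy.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else default


print(categorize("Invoice 1001: amount due in 30 days"))
print(categorize("Lunch menu for Friday"))
```

Whatever the technique, the fallback matters: content the tool cannot place should be surfaced for manual categorization rather than filed somewhere arbitrary.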


9. How does the implementation of SharePoint impact your current document management system?

Without a doubt, the implementation of SharePoint is either going to drastically improve productivity or become a burden to your organization. It will affect your organization either positively or negatively, but the SharePoint Effect will certainly be felt. Let me be specific.

Only a few short years ago, we think many organizations were under the false impression that SharePoint Server contained all the same capabilities as traditional Enterprise Content Management (ECM) systems, which was not the case. For example, auditing is a very important concept for ECM systems, and until SharePoint 2010 added full-featured auditing with the Compliance Details screen, Microsoft lacked this functionality, which is pretty standard in most ECM systems. Another important ECM concept missing from previous generations of SharePoint was Managed Metadata (also a new feature of SharePoint 2010). Managed Metadata allows organizations to define a set of terms to be used in a consistent manner when applying searchable terms to scanned documents. The point is that certain deficiencies within the suite of SharePoint capabilities left organizations unable to adhere to compliance regulations (due to the lack of auditing, for example) or without control over a consistent metadata strategy. These are a few examples where a SharePoint implementation might have been perceived to be a failure due to a lack of understanding of the critical ECM features organizations require.
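The principle behind a managed set of terms can be sketched in a few lines of Python. This is only an illustration of the concept and does not use the SharePoint API; the term set, the synonym table and the `normalize_term` function are hypothetical names chosen for the example.

```python
# Hypothetical approved term set for a "Document Type" index field.
TERM_SET = {"Invoice", "Purchase Order", "Contract"}

# Hypothetical synonyms that should resolve to an approved term.
SYNONYMS = {"po": "Purchase Order", "bill": "Invoice"}


def normalize_term(value):
    """Map what a user typed to an approved term, or reject it.

    Rejecting unknown values is what keeps the metadata consistent:
    nobody can index one document as 'Invoice' and the next as 'Inv.'.
    """
    cleaned = value.strip().lower()
    for term in TERM_SET:
        if term.lower() == cleaned:
            return term
    if cleaned in SYNONYMS:
        return SYNONYMS[cleaned]
    raise ValueError(f"'{value}' is not an approved term")


doc_type = normalize_term(" invoice ")
```

The design choice worth noting is that free-text entry is never stored directly; every value either resolves to an approved term or is refused at scan time.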

Consequently, a solid understanding of SharePoint’s true capabilities helps organizations benefit greatly from SharePoint’s core strengths. These core strengths have traditionally been focused around collaboration and portals; in other words, the sharing of electronic items such as Excel spreadsheets, Word documents, PowerPoint presentations and now, of course, scanned images. It should be noted that with some of the new features of SharePoint 2010, Microsoft is incorporating specific ECM capabilities to make SharePoint more appealing as a complete ECM solution. Many customers have shared with us that SharePoint in conjunction with other ECM software seems to be a solution that works well for them. To illustrate this point we will use a Records Management application as an example. Prior to SharePoint 2010, SharePoint lacked true Records Management capabilities such as ‘holds’ or ‘document retention periods’. Therefore, organizations could use SharePoint for their ‘active documents’ such as an Excel price list that needs updating by a team of people. These people could access the same document, check it out, edit it and then check it back into SharePoint for the next person to check out and edit. However, once the spreadsheet was finalized and defined as a permanent ‘record’, the document would be committed to the traditional ECM Records Management system.


10. Can you scan to SharePoint without using another application?

No, Microsoft SharePoint does not offer any native support for document scanning. There are some creative ways to import images into SharePoint via e-mail or shared folders. However, this is not ideal because there is no way to apply metadata, or search words, to those particular scanned documents. The true power of SharePoint, or any other Enterprise Content Management (ECM) system, is the ability to keep your information organized and searchable. Adding relevant metadata, and not simply full-text OCR, to scanned images makes the system much more usable. For example, imagine you have a collection of one thousand images in your SharePoint repository, you have done full-text OCR on each document, and you then search for the term ‘scanner documents.’ If you are in the scanner business, then it is very likely that nearly every one of those one thousand documents will be returned as a possible match for the document you were looking for. However, if as a business rule or policy your organization decided on a logical taxonomy to classify your documents and applied only relevant metadata, then your search results would be much more pertinent to your query.
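The difference between the two kinds of search in the scanner example can be shown with a small Python sketch. It is an assumption-laden toy, not SharePoint search: the `ScannedDoc` class, the field names `doc_type` and `customer`, and the sample documents are all made up for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class ScannedDoc:
    name: str
    ocr_text: str                       # full-text OCR result
    metadata: dict = field(default_factory=dict)  # index fields applied at scan time


def full_text_search(docs, query):
    """Return every document whose OCR text contains any query word."""
    words = query.lower().split()
    return [d for d in docs if any(w in d.ocr_text.lower() for w in words)]


def metadata_search(docs, **criteria):
    """Return only documents whose metadata matches every criterion."""
    return [
        d for d in docs
        if all(d.metadata.get(key) == value for key, value in criteria.items())
    ]


# Hypothetical repository for a company in the scanner business.
docs = [
    ScannedDoc("a.tif", "Scanner warranty documents for model X",
               {"doc_type": "Warranty", "customer": "Acme"}),
    ScannedDoc("b.tif", "Invoice for scanner purchase",
               {"doc_type": "Invoice", "customer": "Acme"}),
    ScannedDoc("c.tif", "Scanner service documents and contract",
               {"doc_type": "Contract", "customer": "Globex"}),
]

broad = full_text_search(docs, "scanner documents")
narrow = metadata_search(docs, doc_type="Invoice", customer="Acme")
```

In this toy repository the full-text query matches every document because each one mentions scanners, while the metadata query returns only the single Acme invoice.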

It is key to a successful ECM implementation to carefully consider the importance of applying metadata to scanned images. Otherwise you might simply replicate a current paper-based filing system with an electronic mess of disorganized and lost images.


11. Can SharePoint be a document management system for you, out of the box?

It can be, but the truth is that the capabilities are limited. It is also important to note that no matter what version of SharePoint you have (Microsoft Windows SharePoint Services 3.0, Microsoft Office SharePoint Server 2007 or SharePoint 2010), the software must be configured before it is usable. Microsoft Windows SharePoint Services 3.0 is the free component of SharePoint that is included with Windows Server 2003, or it is a free download for customers who have Windows Server 2008.

Finally, while SharePoint has its strengths, just like any other product, it also has its weaknesses. We have seen many situations where SharePoint is used in conjunction with other complementary document management systems. The right solution truly depends on an organization’s business requirements, and all options should be thoroughly investigated.