February 04, 2007

742 Google results and most of them are useless!

Will someone at Google please explain this to me?

I just did a search for our company name Cirilab on www.google.ca . I received 742 search results. Wow I thought, that's great. People are starting to recognize our brand and are linking to our website. Then I asked myself the question: Who is linking to the Cirilab website or pages about Cirilab?

So I started to navigate through the 742 Google supplied search results pages. Sure, the first few were ok, but after the first or second page of results the quality of Google search results went way down hill. Many of the results were spoof marketing results pages that when you went to the URL link there was absolutely no mention of Cirilab at all. A significant number of the Google search results seemed to be pages that were indexed at the time that one of Cirilab's Google Adwords advertisements was on the page but when visited at this time had absolutely no real mention of Cirilab at all.

A rought statistical analysis of the 742 results showed me that more than 90% of them were bogus "fake search follow on sites" that want you to then click on their links to continue your search process or were sites that actually contained a Cirilab add at the time of indexing or were sites that used www.cirilab.com as a meta tag.

Worse yet, there were some really valid URLs that poped up in the pages that came long after a lot of bogus and useless results. Clearly Googles automated and human evaluation process is not working. More does not mean better.

Don't believe me, go ahead and do a search for the single word....

cirilab

and evaluate the results. And when you evaluate the results please provide a comment to this post about your assessment. 

 

January 29, 2007

Mindsystems ThemeReader

Thematic Mind Mapping At Its Best

Cirilab Knowledge Interface partner Mindsystems has today released the Mindsystems ThemeReader (MTR). The Mindsystems ThemeReader is an advanced version of the original Export To MindManager.

The Mindsystems ThemeReader allows you to import any unstructured textdocument into MindManager based on its thematic content. In addition to Word, WordPerfect, PDF, Text, Rich Text Format, and even HTML can now be imported directly to Mindmanager. Moreover, the user is provided with the following benefits:

- a Synopsis of the original document based on its Major themes
- a Details Summary of the document based on all its Major and Minor themes
- a powerful Document Navigator which allows you to click on any topic node (theme) and immediately see every sentence that contains that theme
- a link to open the original Entire Document

Mindsystems ThemeReader can be tried for free at the following link:

http://www.mindsystems.com.au/products/themereader/

January 08, 2007

It's the Context, Stupid!

Context is still more important than Content

And this was from 1994! It seems we are just starting to understand that the medium is the message.

http://www.saffo.com/essays/contextstupid.php

When will we finally get it.

December 28, 2006

The US Constitution Says What It Means

Say What You Mean!

The aspect I find most interesting about our technology is when we introduce it to people that are passionate about their writings. When we show someone the Knowledge View of a document they have created we are met with a mix of "yes, that is exactly what I was trying to say" or "no, that Knowledge Signature does not represent what is in my document". Yet, we at Cirilab believe that the document speaks for itself and that our algorithm does indeed reveal the theme structure within a document.

I'm from Missouri so prove it! Well, I'm not personally from Missouri but you get the idea!

So we decided to find a document that is really well written and recognized by all and then run it through our software so that we could demonstrate the accuracy of thematic extraction technology. And what better document than the US Constitution, a document that was reviewed and agreed upon by many before it was published. For Cirilab, the US Constitution is a great example of a document that was really well written and that represents the "themes" that embody the messages that it is trying to convey to the reader. When we look at the current makeup of the US, it's government, and the rights that the country stands for, especially given recent elections, we believe that the Knowledge Signature of the US Constitution reflects very clearly the important themes that the writers wanted to impress upon the reader of the document but more importantly the flavor of the country it was meant to direct.

In short, it is our view that the US Constitution is a very well written and well considered document that speaks very clearly of themes of a country that it is meant to guide during its continued development as well as the balance of power within government that was created to ensure the protection and freedom of its citizens. And I am sure that anyone reading this post will say "well of course it is!". Yes, but how can be demonstrate that it was well written and that it conveys the themes that it was meant to convey to the country's citizens? To prove our point we created several Knowledge View outputs of the US Constitution for all to review and consider. Your review of our Knowledge View of the US Constitution, as well as the Mindsystems generated MindManager mind maps will no doubt be the real test.

Standard Cirilab Knowledge View Output

http://demo.cirilab.com/projects/we_the_people/index.htm

US Constitution Republished based on it’s Thematic Knowledge Signature Outline by www.Mindsystems.com

http://demo.cirilab.com/projects/simpleoutline/index.html

Thematic Knowledge Signature Presentation within MindManager republished by www.Mindsystems.com

http://demo.cirilab.com/projects/presentation/index.html

Thematic Dynamic Outline Web Published View of Cirilab Output republished by www.Mindsystems.com

http://demo.cirilab.com/projects/dynamicoutline/index.html

I look forward to any comments you might have about our Knowledge View and Mindsystems mind maps of the US Constitution. In particular I ask you: "Is the Knowledge Signature a thematically accurate view of US Constitution?".

November 19, 2006

A million URLs served!

So what! What good is a million URLs? More is not better!

And why do they keep doing it? Everybody out there that clicks Next through even to page 10 of your search results please raise your mousing hand, and keep it there.

Why not bring back the best results, the results that are thematically weighted to be as relevant as possible to your original query, and then present them in such a way that allows you to learn more about the results than the query you actually put in? There are really two challenges facing the modern Internet searcher:

1. creating a great query that brings back really relevant results

2. having the information presented in such a way that it tells you more about the query results than you would learn by looking at a list of URLs

And probably the biggest obstacle to all of this is TRUST. How can I trust the results I'm getting back? Well, we have the problem now only now you get back one million results. I believe the trust factor can be overcome with better search queries. However, most search engines only allow you to input ten search words into your query which significantly limits your ability to narrow the number of returned URLs.

What if you could use an entire document as your query? You already trust the information that is in the document in question so that addresses part of the trust issue. If the document can be turned into a complex query and send to the search engine we can be sure to get a narrow yet highly relevant result set back. Of the results returned we can select the top ones and generate a Knowledge Interface(TM) that brings insight into the results that have been returned.

Now, you've got something to work with! I'm from Missouri so show me you say!

In the following example, I'm going to use the Cirilab Discovery information triage utility that walks the talk!

1. Right click on any text document and select Cirilab Discovery.

Right_click_discovery_2 

2. The document will be sent to the Cirilab Knowledge Generation Engine(TM) and a thematic Knowledge Signature(TM) will be produced. The Knowledge Signature(TM) will then be sent to a search engine. Currently, Google, Yahoo, MSN, and Gigablast are supported search engines.

Discovery_searching

3. Results will be gathered and individually assessed for their Thematic Resonance(TM) with the original document used to create the query. Only those with the highest relevance ranking will be kept for classification and aggregation.

Building_interface

4. All of the selected results, the ones with the highest Thematic Resonance(TM) to the original query document are aggregated into a Knowledge Map(TM) of the most relevant results.

Knowledge_map 

Notice that just under 10 highly relevant results are returned. We could stop there but that would be no different than looking at the first page of the search engine results. If search is going to serve the searcher the results need to provide insight into what was returned. The Document Navigator within a Cirilab Knowledge Map does just that.

As the results were returned a Knowledge Signature(TM) was created for each one. The Knowledge Signatures(TM) are compared for relevance and the highest ones are aggregated into a thematic Knowledge Map(TM). This is where the insight is provided. Themes that were found within the results are presented to the searcher within the Knowledge Map Document Navigator and they provide two benefits:

1. the hierarchical theme structures acts as a auto search facility by allowing the user to further narrow the results to just those that meet a particular thematic vein

2. it introduces themes that were discovered in the returned URLs that the searcher may never have thought of looking for in the first place (that's the Discovery part) and upon finding them may leader the searcher to new and exciting areas of investigation

You can explore the results of the above exercise at:

Cirilab Discovery Results using US Constitution as Query Input

You can see the Knowledge Signature of the original document that was used as the input query.

Cirilab Knowledge Signature of US Constitution

Less search results but more relevant ones are what the search engines should be striving for. And then present the results in such a way that provides insight into what has been returned.

Arnold Villeneuve

   

November 15, 2006

Diagramming Anniversary

This week CS Odessa celebrates the Anniversary of its flagship diagramming application ConceptDraw VI. The company marks the occasion with the quiz with three copies of ConceptDraw VI Std give away. The quiz lasts from 15 to 22 of November.

To participate visit

http://www.conceptdraw.com/products/time_limited_offer.php

 

November 09, 2006

Google Sitemaps and Cirilab Knowledge Maps

Are you being indexed Sir/Miss?

I return in a round about way to my previous blog on The Dark Lords of MetaTag Juicing (some of them are my sincere friends like SEO Guru David Jonah). It seems that Google is still bent on putting them out of business. The question remains "how is Google going to evaluate content in the absence of MetaTags as all trust in MetaTags seems to be gone", and rightfully so.

Obviously, Google employees cannot look at every Internet page so some other mechanism seems to be required. That mechanism seems to be "sitemaps". For the webmasters that are not paying attention you are well advised to see the writing on the wall. You see, Google does not care about your MetaTages because they don't believe what you are telling them about the value of their content. Moreover, they have decided to implement a system whereby they (Google) decides the value of your content. This is an important fact to understand because it is a significant shift in the "index me" paradigm.

If you don't see it yet you'll be sorry soon. The paradigm shift is that WebMasters no longer get the option of stating what their content is about but rather that the almighty Google will now decide what your content is about. Google is obviously tired of people abusing MetaTags by telling them that web pages with content about Apples are actually about Oranges and have decided to take action in a serious way to address the issue.

The Emerging Index Paradigm and the SITEMAP!

For those of you that have not stared to look into the emerging concept of a SITEMAP I recommend strongly that you start to look into it in a very serious way because it is the future as Google defines it in the area of Internet search.

There are two sides to the SITEMAP methodology that I can see from my perspective:

1. Tell Google what URLs on your site that you want them to index via a SITEMAP. 

2. Google uses the SITEMAP to crawl the site for the URLs that you specify. It then uses a proprietary and secretive approach to determining the ranking of the content on your website and you have no say in the matter.

Well actually you do, if you know how to use work their system of website and webpage ranking to elevate your search results position. It's all about creating interlinked content and internal website content that refers to trusted external website content. The question is: How do I take all of the content I have now and easily turn it into a "Content Zone(TM)" that Google is willing to rank at a higher level.

That's where Cirilab's Knowledge Maps come into play.  Cirilab's Knowledge Maps are Google Sitemap aware. More importantly, Knolwedge Map content is interlinked because the content is in fact inter-relative within the Content Zone.

There is another key to the success of this approach with Google Sitemaps but you'll need to figure that one out on your own!

October 15, 2006

Leveraging Wikipedia and The Gutenberg Project

Knowledge Generation and Dessimination on Wikipedia

http://en.wikipedia.org/wiki/User:Cirilab

''The Wikimedia Foundation Inc. is a non-profit organization with the goal of '''providing free knowledge''' to every person in the world. Meeting this goal through the maintenance, development and distribution of free content, Wikimedia relies on public donations to run its wiki-based projects.''

Hello Everyone

My name is Arnold Villeneuve and I am with Cirilab Inc. I started this page because of a disagreement with a Wikipedia Editor over the provision of Knowledge Maps (tm) and Knowledge Views (tm) of The Gutenberg Project books and Wikipedia pages within the External Links section of individual Wikipedia pages. My intention is to use this page to engage others in a discussion about what is knowledge and what constitutes an acceptable view of that knowledge from Wikipedia's perspective.

This page is also a call for your assistance. You tell us how we can improve the Knowledge Map and Knowledge Views so that the Wikipedia community is happy with them and we will gladly go about developing the technology so that it can be leveraged by The Gutenberg Project and Wikipedia, and ultimately you the Reader. Ultimately we would like to have a "button" within both of the above websites that would allow any Reader to generate a Knowledge Map or Knowledge View on their own in order to assit them in their research.

From a technical perspective our technology reads a document or collection of documents and generates a Knowledge View and Knowledge Map respectively of the information. The Knowledge Generation (TM) output is meant to provide the reader with information triage tools that allow them to quickly get the gist of the information and determine if it is applicable or important to them. The goal is to provide The Gutenberg Project and Wikipedia readers with additional knowledge generation and analysis views of the available information for free and at no charge to anyone. In fact, it was our intention to provide as a free service to The Gutenberg Project and Wikipedia Editors the generation of Knowledge Maps and Knowledge Views from within their own websites so they can make them available to the reader community.    

The technology brings together in one place the combined value of The Gutenberg Project and Wikipedia information for the end user and provides a research and analysis tools that allows them to enhance their understanding of vast quantities of data.

Our initial approach to this was to select a few of the Top 100 Gutenberg Project authors and create a Knowledge Map of their writings and make it available on the Internet. The Knowledge Map Then we decided to see if we could create a Knowledge View of a Wikipedia page and we were successful. A Knowledge View is of great value in our opinion on large Wikipedia pages where there is a lot of content and the reader wants to quickly arrive at an overview of the page and also locate the information that is important to them. '''We believe that these functions and features are in keeping with the goals and vision of Wikipedia.'''

The initial demonstration link we provided was for Sir Winston Churchill's public domain writings from The Gutenberg Project as well as the related Wikipedia page. We created the following:

- a Knowledge Map of all of Sir Winston Churchill's writings

- a Knowledge View of the Wikipedia web page for Sir Churchill

- a Mind Map of the Wikipedia web page for Sir Churchill

We made these pages available to anyone at no charge on the Internet. We posted the links to these pages on the External Links section of Wikipedia.

However, a Wikipedia Editor did not feel the links were acceptable. That is his call and I respect his decision and the links remain removed from Sir Winston Churchill's page. So we created this page to allow the community to review the Knowledge Maps and Knowledge Views pages themselves to facilitate the forming of their own opinion, all with the hope that you will provide feedback in the open environment that Wikipedia provides for all to see and comment on.

We would like to know the following:

1. Are the Knowledge Interfaces (tm) provided helpful and usefull to you?

2. How should we change them so that they are acceptable to the Wikipedia organization and to the Wikipedia community?

3. Is the concept of placing some advertising links within these Knowledge Interfaces acceptable to the Wikipedia community IF it generates revenue for Wikipedia or should they be removed all together?

4. If our technology is used to produce the freely available Knowledge Interfaces to the community is it acceptable to have a link to our web site on produced Knowledge Maps and Views?

5. Would you like the ability to be able to generate a Knowledge Map / View of Gutenberg Project books or Wikipedia information available on their respective websites?

Sir Winston Churchill Knowledge Interface Links

I created a Knowledge View of the actual Wikipedia Sir Winston Churchill page.

http://www.cirilab.com/TSMAP/Cirilab_Library/Literature/Winston_Churchill/WikiKMapPage/index.htm 

Then we created a Mind Map of Sir Winston Churchill's Wikipedia page using MindManager.

http://www.cirilab.com/TSMAP/Cirilab_Library/Literature/Winston_Churchill/WikiKMapPage/index.html 

Finally, we provided access to a Knowledge Map of Sir Winston Churchill's writings available from The Gutenberg Project.

http://www.cirilab.com/TSMap/Cirilab_Library/Literature/winston_churchill/index.htm

Access to the Knowledge Maps and Knowledge View information is free to all users.

We welcome comments and suggestions from the Wikipedia Editors and Community so that we can improve upon what we are doing to the point that eventually all will find the Knowledge Generation(tm) that we are providing acceptable to all and of value. And of course, Free!

Thank you.

Knowledge View of Sir Winston Churchill Wikipedia Page

Here is the Wikipedia link to this Blog not but be forwarned that it may not actually be there by the time you read this! 

Update

Hello Everyone

Well it has been a busy month so my Blog was getting the big ignore! The new Cirilab web site is up and our Right Click To Knowledge Series(TM) of technology is now available. And the interest has been nothing short of phenomenal. Who'ed of thought that people want software that was as easy to use as a right click of your mouse, or even left click for that matter. We have been processing purchases from all over the world! Here is our Google Ads Geo Map after just 7 days.

Cirilab_adwords_on_google_1   

We will be releasing our Cirilab Knowledge Map software product within the next few weeks and additional WordPerfect Add On products shortly there after.

Cirilab is a Mindjet Technology Partner and recently we received a place on their coveted Solutions web page. http://www.mindjet.com/us/products/mindmanager_solutions/index.php?s=5

There have been many other developments including some work with The Gutenberg Project and Wikipedia sites. More to follow in the next post on these activities.

Now that things are rolling along I'll be getting back to my Blog and starting paying a little more attention to it, and you the reader. 

September 05, 2006

Data is like food!

Information Overload

Fighting data asphyxiation is difficult but possible ... by William Van Winkle

"Data is like food. A good meal is served in reasonably-sized portions from several food groups. It leaves you satisfied but not stuffed. Likewise with information, we're best served when we can partake of reasonable, useful portions, exercising discretion in what data we digest and how often we seek it out." See the rest of this great article at http://www.gdrc.org/icts/i-overload/infoload.html .

www.cirilab.com

My Photo

KnowledgeUser Apparel and Stuff

My Online Status

Knowledge Thinkers

  • ResearchBuzz
    ResearchBuzz: News about search engines, databases, and other information collections.
  • Scott Vine
    Information Management and Librarian Blog with a communications media and technology law slant from Scott Vine.
  • Robin Capper
    I'm a CAD Manager for a Retail Company who in addition to playing with computers likes motorsport, cycling, photography, comedy, music and life in general!
  • Mike Gotta
    Industry Analyst with Burton Group, opinions expressed here are my own and not necessarily those of my employer.

September 2007

Sun Mon Tue Wed Thu Fri Sat
            1
2 3 4 5 6 7 8
9 10 11 12 13 14 15
16 17 18 19 20 21 22
23 24 25 26 27 28 29
30            

Recent Comments