<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	
	>
<channel>
	<title>
	Comments on: Automatic metadata generation	</title>
	<atom:link href="./index.html" rel="self" type="application/rss+xml" />
	<link>./../index.html</link>
	<description>Artist- and Librarian-At-Large</description>
	<lastBuildDate>
	Sat, 04 Jun 2016 18:04:58 +0000	</lastBuildDate>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=5.1.8</generator>
			<item>
				<title>
				By: Eric Lease Morgan				</title>
				<link>./../comment-page-1/index.html#comment-1157</link>
		<dc:creator><![CDATA[Eric Lease Morgan]]></dc:creator>
		<pubDate>Thu, 06 Aug 2009 18:10:14 +0000</pubDate>
		<guid isPermaLink="false">./../../../../index.html?p=335#comment-1157</guid>
					<description><![CDATA[Thank you for the useful comments, and rest-assured, I have not been ignoring you.

Yes, an additional next step would be to map the automatically generated subject &quot;tags&quot; to Library of Congress Subject Headings or some other controlled vocabulary. This has been done a number of times previously with various degrees of success, but compared to the other problems at hand -- such as making the documents themselves more useful and interrelated  -- I plan to forgo this option at the present time. I understand the value of a controlled vocabulary, and that is why I have retained it whenever possible.]]></description>
		<content:encoded><![CDATA[<p>Thank you for the useful comments, and rest-assured, I have not been ignoring you.</p>
<p>Yes, an additional next step would be to map the automatically generated subject &#8220;tags&#8221; to Library of Congress Subject Headings or some other controlled vocabulary. This has been done a number of times previously with various degrees of success, but compared to the other problems at hand &#8212; such as making the documents themselves more useful and interrelated  &#8212; I plan to forgo this option at the present time. I understand the value of a controlled vocabulary, and that is why I have retained it whenever possible.</p>
]]></content:encoded>
						</item>
						<item>
				<title>
				By: Tom Burton-West				</title>
				<link>./../comment-page-1/index.html#comment-1138</link>
		<dc:creator><![CDATA[Tom Burton-West]]></dc:creator>
		<pubDate>Fri, 31 Jul 2009 18:17:51 +0000</pubDate>
		<guid isPermaLink="false">./../../../../index.html?p=335#comment-1138</guid>
					<description><![CDATA[Hi Eric,

You might want to take a look at the open source Kea software, which does something similar (It uses tf*idf and some other heuristics). 

http://www.nzdl.org/Kea/

Tom]]></description>
		<content:encoded><![CDATA[<p>Hi Eric,</p>
<p>You might want to take a look at the open source Kea software, which does something similar (It uses tf*idf and some other heuristics). </p>
<p><a href="http://www.nzdl.org/Kea/" rel="nofollow">http://www.nzdl.org/Kea/</a></p>
<p>Tom</p>
]]></content:encoded>
						</item>
						<item>
				<title>
				By: Karen Coyle				</title>
				<link>./../comment-page-1/index.html#comment-1136</link>
		<dc:creator><![CDATA[Karen Coyle]]></dc:creator>
		<pubDate>Fri, 31 Jul 2009 15:41:25 +0000</pubDate>
		<guid isPermaLink="false">./../../../../index.html?p=335#comment-1136</guid>
					<description><![CDATA[Eric, have you looked at the work done at NLM on using generated metadata as part of the subject cataloging process? They found that presenting catalogers with a list of machine-generated subjects was helpful, especially for the less experienced catalogers. Info at: http://ii.nlm.nih.gov/.]]></description>
		<content:encoded><![CDATA[<p>Eric, have you looked at the work done at NLM on using generated metadata as part of the subject cataloging process? They found that presenting catalogers with a list of machine-generated subjects was helpful, especially for the less experienced catalogers. Info at: <a href="http://ii.nlm.nih.gov/" rel="nofollow">http://ii.nlm.nih.gov/</a>.</p>
]]></content:encoded>
						</item>
						<item>
				<title>
				By: Nathan				</title>
				<link>./../comment-page-1/index.html#comment-1134</link>
		<dc:creator><![CDATA[Nathan]]></dc:creator>
		<pubDate>Fri, 31 Jul 2009 11:47:41 +0000</pubDate>
		<guid isPermaLink="false">./../../../../index.html?p=335#comment-1134</guid>
					<description><![CDATA[Eric,

Interesting.  When you say &quot;Third, automatically generated keywords and phrases were many times just as useful as the librarian-assigned Library of Congress Subject headings.&quot;

I grow concerned though, because the strings:

Universalism United States History
Unitarian Universalist churches United States

...seem to me to be much more fulsome and useful.  I&#039;m not saying your automatically generated list isn&#039;t useful, or that it isn&#039;t complementary to LCSH - I&#039;m just concerned with your phrase &quot;just as useful&quot; which, pardon me - seems to give you away as someone who, when weighing the options, will be more than willing to let LCSH go when the time comes.

And that, I think, is not going to serve us well.

~Nathan 

 Many of the items harvested from the Internet Archive were complete with MARC records. Some of those records included subject headings. During Step #5 (above), I spent time observing the output and comparing it to previously assigned terms. Take for example a work called Universalism in America: A History by Richard Eddy. Its assigned headings included: 

Universalism United States History
Unitarian Universalist churches United States]]></description>
		<content:encoded><![CDATA[<p>Eric,</p>
<p>Interesting.  When you say &#8220;Third, automatically generated keywords and phrases were many times just as useful as the librarian-assigned Library of Congress Subject headings.&#8221;</p>
<p>I grow concerned though, because the strings:</p>
<p>Universalism United States History<br />
Unitarian Universalist churches United States</p>
<p>&#8230;seem to me to be much more fulsome and useful.  I&#8217;m not saying your automatically generated list isn&#8217;t useful, or that it isn&#8217;t complementary to LCSH &#8211; I&#8217;m just concerned with your phrase &#8220;just as useful&#8221; which, pardon me &#8211; seems to give you away as someone who, when weighing the options, will be more than willing to let LCSH go when the time comes.</p>
<p>And that, I think, is not going to serve us well.</p>
<p>~Nathan </p>
<p> Many of the items harvested from the Internet Archive were complete with MARC records. Some of those records included subject headings. During Step #5 (above), I spent time observing the output and comparing it to previously assigned terms. Take for example a work called Universalism in America: A History by Richard Eddy. Its assigned headings included: </p>
<p>Universalism United States History<br />
Unitarian Universalist churches United States</p>
]]></content:encoded>
						</item>
			</channel>
</rss>
