
Human Unsupervised and Supervised Learning as A Quantitative Distinction


International Journal of Pattern Recognition and Artificial Intelligence, Vol. 17, No. 5 (2003) 885–901

© World Scientific Publishing Company

HUMAN UNSUPERVISED AND SUPERVISED LEARNING AS A QUANTITATIVE DISTINCTION

TODD M. GURECKIS∗ and BRADLEY C. LOVE†
Department of Psychology, University of Texas at Austin, 1 University Station A8000, Austin, TX 78712-0187, USA

∗gureckis@love.psy.utexas.edu
†love@love.psy.utexas.edu
http://love.psy.utexas.edu/

SUSTAIN (Supervised and Unsupervised STratified Adaptive Incremental Network) is a network model of human category learning. SUSTAIN initially assumes a simple category structure. If simple solutions prove inadequate and SUSTAIN is confronted with a surprising event (e.g. it is told that a bat is a mammal instead of a bird), SUSTAIN recruits an additional cluster to represent the surprising event. Newly recruited clusters are available to explain future events and can themselves evolve into prototypes/attractors/rules. SUSTAIN has expanded the scope of findings that models of human category learning can address. This paper extends SUSTAIN to account for both supervised and unsupervised learning data through a common mechanism. The modified model, uSUSTAIN (unified SUSTAIN), is successfully applied to human learning data that compares unsupervised and supervised learning performances [18].

Keywords: Category; learning; unsupervised; supervised; psychology.

1. Introduction

Categories provide a crucial function underlying the cognitive abilities of humans. They allow us to generalize our knowledge to novel situations and to infer unknown properties of the environment. These abilities are indispensable to any intelligent system.

Researchers studying human categorization have traditionally focused on human performance in supervised learning tasks (see Refs. 2, 4 and 7 for some exceptions). In this experimental paradigm, subjects learn to classify stimuli as members of contrastive categories through trial by trial learning with corrective feedback. Theories (and models) of learning are favored that can account for the relative difficulty of acquiring different category structures [25, 30].

Although classification learning does capture aspects of human learning, others are not addressed by this paradigm. For instance, humans can spontaneously construct categories in the absence of feedback. As an example, many of us have created the categories “interesting” email and “junk” email in the absence of explicit feedback. Such learning is referred to as unsupervised learning. Supervised and unsupervised learning are often seen as being qualitatively different. Supervised learning is characterized as intentional, in that learners actively search for rules (perhaps by hypothesis testing) and are explicitly aware of the rule they are considering [26]. On the other hand, unsupervised learning is seen as an incidental, undirected, stimulus driven, and incremental accrual of information [3, 8, 13, 14, 17].

In contrast to this view, Love [18] has found that intentional unsupervised learning performance is more similar to supervised learning performance than it is to incidental unsupervised learning performance. This result suggests that the unsupervised/supervised dichotomy may not be valid. Gureckis and Love [10] have argued that unsupervised and supervised learning can be modeled through a common mechanism. However, our account has yet to be applied to a direct comparison between supervised and unsupervised learning. Here, we apply Gureckis and Love’s [10] variant of the SUSTAIN (Supervised and Unsupervised STratified Adaptive Incremental Network) model, referred to as uSUSTAIN (unified SUSTAIN), to the Love [18] data. uSUSTAIN differs from other models that seek to unify unsupervised and supervised learning, such as Anderson’s [1] rational model, in that uSUSTAIN is applicable to both unsupervised and supervised learning tasks while not predicting that these tasks lead to equivalent performance (which they do not). In the remainder of this paper, we overview SUSTAIN and uSUSTAIN. We then fit uSUSTAIN to the Love [18] data and consider the implications of the simulations.

2. The Modeling Approach: SUSTAIN and uSUSTAIN

SUSTAIN has been successfully applied to an array of challenging human data sets spanning a variety of category learning paradigms including classification learning [21], learning at different levels of abstraction [20], inference learning [19], and unsupervised learning [11, 22].

In the following sections, we discuss SUSTAIN’s operation, its underlying principles, and the mathematical equations that follow from these principles. We then introduce a modification to SUSTAIN that enables it to account for supervised and unsupervised learning data through a single recruitment mechanism. This mechanism makes use of an intuitive and general notion of surprise to facilitate learning. This modified version of SUSTAIN is referred to as uSUSTAIN.

2.1. Overview of model

SUSTAIN is a network model of human category learning. On each learning trial, SUSTAIN takes as input a description of the current stimulus item represented to the model by a set of perceptual feature dimensions. For example, a stimulus item depicting a large, purple square will be represented to the model by the feature dimensions color, size and shape. Like other models of category learning (such as Ref. 1), SUSTAIN treats the category membership (or category label) of a stimulus item as another stimulus feature dimension. SUSTAIN maintains a selective attention mechanism which allows it to learn to focus attention on stimulus dimensions that are particularly useful for the current categorization task (similar to Ref. 16).

The internal representations in the model consist of a set of clusters. Categories are represented in the model as one or more associated clusters. Initially, the network has only one cluster that is centered upon the first input pattern. As new stimulus items are presented, the model attempts to assign these new items to an existing cluster. This assignment is done through an unsupervised procedure based on the similarity of the new item to the stored clusters. When a new item is assigned to a cluster, the cluster updates its internal representation to become the average of all items assigned to the cluster so far.

However, if SUSTAIN discovers through feedback that this similarity-based assignment is incorrect, a new cluster is created to encode the current item as an exception (for a concrete example of this see Principle 3 in the following section). In unsupervised learning tasks there is no corrective feedback, so instead SUSTAIN creates a new cluster if the current stimulus item is not sufficiently similar to any existing clusters (the threshold for this sufficiency is controlled by a parameter in the model). Both of these cluster recruitment strategies are unified under the principle of “adaptation to surprise” [10]. In supervised learning, SUSTAIN creates a new cluster in response to a surprising misclassification, whereas in unsupervised learning, a new cluster is created when the model encounters a surprisingly novel stimulus item.

Clusters compete with each other to respond to the current stimulus item. The cluster that wins this competition passes its activation over connection weights to a set of output units. These output units replicate the structure of the input dimensions. The connection weights are adjusted over the course of learning so that the association between each cluster and the appropriate response for members of that cluster is strengthened. For example, a cluster whose members are mostly in category “A” would develop over the course of learning a stronger connection to the category “A” output unit than to the category “B” output unit. The activation of an output unit is proportional to the strength of the activation passed from the winning cluster and the strength of the connection weight. SUSTAIN’s ultimate response is biased towards the most activated output unit. In this way, classification decisions are ultimately based on the cluster to which an instance is assigned.

2.2. The key principles of SUSTAIN

With this general understanding of the operation of the model in mind, we now examine the five key principles of SUSTAIN. These principles highlight the important features of the model and provide the foundation for the model’s formalism.

2.2.1. Principle 1, SUSTAIN is biased towards simple solutions

SUSTAIN is initially directed towards simple solutions. At the start of learning, SUSTAIN has only one cluster which is centered on the first input item. It then adds clusters (i.e. complexity) only as needed to accurately describe the category structure. Like other models of category learning (e.g. Ref. 16), SUSTAIN learns to selectively attend to stimulus feature dimensions that are most useful for categorization. This focus on a subset of stimulus dimensions also serves to bias SUSTAIN towards simple solutions.

2.2.2. Principle 2, similar stimulus items tend to cluster together

In learning to classify stimuli as members of two distinct categories, SUSTAIN will cluster similar items together. For example, different instances of a bird subtype (e.g. sparrows) could cluster together and form a sparrow cluster instead of leaving separate traces in memory for each instance. Clustering is an unsupervised process because cluster assignment is done on the basis of similarity, not feedback.

2.2.3. Principle 3, SUSTAIN learns in both a supervised and unsupervised fashion

In learning to classify the categories “birds” and “mammals”, SUSTAIN relies on both unsupervised and supervised learning processes. Consider a learning trial in which SUSTAIN has formed a cluster whose members are small birds, and another cluster whose members are four-legged mammals. If SUSTAIN is subsequently asked to classify a bat, it will initially predict that a bat is a bird on the basis of overall similarity (bats and birds are both small, have wings, fly, etc.). Upon receiving feedback from the environment (supervision) indicating that a bat is a mammal, SUSTAIN will recruit a new cluster to represent the bat as an exception to the mammal category. The next time SUSTAIN is exposed to the bat or another similar bat, SUSTAIN will correctly predict that a bat is a mammal. This example also illustrates how SUSTAIN can entertain more complex solutions when necessary through cluster recruitment (see Principle 1).

2.2.4. Principle 4, the pattern of feedback matters

As the example used above illustrates, feedback affects the inferred category structure. Prediction failures result in a cluster being recruited, thus different patterns of feedback can lead to different representations being acquired. This principle allows SUSTAIN to predict different acquisition patterns for different learning modes that are informationally equivalent but differ in their pattern of feedback. The learning conditions in the Love [18] study considered in this paper are informationally equivalent, but differ in their pattern of feedback.

2.2.5. Principle 5, cluster competition

Clusters can be seen as competing explanations of the input. The strength of the response from the winning cluster (the cluster the current stimulus is most similar to) is attenuated in the presence of other clusters that are somewhat similar to the current stimulus (see Ref. 31 for an account of competing explanations in reasoning).

2.3. Mathematical formulation of SUSTAIN

This section of the paper explains how the general principles that govern SUSTAIN’s operation are implemented in an algorithmic model.

2.3.1. Input representation

Stimuli are represented in the model as vector frames where the dimensionality of the vector is equal to the dimensionality of the stimuli. The category label is also included as a stimulus dimension. Thus, stimuli that vary on three perceptual dimensions (e.g. size, shape and color) and are members of one of two categories would require a vector frame with four dimensions. A four-dimensional binary-valued stimulus (three perceptual dimensions plus the category label) can be thought of as a four character string (e.g. 1 2 1 1) in which each character represents the value of a stimulus dimension. For example, the first character could denote the size dimension with a 1 indicating a small stimulus and a 2 indicating a large stimulus.

Of course, a learning trial usually involves an incomplete stimulus representation. For instance, in classification learning all the perceptual dimensions are known, but the category label dimension is unknown and queried. After the learner responds to the query, corrective feedback is provided. Assuming the fourth stimulus dimension is the category label dimension, the classification trial for the above stimulus is represented as 1 2 1 ? → 1 2 1 1.

On every classification trial, the category label dimension is queried and corrective feedback indicating the category membership of the stimulus is provided. In contrast, on inference learning trials, subjects are given the category membership of the item, but must infer an unknown stimulus dimension. Possible inference learning trials for the above stimulus description are ? 2 1 1 → 1 2 1 1, 1 ? 1 1 → 1 2 1 1, and 1 2 ? 1 → 1 2 1 1. Notice that inference and classification learning provide the learner with the same stimulus information after feedback (though the pattern of feedback varies).
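The query notation used above can be sketched in a few lines of Python. This is only an illustration: the helper names (`make_trial`, `classification_trial`, `inference_trials`) are hypothetical, stimuli are written as compact strings such as "1211", and the category label is assumed to be the last dimension.

```python
from typing import List, Tuple

UNKNOWN = "?"  # marker for the queried dimension, as in the text's notation


def make_trial(stimulus: str, queried: int) -> Tuple[str, str]:
    """Return (query, feedback) with one dimension hidden.

    `queried` is a 0-based index into the stimulus string (an assumption
    of this sketch; the paper numbers dimensions from 1).
    """
    if not 0 <= queried < len(stimulus):
        raise ValueError("queried index out of range")
    query = stimulus[:queried] + UNKNOWN + stimulus[queried + 1:]
    return query, stimulus


def classification_trial(stimulus: str) -> Tuple[str, str]:
    # Assumption: the category label is the last dimension.
    return make_trial(stimulus, len(stimulus) - 1)


def inference_trials(stimulus: str) -> List[Tuple[str, str]]:
    # The label stays visible; each perceptual dimension is queried in turn.
    return [make_trial(stimulus, i) for i in range(len(stimulus) - 1)]
```

Both trial types end with the same complete stimulus as feedback, which is the sense in which they are informationally equivalent.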

Unsupervised learning does not involve informative feedback. In unsupervised learning, every item is considered to be a member of the same global category. Thus, the category label dimension is unitary valued and uninformative for differentiating between stimuli. However, the degree to which any particular stimulus activates this category dimension indicates the degree to which the network recognizes the stimulus.

In order to represent a nominal stimulus dimension that can display multiple values, SUSTAIN devotes multiple input units. To represent a nominal dimension containing k distinct values, k input units are utilized. All the units forming a dimension are set to zero, except for the one unit that denotes the nominal value of the dimension (this unit is set to one). For example, the stimulus dimension of marital status has three values (“single”, “married”, “divorced”). The pattern [0 1 0] represents the dimension value of “married”. A complete stimulus is represented by the vector $I^{pos_{ik}}$ where i indexes the stimulus dimension and k indexes the nominal values for dimension i. For example, if marital status was the third stimulus dimension and the second value was present (i.e. married), then $I^{pos_{32}}$ would equal one, whereas $I^{pos_{31}}$ and $I^{pos_{33}}$ would equal zero. The “pos” in $I^{pos}$ denotes that the current stimulus is located at a particular position in a multidimensional representational space.

2.3.2. Receptive fields

Each cluster has a receptive field for each stimulus dimension. A cluster’s receptive field for a given dimension is centered at the cluster’s position along that dimension. The position of a cluster within a dimension indicates the cluster’s expectations for its members.

The tuning of a receptive field (as opposed to the position of a receptive field) determines how much attention is being devoted to the stimulus dimension. All the receptive fields for a stimulus dimension have the same tuning (i.e. attention is dimension-wide as opposed to cluster-specific). A receptive field’s tuning changes as a result of learning. This change in receptive field tuning implements SUSTAIN’s selective attention mechanism. Dimensions that are highly attended to develop peaked tunings, whereas dimensions that are not well attended to develop broad tunings. Dimensions that provide consistent information at the cluster level receive greater attention.

Mathematically, receptive fields have an exponential shape with a receptive field’s response decreasing exponentially as distance from its center increases. The activation function for a dimension is:

$$\alpha(\mu) = \lambda e^{-\lambda\mu} \tag{1}$$

where λ is the tuning of the receptive field, µ is the distance of the stimulus from the center of the field, and α(µ) denotes the response of the receptive field to a stimulus falling µ units from the center of the field. The choice of exponentially shaped receptive fields is motivated by Shepard’s [29] work on stimulus generalization.

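Although receptive fields with different λ have different shapes (ranging from a broad to a peaked exponential), for any λ, the area “underneath” a receptive field is constant:

$$\int_0^\infty \alpha(\mu)\,d\mu = \int_0^\infty \lambda e^{-\lambda\mu}\,d\mu = 1. \tag{2}$$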
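As a quick numerical illustration of Eqs. (1) and (2), the sketch below evaluates the receptive field response and approximates the area under it with the trapezoidal rule. The function names, the integration limit of 50 and the step count are arbitrary choices made for this example, not part of the model.

```python
import math


def receptive_field(mu: float, lam: float) -> float:
    """Eq. (1): response of a field with tuning `lam` at distance `mu`."""
    return lam * math.exp(-lam * mu)


def area_under_field(lam: float, upper: float = 50.0, steps: int = 200_000) -> float:
    """Trapezoidal approximation of the integral in Eq. (2) over [0, upper].

    The tail beyond `upper` is ignored, which is harmless for tunings
    that are not very small (the integrand decays exponentially).
    """
    h = upper / steps
    total = 0.5 * (receptive_field(0.0, lam) + receptive_field(upper, lam))
    for i in range(1, steps):
        total += receptive_field(i * h, lam)
    return total * h
```

A broad field (λ = 1) and a peaked field (λ = 4) both integrate to approximately 1, so sharpening the tuning redistributes the response towards the center rather than adding to it.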

For a given µ, the λ that maximizes α(µ) can be computed from the derivative:

$$\frac{\partial\alpha}{\partial\lambda} = e^{-\lambda\mu}(1 - \lambda\mu). \tag{3}$$

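2.3.3. Cluster activation

With nominal stimulus dimensions, the distance $\mu_{ij}$ (from 0 to 1) between the ith dimension of the stimulus and cluster j’s position along the ith dimension is:

$$\mu_{ij} = \frac{1}{2}\sum_{k=1}^{v_i}\left|I^{pos_{ik}} - H_j^{pos_{ik}}\right| \tag{4}$$

where $v_i$ is the number of nominal values on the ith dimension and $H_j^{pos_{ik}}$ is cluster j’s position on the ith dimension for value k. The activation of a cluster is given by:

$$H_j^{act} = \frac{\sum_{i=1}^{m}(\lambda_i)^r e^{-\lambda_i\mu_{ij}}}{\sum_{i=1}^{m}(\lambda_i)^r} \tag{5}$$

where $H_j^{act}$ is the activation of the jth cluster, m is the number of stimulus dimensions, $\lambda_i$ is the tuning of the receptive field for the ith input dimension, and r is an attentional parameter (always non-negative). When r is large, input units with tighter tunings (units that seem relevant) dominate the activation function. Dimensions that are highly attended have larger λs and will have greater importance in determining the clusters’ activation values. Increasing r simply accentuates this effect. If r is set to zero, every dimension receives equal attention. Equation (5) sums up the responses of the receptive fields for each input dimension and normalizes the sum (again, highly attended dimensions weigh heavily). Cluster activation is bound between 0 (exclusive) and 1 (inclusive). Unknown stimulus dimensions (e.g. the category label in a classification trial) are not included in the above calculation.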
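The activation computation can be sketched as follows. The sketch assumes that the distance of Eq. (4) is half the summed absolute difference between the one-hot input and the cluster's position (which yields the stated range of 0 to 1) and that Eq. (5) is the attention-weighted, normalized sum described in the paragraph above. The names `distance` and `cluster_activation`, and the use of `None` to mark an unknown dimension, are choices of this example.

```python
import math
from typing import Optional, Sequence

Dim = Sequence[float]  # one nominal dimension: one-hot input or cluster position


def distance(stim_dim: Dim, cluster_dim: Dim) -> float:
    """Assumed form of Eq. (4): half the city-block distance, in [0, 1]."""
    return 0.5 * sum(abs(s - c) for s, c in zip(stim_dim, cluster_dim))


def cluster_activation(stimulus: Sequence[Optional[Dim]],
                       cluster: Sequence[Dim],
                       lambdas: Sequence[float],
                       r: float) -> float:
    """Assumed form of Eq. (5); dimensions given as None are unknown and skipped."""
    numerator = 0.0
    denominator = 0.0
    for stim_dim, cluster_dim, lam in zip(stimulus, cluster, lambdas):
        if stim_dim is None:
            continue  # e.g. the queried category label on a classification trial
        weight = lam ** r
        numerator += weight * math.exp(-lam * distance(stim_dim, cluster_dim))
        denominator += weight
    return numerator / denominator
```

A cluster centered exactly on the stimulus has every distance equal to zero and therefore an activation of 1, the upper bound mentioned in the text.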

2.3.4. Competition

Clusters compete to respond to input patterns and in turn inhibit one another. When many clusters are strongly activated, the output of the winning cluster $H_j^{out}$ is less:

For the winning $H_j$ with the greatest $H^{act}$,

$$H_j^{out} = \frac{(H_j^{act})^{\beta}}{\sum_{i=1}^{n}(H_i^{act})^{\beta}}\,H_j^{act} \tag{6}$$

where n is the number of clusters and β is the cluster competition parameter. Clusters other than the winner have their output set to zero. Equation (6) is a straightforward method for implementing lateral inhibition. It is a high level description of an iterative process where units send signals to each other across inhibitory connections. Psychologically, Eq. (6) signifies that competing alternatives will reduce confidence in a choice (reflected in a lower output value).

2.3.5. Response

Activation is spread from the clusters to the output units of the queried (the unknown) stimulus dimension z:

$$C_{zk}^{out} = \sum_{j=1}^{n} w_{j,zk}\,H_j^{out} \tag{7}$$

where $C_{zk}^{out}$ is the output of the output unit representing the kth nominal value of the queried (unknown) zth dimension, n is the number of clusters, and $w_{j,zk}$ is the weight from cluster j to category unit $C_{zk}$. A winning cluster (especially one that did not have many competitors and is similar to the current input pattern) that has a large positive connection to an output unit will strongly activate the output unit. The summation in the above calculation is not really necessary given that only the winning cluster has a nonzero output, but is included to make the similarities between SUSTAIN and other models more apparent.
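The competition and response steps of Eqs. (6) and (7) can be sketched together. The form used for Eq. (6) here, the winner's activation scaled by its share of the β-powered activations, is an assumption of this example, as are the function names and the list-of-lists layout of the weights.

```python
from typing import List, Sequence


def cluster_outputs(acts: Sequence[float], beta: float) -> List[float]:
    """Winner-take-all output with lateral inhibition (assumed form of Eq. (6)).

    `acts` holds the cluster activations, each in (0, 1].
    """
    winner = max(range(len(acts)), key=lambda j: acts[j])
    denominator = sum(a ** beta for a in acts)
    outputs = [0.0] * len(acts)  # non-winners stay at zero
    outputs[winner] = (acts[winner] ** beta / denominator) * acts[winner]
    return outputs


def output_units(outputs: Sequence[float],
                 weights: Sequence[Sequence[float]]) -> List[float]:
    """Eq. (7): weights[j][k] links cluster j to output unit k of the queried dimension."""
    n_units = len(weights[0])
    return [sum(weights[j][k] * outputs[j] for j in range(len(outputs)))
            for k in range(n_units)]
```

With a single cluster there is no competition and the output equals the activation; adding a second, partially activated cluster lowers the winner's output, which is the reduced confidence described in the Competition section.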

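The probability of making response k (the kth nominal value) for the queried dimension z is

$$Pr(k) = \frac{e^{(d\cdot C_{zk}^{out})}}{\sum_{j=1}^{v_z} e^{(d\cdot C_{zj}^{out})}} \tag{8}$$

where d is the decision consistency parameter and $v_z$ is the number of nominal values of the queried dimension z.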
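A minimal sketch of the response rule follows, assuming Eq. (8) normalizes the exponentiated outputs over all nominal values of the queried dimension. The function name `response_probabilities` is hypothetical, and subtracting the maximum before exponentiating is a numerical convenience that leaves the probabilities unchanged.

```python
import math
from typing import List, Sequence


def response_probabilities(c_out: Sequence[float], d: float) -> List[float]:
    """Exponential choice rule over the output units of the queried dimension.

    `d` plays the role of the decision consistency parameter: larger values
    make the most activated unit's response more likely.
    """
    peak = max(c_out)
    # Shifting by the maximum cancels in the ratio and avoids overflow.
    exps = [math.exp(d * (c - peak)) for c in c_out]
    total = sum(exps)
    return [e / total for e in exps]
```

Setting d to zero yields uniform guessing, while increasing d pushes the response towards the most activated output unit.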

A new cluster is recruited if the winning cluster predicts an incorrect response. In the case of a supervised learning situation, a cluster is recruited according to the following procedure:

For the queried dimension z, if $t_{zk}$ does not equal 1 for the $C_{zk}$ with the largest output $C_{zk}^{out}$ of all $C_{z*}$, then recruit a new cluster. (10)

In other words, the output unit representing the correct nominal value must be the most activated of all the output units forming the queried stimulus dimension. In the case of an unsupervised learning situation, SUSTAIN is self-supervising and recruits a cluster when the most activated cluster $H_j$’s activation is below the threshold τ:

if ($H_j^{act} < \tau$), then recruit a new cluster. (11)

Unsupervised recruitment in SUSTAIN bears a strong resemblance to recruitment in Adaptive Resonance Theory [5], Clapper and Bower’s qualitative model [6], and Hartigan’s leader algorithm [12].
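The two original recruitment rules, Eqs. (10) and (11), can be written as two small predicates. In this sketch the targets are assumed to be a one-hot vector of 0s and 1s marking the correct nominal value, and the function names are hypothetical.

```python
from typing import Sequence


def recruit_supervised(c_out: Sequence[float], targets: Sequence[int]) -> bool:
    """Eq. (10): recruit when the most activated output unit is not the correct one."""
    best = max(range(len(c_out)), key=lambda k: c_out[k])
    return targets[best] != 1


def recruit_unsupervised(acts: Sequence[float], tau: float) -> bool:
    """Eq. (11): recruit when even the most activated cluster falls below tau."""
    return max(acts) < tau
```

The first rule is driven by a prediction error and the second by unfamiliarity, which is the difference that the unified rule of Sec. 2.4 removes.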

When a new cluster is recruited it is centered on the misclassified input pattern and the clusters’ activations and outputs are recalculated. The new cluster then becomes the winner because it will be the most highly activated cluster (it is centered upon the current input pattern, so all $\mu_{ij}$ will be zero). Again, SUSTAIN begins with a cluster centered on the first stimulus item. The position of the winner is adjusted:

$$\text{For the winning } H_j,\quad \Delta H_j^{pos_{ik}} = \eta\,(I^{pos_{ik}} - H_j^{pos_{ik}}) \tag{12}$$

where η is the learning rate. The centers of the winner’s receptive fields move towards the input pattern according to the Kohonen learning rule [15]. This learning rule centers the cluster amidst its members.
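Equation (12) amounts to moving each coordinate of the winning cluster a fraction η of the way towards the input. A minimal sketch, with a hypothetical function name and positions stored as one list per nominal dimension:

```python
from typing import List, Sequence


def update_position(cluster: Sequence[Sequence[float]],
                    stimulus: Sequence[Sequence[float]],
                    eta: float) -> List[List[float]]:
    """Eq. (12): shift the winner's position towards the current input."""
    return [[h + eta * (i - h) for h, i in zip(h_dim, i_dim)]
            for h_dim, i_dim in zip(cluster, stimulus)]
```

Because each dimension of the input is one-hot and the update is a convex combination for η between 0 and 1, the values within a dimension of the cluster continue to sum to one.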

Using our result from Eq. (3), receptive field tunings are updated according to:

$$\Delta\lambda_i = \eta\, e^{-\lambda_i\mu_{ij}}(1 - \lambda_i\mu_{ij}) \tag{13}$$

where j is the index of the winning cluster.

Only the winning cluster updates the value of $\lambda_i$. Equation (13) adjusts the peakedness of the receptive field for each input so that each input dimension can maximize its influence on the clusters. Initially, $\lambda_i$ is set to be broadly tuned with a value of 1. The value of 1 is chosen because the maximal distance $\mu_{ij}$ is 1 and the optimal setting of $\lambda_i$ for this case is 1 (i.e. Eq. (13) equals zero). Under this scheme, $\lambda_i$ cannot become less than 1, but can become more narrowly tuned.
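The behavior described above can be checked directly from Eq. (13). The function name `delta_lambda` is a choice of this sketch.

```python
import math


def delta_lambda(lam: float, mu: float, eta: float) -> float:
    """Eq. (13): change in the tuning of one dimension for the winning cluster."""
    return eta * math.exp(-lam * mu) * (1.0 - lam * mu)
```

At the initial tuning of 1 and the maximal distance of 1 the update is exactly zero, a perfect match (distance 0) always sharpens the tuning, and a tuning above 1 paired with a maximal distance is pulled back down towards 1.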

When a cluster is recruited, weights from the unit to the output units are set to zero. The one layer delta learning rule [32, 28] is used to adjust these weights:

$$\Delta w_{j,zk} = \eta\,(t_{zk} - C_{zk}^{out})\,H_j^{out}, \tag{14}$$

where z is the queried dimension. Note that only the winning cluster will have its weights adjusted since it is the only cluster with a nonzero output.
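A sketch of the delta rule of Eq. (14) for the weights leaving one cluster. The function name and the flat list layout (one weight per output unit of the queried dimension) are assumptions of this example; the targets are passed in as given.

```python
from typing import List, Sequence


def update_weights(weights: Sequence[float],
                   targets: Sequence[float],
                   c_out: Sequence[float],
                   h_out: float,
                   eta: float) -> List[float]:
    """Eq. (14): delta rule for the weights from one cluster to the output units.

    `h_out` is that cluster's output; it is zero for every non-winner,
    so only the winner's weights actually change.
    """
    return [w + eta * (t - c) * h_out
            for w, t, c in zip(weights, targets, c_out)]
```

For a freshly recruited cluster (all weights zero), the first update raises the weight to the correct output unit and leaves the others untouched.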


2.4. uSUSTAIN: a unified approach to supervised and unsupervised learning

SUSTAIN can model both supervised and unsupervised learning, but it relies on different recruitment mechanisms. In both cases, a cluster is recruited in response to a surprising event (i.e. the existing cluster structure does not properly characterize the current stimulus), but how a surprising event is defined differs. In the supervised case, the surprising event is a prediction error, whereas in the case of unsupervised learning the surprising event is an unfamiliar stimulus.

Although the two separate recruitment procedures have been successful, a single recruitment procedure is preferable. Beyond parsimony, a unified account could prove useful in clarifying the relationship between unsupervised and supervised learning. A simple way to integrate the two recruitment strategies is to generalize the unsupervised procedure so that it is applicable to supervised learning situations. Under this scheme, a new cluster is recruited when the current stimulus is not sufficiently similar to any cluster in its category:

$$\text{For the queried dimension } z, \text{ if } \max(\{H_j^{act} \mid \mu_{zj} = 0\}) < \tau, \text{ then recruit a new cluster,} \tag{15}$$

where $H_j^{act}$ is the activation of cluster j, $\mu_{zj}$ is the distance [as defined in Eq. (4)] between the zth dimension of the current stimulus and cluster j’s position along the zth dimension, and τ is a constant between 0 and 1 (a parameter). The requirement that $\mu_{zj}$ be zero specifies that only clusters associated with the category of the current stimulus are considered. In unsupervised learning, all items belong to the same global category which represents items the network has seen before. Thus, $\max(\{H_j^{act} \mid \mu_{zj} = 0\})$ refers to the most activated cluster overall. In supervised learning, the most activated cluster predicting the correct category may not be the most activated cluster overall.
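The unified rule of Eq. (15) can be sketched as a single predicate. The function name is hypothetical, and the handling of the case where no cluster yet belongs to the stimulus's category (treated here as grounds for recruitment) is an assumption of this example rather than something stated in the text.

```python
from typing import Sequence


def recruit_unified(acts: Sequence[float],
                    label_distances: Sequence[float],
                    tau: float) -> bool:
    """Eq. (15): recruit when no cluster of the stimulus's own category reaches tau.

    `label_distances[j]` is the distance between the stimulus and cluster j
    on the queried dimension z; zero means cluster j has the same category.
    """
    same_category = [a for a, mu in zip(acts, label_distances) if mu == 0]
    if not same_category:
        # Assumption: with no cluster in the category, a new one is recruited.
        return True
    return max(same_category) < tau
```

In the unsupervised case every label distance is zero, so the predicate reduces to the threshold test of Eq. (11). In the supervised case a highly activated cluster of the wrong category is simply ignored.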

Besides providing a unified framework, this recruitment strategy has a number of other virtues over SUSTAIN’s original recruitment rule [Eq. (10)] for supervised learning. For example, the unified procedure will recruit a new cluster when an unusual item is encountered that does not result in a prediction error whereas the previous error-driven recruitment scheme would not recruit a new cluster to encode the unusual item. Assigning a very unusual item to an existing cluster (a cluster the item is not very similar to) could result in catastrophic interference (see Ref. 27) as the cluster must undergo radical change to accommodate its newest member.

3. Evaluating uSUSTAIN

In order to evaluate this unified formulation of the model we applied uSUSTAIN to the studies previously accounted for using separate cluster recruitment mechanisms for supervised and unsupervised learning [10]. It is important to recognize that the recruitment procedure that uSUSTAIN uses is, in fact, a generalization of the unsupervised recruitment procedure used by the original SUSTAIN model. Thus, uSUSTAIN and SUSTAIN provide equivalent accounts of unsupervised learning [9]. uSUSTAIN and SUSTAIN have fit a number of unsupervised learning studies and have generated novel predictions that have been subsequently tested and confirmed with human subjects [11].

A true test of generality of the uSUSTAIN approach lies in its ability to fit supervised learning data. Gureckis and Love [9] applied uSUSTAIN to a number of supervised learning studies and found that uSUSTAIN approximated SUSTAIN’s successes. Despite its simplicity, the unified recruitment procedure in uSUSTAIN has proven remarkably successful in this domain.

Although uSUSTAIN has demonstrated the ability to account for human learning performance across a wide range of category learning paradigms, it has never been applied to a study specifically designed to compare unsupervised and supervised learning. Given the past successes of the model, it would be informative to apply the model to a direct comparison between supervised and unsupervised learning. The following section examines uSUSTAIN’s account of Love’s [18] study that compares incidental unsupervised learning, intentional unsupervised learning, and supervised classification learning in a controlled manner.

4. Comparing Supervised and Unsupervised Learning

The Love [18] study is unique in that it specifically allows for a direct comparison of supervised and unsupervised learning. In supervised learning, the common dependent measure used to assess learning difficulty is training accuracy [25, 30]. However, there is no measure of training accuracy in unsupervised learning (there is no right or wrong response on each study trial). In order to directly compare learning performance across these two types of learning, a comparable dependent measure was developed.

To accomplish this, stimuli were created by embedding the category label (which is typically a verbal label such as category “A” or “B”) into each stimulus as a fourth binary-valued perceptual dimension (see Table 1). On supervised classification study phase trials, subjects were shown the value of the first three perceptual dimensions and were queried on the fourth. After responding, the correct value of the fourth dimension was shown. In the Love [18] study, the fourth dimension (i.e. the “category” dimension) was the border color (either yellow or white) of a geometric figure. Subjects indicated whether they believed the border color (not shown on the display) was yellow or white based on the three other perceptual dimensions (which were shown on the display). After responding, the complete figure was displayed.

Table 1. The logical structure of Types I, II, IV and VI classification problems tested in Ref. 30.

Stimulus    I    II   IV   VI
1 1 1       1    1    1    1
1 1 2       1    1    1    2
1 2 1       1    2    1    2
1 2 2       1    2    2    1
2 1 1       2    2    1    2
2 1 2       2    2    2    1
2 2 1       2    1    2    1
2 2 2       2    1    2    2

On unsupervised study phase trials, all four perceptual dimensions were shown (the fourth dimension was not queried). In the intentional unsupervised learning condition, subjects were aware they were in a learning task and were instructed to actively search for patterns that characterized the training items. In contrast, subjects in the incidental unsupervised learning condition were not aware that they were in a learning task and were instructed to simply rate how pleasant they found each stimulus item.

In each of the three study conditions (supervised classification learning, intentional unsupervised learning, incidental unsupervised learning), subjects were trained on either Type I, II, IV, or VI category structures (see Table 1) defined by Shepard, Hovland and Jenkins [30]. The Type I problem requires attention along only one input dimension, whereas the Type II problem requires attending to two dimensions (Type II is XOR on the first two dimensions with an irrelevant third dimension). The categories in the Type II problem have a highly nonlinear structure. Type IV requires attention along all three perceptual dimensions with each dimension serving as an imperfect predictor. Type IV is notable because it displays a linear category structure. Type VI also requires attention to all three perceptual dimensions and has no regularities across any pair of dimensions. In all conditions, subjects completed ten study blocks (a block consists of the presentation of each stimulus item in a random order).
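The four category structures can be generated from simple rules, which makes their logical differences concrete. In the sketch below the rule functions are illustrative restatements (one relevant dimension, XOR, majority vote, parity) chosen so that the resulting labels match the columns of Table 1. The function names, and the coding of values and categories as 1 and 2, follow this example's conventions.

```python
from itertools import product

# Eight stimuli over three binary dimensions, in the row order of Table 1.
STIMULI = list(product((1, 2), repeat=3))


def type_i(s):
    # Only the first dimension matters.
    return s[0]


def type_ii(s):
    # XOR on the first two dimensions; the third is irrelevant.
    return 1 if s[0] == s[1] else 2


def type_iv(s):
    # Each dimension is an imperfect predictor: majority vote (linearly separable).
    return 2 if sum(v == 2 for v in s) >= 2 else 1


def type_vi(s):
    # Parity over all three dimensions: no regularity in any pair of dimensions.
    return 1 if sum(v == 2 for v in s) % 2 == 0 else 2


def labels(rule):
    """Category labels for the eight stimuli, as one compact string."""
    return "".join(str(rule(s)) for s in STIMULI)
```

Calling `labels(type_ii)`, for instance, reproduces the Type II column of Table 1 read from top to bottom.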

Category learning performance was measured in a test phase which followed the study phase. Subjects viewed a pair of stimuli that varied only on the fourth dimension (i.e. the category dimension). Subjects were instructed to choose the item that appeared during the study phase (a familiarity or recognition judgment). As in traditional supervised classification learning studies, subjects could base this judgment on their knowledge of the relationship between the category dimension and other dimensions (e.g. rules, correlations, etc.) as well as on memorized exemplars. Love [18] verified that this testing procedure yields performance scores that correlate highly with study phase accuracy in the supervised condition. Thus, test phase accuracy can be used to compare the ability of subjects to learn in each of the three study conditions.

The results are shown in Table 2. The acquisition patterns for the three learning conditions differ significantly. Subjects in the unsupervised conditions did not show a preference for the Type II category structure relative to the Type IV structure. This effect was most pronounced in the incidental unsupervised learning condition. One explanation for this difference between the incidental and intentional unsupervised learning conditions is that the intentional unsupervised learning task encouraged sub-


Table 2. The study phase and test phase results from Ref. 18. uSUSTAIN’s fit is shown in parentheses.

                                      Study phase    Test phase
Supervised Classification Learning
  Type I                              0.86 (0.74)    0. (0.90)
  Type II                             0.67 (0.63)    0.73 (0.75)
  Type IV                             0.65 (0.60)    0.70 (0.65)
  Type VI                             0.59 (0.56)    0.61 (0.58)
Intentional Unsupervised Learning
  Type I                              NA             0.84 (0.86)
  Type II                             NA             0. (0.57)
  Type IV                             NA             0.67 (0.66)
  Type VI                             NA             0. (0.50)
Incidental Unsupervised Learning
  Type I                              NA             0.85 (0.81)
  Type II                             NA             0.56 (0.51)
  Type IV                             NA             0.67 (0.63)
  Type VI                             NA             0.56 (0.50)


Table 3. uSUSTAIN’s best fitting parameters for Ref. 18 studies.

Parameter               Symbol    Best fitting values
Learning rate           η         0.0172    0.0186
Cluster competition     β         2.901     0.608
Decision consistency    d         4.474     14.442
Attentional focus       r         0.475     2.209
Threshold               τ         0.568     0.553/0.487


uSUSTAIN’s fit of the Love [18] data suggests that unsupervised learning, particularly incidental unsupervised learning, is best matched with linear category structures because the optimal clustering solution for a linear category structure involves one cluster per category. On the other hand, nonlinear category structures are not well matched to an unsupervised induction task because nonlinear category structures can only be captured with multiple clusters per category. While the linear/nonlinear distinction has not proved critical in supervised classification learning [24], Love [18] suggested that the distinction may be meaningful in unsupervised learning. uSUSTAIN’s account of the data supports this conjecture.

One counterintuitive prediction that uSUSTAIN makes is that incidental unsupervised learning may be the preferred induction task in some situations. In other words, sometimes humans may be better off not trying to master the learning problem. One such situation is when numerous stimulus dimensions are weakly correlated with one another. Under such circumstances, uSUSTAIN predicts that supervised classification learning and intentional unsupervised learning will lead to clustering solutions that over-differentiate items and therefore do not fully capture the intercorrelated structure of the categories. In contrast, incidental unsupervised learning tends to aggregate items in common clusters and is more likely to capture the underlying category structure. uSUSTAIN’s lower setting of the τ parameter (which increases uSUSTAIN’s tendency to cluster items together) for incidental unsupervised learning drives this prediction.

Despite the apparent differences between supervised classification learning, intentional unsupervised learning and incidental unsupervised learning, all three induction tasks are modeled through a common mechanism in uSUSTAIN. Beyond the current project, an important goal of our efforts is to model human learning across a range of situations and induction tasks. Doing so highlights theoretical connections across data sets and should lead to a more general understanding of human learning.

Acknowledgments

This work was supported by AFOSR Grant F49620-01-1-0295 to B. C. Love. Correspondence concerning this research should be addressed to Todd M. Gureckis, gureckis@love.psy.utexas.edu.

References

1. J. Anderson, “The adaptive nature of human categorization,” Psychol. Rev. 98 (1991) 409–429.

2. F. Ashby, S. Queller and P. M. Berretty, “On the dominance of unidimensional rules in unsupervised categorization,” Percep. Psychophys. 61 (1999) 1178–1199.

3. D. C. Berry and Z. Dienes, Implicit Learning: Theoretical and Empirical Issues, Erlbaum, Hillsdale, NJ, 1993.

4. D. Billman and J. Knutson, “Unsupervised concept learning and value systematicity: a complex whole aids learning the parts,” J. Experim. Psychol. Learn. Mem. Cogn. 22, 2 (1996) 458–475.

5. G. A. Carpenter and S. Grossberg, “A massively parallel architecture for a self-organizing neural pattern recognition machine,” Comput. Vis. Graph. Imag. Proc. 37 (1987) –115.

6. J. P. Clapper and G. H. Bower, “Learning and applying category knowledge in unsupervised domains,” Psychol. Learn. Motiv. 27 (1991) 65–108.

7. J. P. Clapper and G. H. Bower, “Category invention in unsupervised learning,” J. Experim. Psychol. Learn. Mem. Cogn. 20 (1994) 443–460.

8. A. Cleeremans, Mechanisms of Implicit Learning: Connectionist Models of Sequence Processing, MIT Press, Cambridge, MA, 1993.

9. T. Gureckis and B. C. Love, “Modeling unsupervised learning with SUSTAIN,” Proc. 15th Annual FLAIRS Conf., 2002, pp. 163–167.

10. T. Gureckis and B. C. Love, “Towards a unified account of supervised and unsupervised learning,” J. Experim. Th. Artif. Intell. 15 (2003) 1–20.

11. T. Gureckis and B. C. Love, “Who says models can only do what you tell them? Unsupervised category learning data, fits, and predictions,” Proc. 24th Ann. Conf. Cognitive Science Society, Hillsdale, NJ, Lawrence Erlbaum Associates, 2002, pp. 399–404.

12. J. A. Hartigan, Clustering Algorithms, Wiley, NY, 1975.

13. N. Hayes and D. E. Broadbent, “Two modes of learning for interactive tasks,” Cognition 28 (1988) 249–276.

14. H. S. Hock, L. Malcus and L. Hasher, “Frequency discrimination: assessing global elemental letter units in memory,” J. Experim. Psychol. Learn. Mem. Cogn. 12 (1986) 232–240.

15. T. Kohonen, Self-Organization and Associative Memory, Springer, Berlin, Heidelberg, 3rd edn., 19.

16. J. Kruschke, “ALCOVE: an exemplar-based connectionist model of category learning,” Psychol. Rev. 99 (1992) 22–44.

17. P. Lewicki, Nonconscious Social Information Processing, Academic Press, NY, 1986.

18. B. C. Love, “Comparing supervised and unsupervised category learning,” Psychon. Bull. Rev. 9, 4 (2002) 829–835.

19. B. C. Love, A. B. Markman and T. Yamauchi, “Modeling classification and inference learning,” Proc. Fifteenth Nat. Conf. Artificial Intelligence, 2000, pp. 136–141.

20. B. C. Love and D. L. Medin, “Modeling item and category learning,” Proc. 20th Ann. Conf. Cognitive Science Society, Mahwah, NJ, Lawrence Erlbaum Associates, 1998, pp. 639–4.

21. B. C. Love and D. L. Medin, “SUSTAIN: a model of human category learning,” Proc. Fifteenth Nat. Conf. Artificial Intelligence, Cambridge, MA, MIT Press, 1998, pp. 671–676.

22. B. C. Love, D. L. Medin and T. Gureckis, “SUSTAIN: a network model of human category learning,” Psychol. Rev. (2002) in press.

23. R. D. Luce, Individual Choice Behavior: A Theoretical Analysis, Greenwood Press, Westport, CN, 1959.

24. D. L. Medin and P. J. Schwanenflugel, “Linear separability in classification learning,” J. Experim. Psychol.: Human Learn. Mem. 7 (1981) 355–368.

25. R. M. Nosofsky, M. A. Gluck, T. J. Palmeri, S. C. McKinley and P. Glauthier, “Comparing models of rule based classification learning: a replication and extension of Shepard, Hovland, and Jenkins (1961),” Mem. Cogn. 22 (1994) 352–369.

26. R. M. Nosofsky, T. J. Palmeri and S. C. McKinley, “Rule-plus-exception model of classification learning,” Psychol. Rev. 101, 1 (1994) 53–79.

27. R. Ratcliff, “Connectionist models of recognition memory: constraints imposed by learning and forgetting functions,” Psychol. Rev. 97 (1990) 285–308.

28. D. E. Rumelhart, G. E. Hinton and R. J. Williams, “Learning representations by back-propagating errors,” Nature 323 (1986) 533–536.

29. R. N. Shepard, “Toward a universal law of generalization for psychological science,” Science 237 (1987) 1317–1323.

30. R. N. Shepard, C. L. Hovland and H. M. Jenkins, “Learning and memorization of classifications,” Psychol. Monogr. 75, 13, Whole No. 517 (1961).

31. S. A. Sloman, “Explanatory coherence and the induction of properties,” Thinking & Reasoning 3 (1997) 81–110.

32. B. Widrow and M. E. Hoff, “Adaptive switching circuits,” IRE WESCON Convention Record, NY, 1960, pp. 96–104.
