Upload
snuuxlab
View
137
Download
1
Embed Size (px)
Citation preview
이남민
Analyzing Web Multimedia Query Reformulation Behavior
Proceedings of the 14th Australasian Document ComputingSymposium, Sydney, Australia, 4 December 2009.
BY : Liang-Chun Jack TsengFaculty of Science and Technology
Queensland University of Technology Australia
2012.01.19
Lee Nammin
* İÞ mĕyMµ Ë �� »ñÐ ¼áĹ �� �� ¢8Ï` ćÉIe �Å* ąf reformulationÐ mĕyMµ ��º� ��ĢĦ �ä
Introduction
* mĕyMµ� 7´ĥ, ,f �ÅBp� ä� ĭAº> �ij� ��.* Ď£ē ıĊÓ ąf` �ûijĥ3 = µ]ÈÑ �Ò - Ď£ē �{Ó Ö{ Ë ��º �ī ąfÓ �Ô, �� ¥� �µò. - Ôyï ��Ð 7d ��º �ī �� �Ô, ąf 9 ŵ¢� �Ý iÒ.
�� 7d ĈØÓ mĕyMµ ��Ó Ĕ�, ��º�Ó Ý± ÄÕÑ Ôīĥ� Ìī �ÅÚÓ ąf formulation ĭAÑ �Ě� ģÄ� ÙÒ.
Introduction
* Ë `� �� : :[Ó =ÔČe �ï� Ķ�àÏ` �� ĭA �ö �4 ĥïh vkàÕ ä�3 ¯� ĺK7. - Ē� Ħ ąf 9 � � ä>Ó 8µ� ©Ô3�! - �VKÐ �Ē ú � ô �� ęÔïh ĉ�ĥ� nþ3�!
ĔĹ �ÕKÓ transaction =ÔČ`�Č Ô\Ħ ³�Ó kU �_ ä�à �ZĦ �IJÛź :Ħ �Ï`3 �ÅÚÓ �� ĭAÑ Ôīĥ3 = jÆ �çĨ.
* ý� ½�º�3 ½¡àÏ` ¢äC ąfº� kU ä�e þĀĥ� ¥Û. ĥïh ¢AÏ` ÖÖÔ �đī² Ĭ� Ovº iÐ ´Ñ 7b� ĺK¹Ò.
* ½¡C F �Ó ąf�Ó �ije Ēī �� AÛÑ �Ģ�> ĬÏ#, ąf �� áûº� Öµ#3 �ije �¬² ĥo, Ô +lº Ù3 kUàÕ Ë `� ��Ô ģÄĨ.
Related works
1. Limitation of current Web log analysis
* modification vs. reformulation- � ąfK �º Öµ#3 �ij vs. �� áûº� Öµ#3 �ij- Ĕä ĘčK` ��D ¢ Ù3 vs. �� áZÑ I\(3
* reformulation behavior - ¢ù�Ó ąfKÑ 11�Ó query modification typeÏ` ¢A �c - :[Ó web log data`�Č reformulation behaviore modification patternÑ Ēī �ö
* search strategies - F � Ô�Ó ½¡C ąf�Ó �ij æ� - reformulation ~¦Ñ ¢AÏ` 8�Ó áZÏ` �c
2. Web query reformulation behavior and search strategies
Related works
�� �� áûº�Ó½¡àÕ ąf ¢ä �äÑ �öĥ¼,�è mĕyMµ �� AÛº�Ó õÔe | ī �7.
Research Questions
1. �è mĕyMµ ��K �º ąf reformulationº õÔe �Ô#?
2. µP ąf ¢ä �äÔ Æfº �ÅÚÓ ąf reformulation ĭAÑ �¼ê#?
3. µP �� áZÔ ąf ¢ä �äÏ`�Č ÍþD ¢ ÙÑ ? µQ ��KÐ ćÉI �{Ó mĕyMµ �� ¥£ĐÓ ��º �¼ī² ĥ#?
½�Ó ÓÓ* ÚAÏ`, ½¡àÕ ēXß� `�K +lÓ vkà ä�e ��ĥ3 ú � ô ½�.
* 7´Ħ ıĊÓ mĕyMµ ��K �ÔÓ �� áZ Ĕ�KÑ ��ħ ¢ Ù ī ì �Ô�, yWÓ �� ¥£ĐÑ ÌĦ Õ�ÔēK> å�ī ì �.
Methodology
1,228,310 records on May 15th, 2006
five fields* IP : ąf� Ø^C ĄĠČÓ IPê * Cookie : Dogpile systemÔ �ẠäÓC ÍĶĦ ��º ĔäĦ ĄĠČ` �) �ÍĦ ¦�Ú* Time : �ÅÚ� ąfe Ø^Ħ ¥�* Query : ¥£Đº Ø^Ħ Ê� �� Ď£ē* Vertical : Dogpileº� �ċĦ �� Íı( Ôyï, AÀ�, ÁMÁ )
1. Dogpile log aggregation and query modification records* dogpile : meta-search engine
Methodology
* �� 7d �ÅÚ ¦�Ú` ĖE : IP + CookieÓ æĩ
* AÖĦ ąf� ½¡àÏ` #Ĉ#3 �Ð ĉ�Ĭ73 �Ñ Rĥ3 �.* ý� �a ó�E.
2. Browsing record aggregation
Methodology
��Ó transaction recordKÐ İÞ ąfà Ôá ąfe �{Ï` query modification patternÏ` �cB¹Ò.
four modification patterns* Initial query ( I ) : İÞ ąfº Ôá ąfà �ĒB3 ŵ� ¸Ò.* Addition modification ( A ) : İÞ ąf� Ôá ąfÓ sJ ŵe ĝĨĨ. (�`Ç Åµ� þ�)* Deletion modification ( D ) : İÞ ąfº Ôá ąf`�Č Ö�� 0UE.* Replacement modification ( M ) : ŵ� �åB3�� ;īï3 �Ô A¥º Öµ%.
Ô\Ħ �ëÏ` modification patternse �cīê3 ÚAij >�e åÛĨ.
3. Modification pattern classification
Methodology
* �� �� : AÖĦ �ÅÚ� Ø^Ħ, �_C Ö_Ó ��µKÔ #Ĉ#3 wă
* IPà CookieÓ æĩº Óī äÓB3 �º ;ī ¬u ŵ> ĝĨBµÙï ®Ð �Ð �`Ç ��Ó ¥ÛÏ` �ê, initial query` �c
* AÖĦ IP z Cookie æĩÏ` ��Ó ¢e ��ĥ¼ �ÅÚº Óī Ø^C êåKÓ Ĝ� �¢e ��
4. Session aggregation
Methodology
* Ö8 ąf modification record� ��Bp, � �� (º� Öµ#3 ½¡àÕ modificationÐ yf äīñ 36�Ó modification sequence type î ĥ#` ��. (ġ`�Y �Å)* modification sequence type : yf äīñ F~� �Ó modificationÓ æĩ
* 2-modification sequence3 ĥ#Ó initial query ( I ) º NT Á3 F �Ó ąf modification (replacement, addition, deletion î)` ��E.
* �\x` 2-modification sequencese Ìī 9 patterns ( 3x3) Ô ��C7.* �¤Ħ ~¦Ï`, 3-modification sequencese Ìī 27 patterns ( 3x3x3 ) Ô ��C7.
Ô ��Ó êÄ tàÐ�ÅÚ� ¢ĭĥ3 modification ¥Ć£Ó ��Ħ ĘčÑ | ĥ¼�ÅÚ� �IJĥ3 ½¡ ąf modification ĘčÑ }į(�, �� áZ ��Ñ ÌĦ §ĂàÕ ä�e ·� ÌĨ.
5. Modification sequence
I - A - RI - A - AI - A - D
I - R - RI - R - AI - R - D
I - D - RI - D - AI - D - D
Methodology
* áıàÕ modification sequenceº :ī�3, � sequenceº� �ijC ÅµÓ ¢e ��.* �\Ħ �ijKÑ �ï� �ÅÚ� Ö� Ĕä �� áZÑ øċĥ¿3ï ¼�e ė8ħ ¢ ÙÒ.
�ĭ ½�º NT áZ �ce �ÿĥ¿Ò.* Generalized reformulation- r �Ó ÅµK` ��Ñ ¥Ûĥ�, Ôķº ; iÐ ��KÑ ĝĨĥ� Ìī ŵKÑ �.- ½¡àÕ ÅµÓ å�e Ēī�> #Ĉ%.- àÐ ¢Ó ŵK� �ûĨ.- Hº #Á3 ąf3 Ôá ąfÓ ÅµK�7 Ī� ; à�# �Ð ¢Ó ŵe ĝĨĥ�> Ĩ.
* Specified reformulation- ï¡àÏ` iРŵe þ�ĥ�# �ûàÕ v�` ��- Hº #Á3 ąf� Ôá ąf�7 ; i�# AÖĦ �¢Ó ŵe �ï3 �Æ> ÙÒ.
* Dynamic reformulation- generalized à specified reformulation GÓ ĭĊe Ĩ! �×.- ���äÔ �ĵBï ®�#, áZÔ ¢gBï ®Ð �Æ×.- Ô �Æ �� »ñº� �Ý iÔ >Ãêµ² Ĩ.- Hº #Á3 ąf� Ôá ąf�7 à�# ; iРŵKÑ ĝĨ.
* Constant reformulation- ŵe �_ �û, AÓµ` :û : Ö{àÕ Ĕ�Ñ �Íĥ3 AÖĦ �- ¢ëÓ �Ï`.- ąfº ©Õ ÅµÓ �¢� Ö�à×.
6. Search strategies based on modification sequence analysis
Results
* ÆfÓ =ÔČ �º�3 mĕyMµ ��º� ï�àÕ ÍıÏ` Ôyï ��Ô #Ĉ%.* Ôyï �� > ÁMÁ �� > �MÁ ��
* initial query ( I ) � ï�à×.* Ôyï ��º� Replacement modification Ð addition modification�7 F � ; i°Ò.* ÁMÁ ��º�3 Replacement� ĸ« à Öµ&Ò.* deletionÐ �Ý à Öµ%.
1. Query modification
Results
* ÆfÓ =ÔČ �º�3 mĕyMµ ��º� ï�àÕ ÍıÏ` Ôyï ��Ô #Ĉ%.* Ôyï �� > ÁMÁ �� > �MÁ ��
* initial query ( I ) � ï�à×.* Ôyï ��º� Replacement modification Ð addition modification�7 F � ; i°Ò.* ÁMÁ ��º�3 Replacement� ĸ« à Öµ&Ò.* deletionÐ �Ý à Öµ%.
1. Query modification
Results
2. Modification sequence2- modification sequence analysis
* replacement modificationÔ sJ mĕyMµ ��º� �Ý iÔ Öµ% (IRR, IAR )
* ½¡àÕ deletion modificationÐ 'Ð �>` Öµ% (IDD < 1%)
* Ôyï, �MÁ ��º�, ÁMÁ ���7 1º S IRR ¥Ć£� iÔ Öµ%.
* ÁMÁ ��º�3 #lï F�ï �7 1º S IAD ¥Ć£� iÔ Öµ%. * sJ mĕyMµ ��Ð �Ì ��ï Ęčº� � ¬5T #lï Ęčº�> Í�Ħ �ĝe �×.
Results
3- modification sequence analysis
* replacement, addition modificationÔ iÔ Öµ%. ( IRRR , IARR )
* Ôyï, �MÁ ��º�, ÁMÁ ���7 1º S IRRR ¥Ć£� iÔ Öµ%.
* ÁMÁ ��º�3 #lï F�ï �7 1º S IADA ¥Ć£� iÔ Öµ%. * sJ mĕyMµ ��Ð �Ì 7��ï Ęčº� � ¬5T #lï Ęčº�> Í�Ħ �ĝe �×.
* �Ì 5�Ó ¥Ć£ îº Ħ�ï ĘčhÔ De ĝĨ
* NT�, ąf modification ü�º replacingº :Ħ �ÅÚÓ �IJÃ, deletionº :Ħ � �IJe ĴÕħ ¢ ÙÒ.
* modification sequence��º� �Ý iÔ Öµ%Ñ ĴÕĬÏx`, ½¡àÏ` replacement� Öµ#3 �Ó �ãÏ` �� áZÑ æ�Ĩ.* sJ IRR ¥Ć£Ó ³ 40%� Dynamic search strategye #Ĉ*.
* dynamic strategyÓ �ÎÐ IRRRº� 50%Ô�×.
* Ôyï ��º� constant search strategy� �Ý /Ò.* �\x` Ôyïe ��ħ O� 7d ÍıÓ mĕyMµ �� O �7 AÓµ, �_ ŵK�Ó replacement� �Ý iÔ Öµ%.�\x` Ô �Æ� ŵ å �4Ô �Ý >ÈÔ E.
* ÁMÁ �� �ÅÚ� dynamic áZÑ iÔ øċĨ.
* :û ŵKÑ ĴÕĥ� Ìī ğ� ¦� �4Ñ �İ* X<ĥ �T) 1465�Ó constant IRRR ¥Ć£K î 3003�Ó Åµ� �ûB¹ÒÑ ĴÕĨ.* �ûC ŵ¨Ó 70% Ô�Ô (2125) �Ð ğ�e �ï� Ù¹�, constant search ¥Ć£º� AÓµ# �Ð �æÓ ½� ��µ` �ûC73 �Ñ ĴÕħ ¢ Ù¹7.
Results
3. Search strategies based on modification sequence
* modification sequence��º� �Ý iÔ Öµ%Ñ ĴÕĬÏx`, ½¡àÏ` replacement� Öµ#3 �Ó �ãÏ` �� áZÑ æ�Ĩ.* sJ IRR ¥Ć£Ó ³ 40%� Dynamic search strategye #Ĉ*.
* dynamic strategyÓ �ÎÐ IRRRº� 50%Ô�×.
* Ôyï ��º� constant search strategy� �Ý /Ò.* �\x` Ôyïe ��ħ O� 7d ÍıÓ mĕyMµ �� O �7 AÓµ, �_ ŵK�Ó replacement� �Ý iÔ Öµ%.�\x` Ô �Æ� ŵ å �4Ô �Ý >ÈÔ E.
* ÁMÁ �� �ÅÚ� dynamic áZÑ iÔ øċĨ.
* :û ŵKÑ ĴÕĥ� Ìī ğ� ¦� �4Ñ �İ* X<ĥ �T) 1465�Ó constant IRRR ¥Ć£K î 3003�Ó Åµ� �ûB¹ÒÑ ĴÕĨ.* �ûC ŵ¨Ó 70% Ô�Ô (2125) �Ð ğ�e �ï� Ù¹�, constant search ¥Ć£º� AÓµ# �Ð �æÓ ½� ��µ` �ûC73 �Ñ ĴÕħ ¢ Ù¹7.
Results
3. Search strategies based on modification sequence
Limitations
* ÚAÏ` �ÅÚKÓ ąf modification ĭAÑ | ĥ] Ĭ� Ovº,constant search sequencesº� �ûC ŵe “ğ�”`h ��ĬÒ.�\x` reformulation �ä îº ¢äB3 ŵKÓ typeº :ī�3 ¯ ¢ ¸Ò.
* ĥïh �ÅÚÓ á{àÕ �� áZÐ ÆfÓ ��º� Íþ �4Ĩ.
* Æf3 7´Ħ 7d mĕyMµ �� �ÔÓ r�ï ?ĔĦ Ĕ�KÑ | ī(3= ��. Ô | KÐ ąf modification ĭA� �� áZÓ �ãº� Ö{àÕ Ë ��� ��Bµ² Ĩ.
Conclusion* ąf ¢ä ~�Ñ �{Ï` �ÅÚÓ mĕyMµ �� ĭAÑ æ�Ĩ.
* ąf ¢äÓ 60% ä>� �`Ç �� êåe �¦ij ĥ¿� ( I ) * Ôyï, �MÁ �� �ÅÚKÐ ÁMÁº �ī ³� ; ąfe iÔ ¢äĥ3 �Ï` #Ĉ%.* á{àÏ` ąfe �ûĥ3 �ÎÔ iÒ.( ĔĹ Ôyï , �MÁ ��º� iÔ #Ĉ%)* �� áZ āpº�, constant search strategy� iÔ #Ĉ$�Ï` �¬, ŵ å �4(AÓµ ÷�, �_ ŵ ÷�) ïÊÔ îÄ�Ô µ2ä> #Ĉ%.
* ÁMÁ ��º� Initial Ô replacementº �ī İ�Ĺ iÔ #Ĉ%. * ÁMÁ ��º�3 addition ąf� iÔ Öµ#3 ě×.
* ½¡àÏ` �û�äÔ Öµ#3 �Ð, AÓµ, �_ ŵe ÷3 �Ñ >Ãêp ��º >ÈÔ D �ÔT3 �Ñ �¼í.
* Ôyï ��Ó �Æ ; �ûàÕ �KÑ øċĥ3 áZ �ĮÑ �¼í.* �KÓ våe ãõ �ûijĥ3 �Ñ >Ãêµ² Ĩ.
* Ô ½�º�3 ÚA ��Ó âõe å¥ĥ¿�, 7d =ÔČ�� Ö{ Ë �ÅÚÓ �� ĭAÑ ��ħ ¢ Ù Ĭ73 = ÓÓ�,
* ä[àÕ ��e ÚAàÏ` hK ¢ Ù¹73 = ÓÓ� ÙÒ.* Ô ½�º� # ��KÑ �ï� ąf �ĩ �äÑ @37Jï, ��Ó ĶÎ�Ñ Į�¥ć� Ìī �Åħ ¢ ÙÒ.
Conclusion
* ¥>3 é°Ï# so what..?
* İ�Ó #¾ 2"..findingÔ ³Ĩ.
* :[Ó =ÔČe s¬� Ęčij Ĭ73 �Ð éÏ#,
* ÚAijħ ¢ Ù>a � ĘčÓ Ĕ�Ñ �¬(µ �İĬ3=, `ðÓ �� �qÔ �çĨ. : �ëÑ �Ç �� �qÔ ģÄ.(áZÓ �ƺh �ĭ½�e ¶�) �ĭ ½�� ¸7p �ëÑ �Æ3 � Úû� .vÔ Bµ> é°Ñď=..
* �¼ê3 =ÔČ� +u iÒ : îãàÏ` �qĥ�Ú ĥ3 findingÔ Ğe Ēī Ü �×. iÐ ÓyÙ3 İ�Ñ �¼ê3 ��73, jĤB3 ī�Ô Ĩ! Ùµ² Ü ª .vÔ D L.
* ‘=ÔČ� ¬6 Õ�Ôēe’
End.