{"id":985,"date":"2016-03-04T01:05:45","date_gmt":"2016-03-04T01:05:45","guid":{"rendered":"http:\/\/ec2-52-79-86-100.ap-northeast-2.compute.amazonaws.com\/?p=985"},"modified":"2024-05-01T11:54:39","modified_gmt":"2024-05-01T02:54:39","slug":"answering-tasks-for-ai","status":"publish","type":"post","link":"https:\/\/blog.themusio.com\/?p=985","title":{"rendered":"Answering Tasks for AI"},"content":{"rendered":"<p><strong>Goal<\/strong><br \/>\nThe aim of this short summary is to provide an overview of the tasks an AI system is, should be and will be capable of executing in the near future.<\/p>\n<p>In particular, we focus on classifying tasks with respect to NLP.<\/p>\n<p><strong>Motivation<\/strong><br \/>\nWhen testing existing algorithms, it is important to be able to interpret the results clearly.<br \/>\nThis can be achieved by setting up specific learning environments in such a way that the complexity of the task is always under control.<br \/>\nMore generally, the state-of-the-art problems determine the next development steps for Musio.<\/p>\n<p><strong>Ingredients<\/strong><br \/>\ncommon sense, coreference, compound, conjunction, deduction, induction, argument relation, counting, negation, indefinite knowledge, time reasoning, positional and size reasoning, path finding, motivational reasoning.<\/p>\n<p><strong>Steps<\/strong><br \/>\nLet us start by mentioning several existing AI tests.<br \/>\nThe Allen Institute for AI provides a test on general knowledge, called Aristo, in the form of science exams for 4th, 8th and 12th grade students.<br \/>\nThese exams consist of multiple-choice questions, and only recently did the best algorithms reach a score of 0.60 on the 8th grade exam.<br \/>\nA more standardized test is the Winograd Schema Challenge, in which a sentence containing a coreference has to be completed by choosing between two possible words.<br \/>\nBoth of these tests require general knowledge about the world that is not provided in the test 
situation.<\/p>\n<p>In contrast, the following tests contain the background knowledge required to answer distinct questions.<br \/>\nMCTest provides 660 stories with associated multiple-choice questions, each of which requires a different kind of reasoning.<br \/>\nMore specific is the Children&#8217;s Book Test (CBT), which measures language modeling in a wider linguistic context by means of the aforementioned sentence completion task.<\/p>\n<p>Closely related to real-world tasks are the following tests.<br \/>\nThe CNN QA test is based on news articles whose abstractive bullet-point summaries have been rephrased as questions.<br \/>\nFocusing on the topic of movies, the Movie Dialog dataset supports factoid questions and recommendations.<br \/>\nDespite being limited to one topic, this task is better suited for training and building dialog systems.<\/p>\n<p>A different approach to overcoming the general scarcity of large datasets is to simulate the needed data.<br \/>\nClearly, simulation cannot capture the full complexity of natural language. However, it makes it possible to specify certain classes of self-contained tasks.<br \/>\nTo name only a few: supporting-fact questions require distinguishing relevant from irrelevant facts, deduction and induction require a certain kind of logical reasoning, and positional and size reasoning make it possible to recognize objects.<br \/>\nAmong the hardest tasks is path finding between two points A and B, given a relative description with respect to a point C.<br \/>\nTasks that require a kind of long-term memory are especially hard.<\/p>\n<p>The algorithms tested on these tasks take the form of N-gram models, structured support vector machines or certain types of recurrent neural networks.<\/p>\n<p>Memory networks with neural components are also very promising.<\/p>\n<p>With regard to Musio&#8217;s focus on emotional and sentimental reasoning, tasks beyond logical reasoning should be 
considered.<\/p>\n<p><strong>Resources<br \/>\n<\/strong>&#8220;<a href=\"http:\/\/arxiv.org\/pdf\/1502.05698v10.pdf\" target=\"_blank\" rel=\"noopener\">TOWARDS AI-COMPLETE QUESTION ANSWERING: A SET OF PREREQUISITE TOY TASKS<\/a>&#8221; (PDF). <em>TOWARDS AI-COMPLETE QUESTION ANSWERING: A SET OF PREREQUISITE TOY TASKS<\/em>. December 2015. Retrieved February 23, 2016.<br \/>\n&#8220;<a href=\"https:\/\/github.com\/facebook\/bAbI-tasks\" target=\"_blank\" rel=\"noopener\">Task generation for testing text understanding and reasoning<\/a>&#8221; (GIT). <em>Task generation for testing text understanding and reasoning<\/em>. Retrieved February 23, 2016.<br \/>\n&#8220;<a href=\"http:\/\/www.thespermwhale.com\/jaseweston\/babi\/abordes-ICLR.pdf\" target=\"_blank\" rel=\"noopener\">Artificial Tasks for Artificial Intelligence<\/a>&#8221; (PDF). <em>Artificial Tasks for Artificial Intelligence<\/em>. May 2015. Retrieved February 23, 2016.<br \/>\n&#8220;<a href=\"http:\/\/arxiv.org\/pdf\/1511.02301v3.pdf\" target=\"_blank\" rel=\"noopener\">THE GOLDILOCKS PRINCIPLE: READING CHILDREN\u2019S BOOKS WITH EXPLICIT MEMORY REPRESENTATIONS<\/a>&#8221; (PDF). <em>THE GOLDILOCKS PRINCIPLE: READING CHILDREN\u2019S BOOKS WITH EXPLICIT MEMORY REPRESENTATIONS<\/em>. January 2016. Retrieved February 23, 2016.<br \/>\n&#8220;<a href=\"http:\/\/arxiv.org\/pdf\/1506.03340v3.pdf\" target=\"_blank\" rel=\"noopener\">Teaching Machines to Read and Comprehend<\/a>&#8221; (PDF). <em>Teaching Machines to Read and Comprehend<\/em>. November 2015. Retrieved February 23, 2016.<br \/>\n&#8220;<a href=\"http:\/\/arxiv.org\/pdf\/1511.06931v3.pdf\" target=\"_blank\" rel=\"noopener\">EVALUATING PREREQUISITE QUALITIES FOR LEARNING END-TO-END DIALOG SYSTEMS<\/a>&#8221; (PDF). <em>EVALUATING PREREQUISITE QUALITIES FOR LEARNING END-TO-END DIALOG SYSTEMS<\/em>. 
January 2016. Retrieved February 23, 2016.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Goal The aim of this short summary is to provide an overview of the tasks an AI system is, should be and will  [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_exactmetrics_skip_tracking":false,"_exactmetrics_sitenote_active":false,"_exactmetrics_sitenote_note":"","_exactmetrics_sitenote_category":0,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[3642,3640],"tags":[3650,3652,4172,4080,3760,3656,3762,3658,3788,3700,3662,4050,4082,4316,4174,3702,3664,3816,4146,3710,4310,3712,4084,4318],"class_list":["post-985","post","type-post","status-publish","format-standard","hentry","category-ai-en","category-all-en","tag-ai-ja-en","tag-aka-ja-en","tag-algorithms-ja-en","tag-answers-en","tag-artificial-intelligence-en","tag-baggage-en","tag-children-book-ja-en","tag-christmas-en","tag-classifier-en","tag-cmos-en","tag-cognitive-behavioral-therapy-en","tag-coherence-of-values-en","tag-communication-robot-en-en","tag-conditional-random-fields-en","tag-connection-en","tag-contents-en","tag-crowd-funding-en","tag-language-en","tag-memory-en","tag-musio-en","tag-n-gram-en","tag-neural-networks-en","tag-questions-ja-en","tag-tasks-en"],"aioseo_notices":[],"jetpack_sharing_enabled":true,"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/blog.themusio.com\/index.php?rest_route=\/wp\/v2\/posts\/985","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.themusio.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.themusio.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.themusio.com\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.themusio.com\/index.php?rest_route=%2Fwp%2Fv2%2Fc
omments&post=985"}],"version-history":[{"count":3,"href":"https:\/\/blog.themusio.com\/index.php?rest_route=\/wp\/v2\/posts\/985\/revisions"}],"predecessor-version":[{"id":10903,"href":"https:\/\/blog.themusio.com\/index.php?rest_route=\/wp\/v2\/posts\/985\/revisions\/10903"}],"wp:attachment":[{"href":"https:\/\/blog.themusio.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=985"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.themusio.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=985"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.themusio.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=985"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}