Applications of Matrix Methods in Artificial Intelligence and Machine Learning (AMAIML) Session 2
Time and Date: 15:45 - 17:25 on 11th June 2018
Room: M5
Chair: Kourosh Modarresi
380 | An Efficient Deep Learning Model for Recommender Systems [abstract] Abstract: Recommending the best and optimal content to user is the essential part of digital space activities and online user interactions. For example, we like to know what items should be sent to a user, what promotion is the best one for a user, what web design would fit a specific user, what ad a user would be more susceptible to or what creative cloud package is more suitable to a specific user.
In this work, we use deep learning (autoencoders) to create a new model for this purpose. The previous art includes using Autoencoders for numerical features only and we extend the application of autoencoders to non-numerical features.
Our approach in coming up with recommendation is using “matrix comple-tion” approach which is the most efficient and direct way of finding and evaluating content recommendation. |
Kourosh Modarresi and Jamie Diner |
381 | Standardization of Featureless Variables for Machine Learning Models using Natural Language Processing (NLP) [abstract] Abstract: AI and machine learning are mathematical modeling methods for learning from data and producing intelligent models based on this learning. The data these models need to deal with, is normally a mixed of data type where both numerical (continuous) variables and categorical (non-numerical) data types. Most models in AI and machine learning accept only numerical data as their input and thus, standardization of mixed data into numerical data is a critical step when applying machine learning models. Having data in the standard shape and format that models require often a time consuming, nevertheless very significant step of the process. |
Kourosh Modarresi and Abdurrahman Munir |
382 | Generalized Variable Conversion using K-means Clustering and Web Scraping [abstract] Abstract: The world of AI and Machine Learning is the world of data and learning from data so the insights could be used for analysis and prediction. Almost all data sets are of mixed variable types as they may be quantitative (numerical) or qualitative (categorical). The problem arises from the fact that a long list of methods in Machine Learning such as “multiple regression”, “logistic regression”, “k-means clustering”, and “support vector machine”, all to be as examples of such models, designed to deal with numerical data type only. Though the data, that need to be analyzed and learned from, is almost always, a mixed data type and thus, standardization step must be undertaken for all these data sets. The standardization process involves the conversion of qualitative (categorical) data into numerical data type. |
Kourosh Modarresi and Abdurrahman Munir |
372 | Parallel Latent Dirichlet Allocation on GPUs [abstract] Abstract: Latent Dirichlet Allocation (LDA) is a statistical technique for topic modeling. Since it is very computationally demanding, its parallelization has garnered considerable interest. In this paper, we systematically analyze the data access patterns for LDA and devise suitable algorithmic adaptations and parallelization strategies for GPUs. Experiments on large-scale datasets show the effectiveness of the new parallel implementation on GPUs. |
Gordon Moon, Israt Nisa, Aravind Sukumaran-Rajam, Bortik Bandyopadhyay, Srinivasan Parthasarathy and P. Sadayappan |
348 | Improving Search through A3C Reinforcement Learning based Conversational Agent [abstract] Abstract: We develop a reinforcement learning based search assistant which can assist users through a sequence of actions to enable them realize their intent. Our approach caters to subjective search where user is seeking digital assets such as images which is fundamentally different from the tasks which have objective and limited search modalities. Labeled conversational data is generally not available in such search tasks, to counter this problem we propose a stochastic virtual user which impersonates a real user for training and obtaining bootstrapped agent. We develop A3C algorithm based context preserving architecture to train agent and evaluate performance on average rewards obtained by the agent while interacting with virtual user. We evaluated our system with actual humans who believed that it helped in driving their search forward with appropriate actions without being repetitive while being more engaging and easy to use compared to conventional search interface. |
Milan Aggarwal, Aarushi Arora, Shagun Sodhani and Balaji Krishnamurthy |