
Commit 40d58f8

Initial commit (0 parents)

488 files changed: 21,605 additions, 0 deletions


.github/workflows/nextjs.yml

Lines changed: 93 additions & 0 deletions
# Sample workflow for building and deploying a Next.js site to GitHub Pages
#
# To get started with Next.js see: https://nextjs.org/docs/getting-started
#
name: Deploy Next.js site to Pages

on:
  # Runs on pushes targeting the default branch
  push:
    branches: ["main"]

  # Allows you to run this workflow manually from the Actions tab
  workflow_dispatch:

# Sets permissions of the GITHUB_TOKEN to allow deployment to GitHub Pages
permissions:
  contents: read
  pages: write
  id-token: write

# Allow only one concurrent deployment, skipping runs queued between the run in-progress and latest queued.
# However, do NOT cancel in-progress runs as we want to allow these production deployments to complete.
concurrency:
  group: "pages"
  cancel-in-progress: false

jobs:
  # Build job
  build:
    runs-on: ubuntu-latest
    steps:
      - name: Checkout
        uses: actions/checkout@v4
      - name: Detect package manager
        id: detect-package-manager
        run: |
          if [ -f "${{ github.workspace }}/yarn.lock" ]; then
            echo "manager=yarn" >> $GITHUB_OUTPUT
            echo "command=install" >> $GITHUB_OUTPUT
            echo "runner=yarn" >> $GITHUB_OUTPUT
            exit 0
          elif [ -f "${{ github.workspace }}/package.json" ]; then
            echo "manager=npm" >> $GITHUB_OUTPUT
            echo "command=ci" >> $GITHUB_OUTPUT
            echo "runner=npx --no-install" >> $GITHUB_OUTPUT
            exit 0
          else
            echo "Unable to determine package manager"
            exit 1
          fi
      - name: Setup Node
        uses: actions/setup-node@v4
        with:
          node-version: "20"
          cache: ${{ steps.detect-package-manager.outputs.manager }}
      - name: Setup Pages
        uses: actions/configure-pages@v5
        with:
          # Automatically inject basePath in your Next.js configuration file and disable
          # server side image optimization (https://nextjs.org/docs/api-reference/next/image#unoptimized).
          #
          # You may remove this line if you want to manage the configuration yourself.
          static_site_generator: next
      - name: Restore cache
        uses: actions/cache@v4
        with:
          path: |
            .next/cache
          # Generate a new cache whenever packages or source files change.
          key: ${{ runner.os }}-nextjs-${{ hashFiles('**/package-lock.json', '**/yarn.lock') }}-${{ hashFiles('**.[jt]s', '**.[jt]sx') }}
          # If source files changed but packages didn't, rebuild from a prior cache.
          restore-keys: |
            ${{ runner.os }}-nextjs-${{ hashFiles('**/package-lock.json', '**/yarn.lock') }}-
      - name: Install dependencies
        run: ${{ steps.detect-package-manager.outputs.manager }} ${{ steps.detect-package-manager.outputs.command }}
      - name: Build with Next.js
        run: ${{ steps.detect-package-manager.outputs.runner }} next build
      - name: Upload artifact
        uses: actions/upload-pages-artifact@v3
        with:
          path: ./out

  # Deployment job
  deploy:
    environment:
      name: github-pages
      url: ${{ steps.deployment.outputs.page_url }}
    runs-on: ubuntu-latest
    needs: build
    steps:
      - name: Deploy to GitHub Pages
        id: deployment
        uses: actions/deploy-pages@v4

.gitignore

Lines changed: 43 additions & 0 deletions
# See https://help.github.com/articles/ignoring-files/ for more about ignoring files.

# dependencies
/node_modules
/.pnp
.pnp.*
.yarn/*
!.yarn/patches
!.yarn/plugins
!.yarn/releases
!.yarn/versions

# testing
/coverage

# next.js
/.next/
/out/

# production
/build

# misc
.DS_Store
*.pem

# debug
npm-debug.log*
yarn-debug.log*
yarn-error.log*
.pnpm-debug.log*

# env files (can opt-in for committing if needed)
.env*

# vercel
.vercel

# typescript
*.tsbuildinfo
next-env.d.ts

/src/generated/prisma

README.md

Lines changed: 36 additions & 0 deletions
This is a [Next.js](https://nextjs.org) project bootstrapped with [`create-next-app`](https://nextjs.org/docs/app/api-reference/cli/create-next-app).

## Getting Started

First, run the development server:

```bash
npm run dev
# or
yarn dev
# or
pnpm dev
# or
bun dev
```

Open [http://localhost:3000](http://localhost:3000) with your browser to see the result.

You can start editing the page by modifying `app/page.tsx`. The page auto-updates as you edit the file.

This project uses [`next/font`](https://nextjs.org/docs/app/building-your-application/optimizing/fonts) to automatically optimize and load [Geist](https://vercel.com/font), a new font family for Vercel.

## Learn More

To learn more about Next.js, take a look at the following resources:

- [Next.js Documentation](https://nextjs.org/docs) - learn about Next.js features and API.
- [Learn Next.js](https://nextjs.org/learn) - an interactive Next.js tutorial.

You can check out [the Next.js GitHub repository](https://github.com/vercel/next.js) - your feedback and contributions are welcome!

## Deploy on Vercel

The easiest way to deploy your Next.js app is to use the [Vercel Platform](https://vercel.com/new?utm_medium=default-template&filter=next.js&utm_source=create-next-app&utm_campaign=create-next-app-readme) from the creators of Next.js.

Check out our [Next.js deployment documentation](https://nextjs.org/docs/app/building-your-application/deploying) for more details.

content/about.md

Lines changed: 17 additions & 0 deletions
# About

Preferred.AI is a research undertaking at the [Singapore Management University (SMU)](http://www.smu.edu.sg/) [School of Computing and Information Systems (SCIS)](https://scis.smu.edu.sg/), led by [Hady W. Lauw](https://www.hadylauw.com/).

## Mission

Our mission is to 'push the envelope' on learning user preferences from data to improve the effectiveness and efficiency of recommendations using data mining, machine learning, and artificial intelligence. This encompasses designing algorithms for mining user-generated data of various modalities (e.g., ratings, text, images, social networks) to understand the behaviours and preferences of users (individually and collectively), and applying the mined knowledge to develop user-centric intelligent applications.

## Goals

Our goals are multi-fold: scientific, impact-oriented, and educational.

1. We push the boundaries of science by conducting high-quality research, with an eye towards publishing it in top-tier conferences and journals.
2. We seek broader impact by developing knowledge bases, libraries, or systems and sharing them for the benefit of the community. We are also interested in pursuing fruitful collaborations of mutual interest with industry partners if an opportune synergy in research focus arises.
3. We contribute towards the furtherance of training and education through our research activities within the university environment, as well as information sharing and dissemination to the broader community.

This site tracks our various activities towards these objectives.
Lines changed: 62 additions & 0 deletions
---
title: "A Quest for Fast Personalized Recommendation"
date: "2020-11-10"
author: "Andrew Le"
excerpt: "Personalized recommender systems attempt to generate a limited number of item options (e.g., products on Amazon, movies on Netflix, or videos on Youtube, etc.)..."
featuredImage: "/uploads/2020/11/download-1.jpg"
categories: ["Education"]
tags: []
seoTitle: "A Quest for Fast Personalized Recommendation - Preferred.AI"
seoDescription: "Personalized recommender systems attempt to generate a limited number of item options (e.g., products on Amazon, movies on Netflix, or videos on Youtube, etc.)..."
---

# A Quest for Fast Personalized Recommendation

Personalized recommender systems attempt to generate a limited number of item options (e.g., products on Amazon, movies on Netflix, or videos on YouTube) that are curated for each individual customer. The need for such systems is driven by the explosion of online choices, which makes it difficult for any customer to investigate every option. Therefore, more and more product and service providers are relying on these systems to improve customer experience and conversion on their websites.

An established and prevalent technique for personalized recommendation is collaborative filtering based on matrix factorization (MF), which attempts to learn customers' preferences from their historical activities. Let's assume that there are ***m*** users, denoted as ***U***, and ***n*** items, denoted as ***I***. A classic matrix factorization model typically consists of two phases:

* ***Learning***: this phase analyses customers' historical activities, represented by a sparse matrix ***R*** of size ***m* x *n***, to learn their preferences. Each customer *u* is represented by a ***d***-dimensional vector ***xu*** and each item *i* is represented by a ***d***-dimensional vector ***yi***, where ***d*** is the hypothetical number of factors that explain the behaviour of each customer. The degree of preference of a customer *u* for an item *i* is modelled as the inner product score (***xu***)<sup>T</sup>***yi***. A higher inner product score implies a higher chance that customer *u* prefers item *i*.

* ***Retrieval***: given the output vectors from the learning phase, to arrive at a personalized recommendation list for customer ***u***, we need to identify the top-*K* items in ***I*** with the highest inner product scores against ***xu***. Figure 1 illustrates the pipeline of top-*K* MF recommendation retrieval, in which ***Y*** denotes the item matrix where each row represents an item vector (a short code sketch of this scoring pipeline follows Figure 1).

![](/uploads/2020/09/mf-based-recommendation-retrieval-1.png)

**Figure 1: Top-K Retrieval of Matrix Factorization Models**
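
To make the two phases concrete, here is a minimal NumPy sketch of the scoring model behind Figure 1. The factor matrices are generated randomly purely for illustration; in a real system `X` and `Y` would come out of the learning phase, and the names and dimensions used here are assumptions, not anything prescribed by the post.

```python
import numpy as np

# Illustrative sizes: m users, n items, d latent factors.
m, n, d = 1_000, 5_000, 32
rng = np.random.default_rng(0)

# Stand-ins for the learned factors: row u is x_u, row i is y_i.
X = rng.normal(size=(m, d))
Y = rng.normal(size=(n, d))

# Preference of customer u for item i is the inner product (x_u)^T y_i.
u, i = 42, 7
score_ui = X[u] @ Y[i]

# Scoring every (u, i) pair at once gives a dense approximation of R.
scores = X @ Y.T   # shape (m, n); row u holds customer u's scores for all items
```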
The challenge of the *learning* phase is how to design effective algorithms that can learn from data at the scale of millions of customers and items. This problem has been studied extensively in the research literature. On the other hand, the challenge of the *retrieval* phase is *speed*, due to the real-time nature of the task: upon the arrival of a targeted customer *u*, the system needs to quickly generate the top-*K* items with the highest inner product scores against ***xu*** to be recommended to *u*.

Formally, the above problem of finding the top-*K* MF recommendations can be stated as follows:

**(Maximum Inner Product Search, MIPS)** Given a customer vector ***xu***, determine the item *i* such that:

$$ i = \mathrm{argmax}_{j \in I} \; x_u^T y_j $$

A straightforward solution for MIPS is to compute the inner product between ***xu*** and all item vectors **{*y1*, *y2*, …, *yn*}** and rank the resulting scores. However, such a solution scales linearly with the number of items, which incurs a prohibitive cost given the number of items in today's large-scale systems (see References \[1\], \[2\], \[3\] for more detailed analysis). To achieve real-time personalized recommendation, we should look for faster alternatives that solve the MIPS problem efficiently, specifically ones that avoid examining all items in *I*. In this post, we will explore one such solution, namely *indexing*.
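
Before turning to indexing, here is the exhaustive baseline as a minimal sketch (synthetic vectors; names and sizes are illustrative assumptions). Every query multiplies ***xu*** against all *n* item vectors, which is exactly the linear per-query cost we want to avoid.

```python
import numpy as np

n, d, K = 100_000, 32, 10
rng = np.random.default_rng(1)
Y = rng.normal(size=(n, d))    # item vectors y_1 ... y_n
x_u = rng.normal(size=d)       # the target customer's vector

# Brute-force MIPS: score all items, then keep the K largest scores.
scores = Y @ x_u                               # O(n * d) work per query
top_k = np.argpartition(-scores, K)[:K]        # unordered top-K candidates
top_k = top_k[np.argsort(-scores[top_k])]      # order the winners by score

print(top_k, scores[top_k])
```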
**Indexing for Matrix Factorization Recommendation Retrieval**

As top-*K* MF recommendation retrieval can be viewed as a top-*K* similarity search task with the inner product as the similarity function, one can apply various approximate similarity search techniques, such as indexing, quantization, or graph-based similarity search, to speed up the search process. In this post, we focus primarily on indexing, as it offers several appealing advantages: one-time pre-processing of item vectors, parallelizable search, and costs that scale linearly with the number of items.

Figure 2 depicts the two steps of a top-*K* recommender system with the aid of indexing structures:

![](/uploads/2020/09/indexing-for-MF-recommendation-retrieval-1.png)

**Figure 2: Indexing Approach for Efficient Top-K Retrieval**

* **Index construction**: process and store the item vectors ***Y*** in a data structure (e.g., hash tables, binary search trees, etc.) such that similar item vectors are stored close together (e.g., in the same buckets of a hash table or the same leaf nodes of a binary search tree).

* **Retrieval**: given the built data structure, a search for the top-*K* items most similar to a customer vector ***xu***, i.e., the top-*K* recommendations, can be performed orders of magnitude faster than a naïve exhaustive search. This is primarily due to a key property of indexing structures: they can prune potentially irrelevant items with high confidence, reducing the number of candidate items that require inner product computation and ranking.

The benefit of indexing comes at the cost of constructing the data structures that store the item vectors in new formats supporting efficient similarity search, which is a one-time cost amortized over the many query instances (a toy sketch of this two-step approach follows).
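
The toy sketch below mimics the two steps of Figure 2 with a single random-hyperplane (sign) hash table: item vectors are bucketed once at construction time, and a query scores only the items in its own bucket before exact re-ranking. A caveat: a sign hash of this kind approximates cosine similarity rather than the inner product itself, so it is only a stand-in for the MIPS-oriented indexing schemes studied in \[2\] and \[3\]; every name and parameter here is an illustrative assumption.

```python
import numpy as np
from collections import defaultdict

rng = np.random.default_rng(2)
n, d, K, n_bits = 100_000, 32, 10, 12
Y = rng.normal(size=(n, d))        # item vectors
x_u = rng.normal(size=d)           # customer vector

# --- Index construction (one-time cost): hash every item into a bucket. ---
H = rng.normal(size=(n_bits, d))   # random hyperplanes shared by items and queries

def bucket(v):
    bits = (H @ v > 0).astype(np.uint8)   # sign pattern of the projections
    return bits.tobytes()                 # hashable bucket key

index = defaultdict(list)
for item_id in range(n):
    index[bucket(Y[item_id])].append(item_id)

# --- Retrieval: score only the items that share the query's bucket. ---
candidates = index.get(bucket(x_u), [])
cand_scores = Y[candidates] @ x_u          # exact inner products on a small candidate set
order = np.argsort(-cand_scores)[:K]
top_k = [candidates[j] for j in order]

print(f"{len(candidates)} candidates scored instead of {n}")
print(top_k, cand_scores[order])
```

A production index would typically use several hash tables or probe neighbouring buckets to keep recall high, at the price of scoring more candidates.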
Despite these advantages, one factor to consider when using indexing structures for top-*K* recommendation is the growth rate of the system. As customer preferences change over time, new items appear, and old items are removed, maintaining a retrieval-efficient structure requires constant updates (e.g., insertions, deletions, or even full re-builds).

In the next part, we will investigate some further issues with using indexing for top-*K* MIPS, as well as discuss some promising solutions.

**References**

**\[1\]** Koenigstein, N., Ram, P., & Shavitt, Y. (2012). Efficient retrieval of recommendations in a matrix factorization framework. In *Proceedings of the 21st ACM International Conference on Information and Knowledge Management*. ACM.

**\[2\]** Le, D. D., & Lauw, H. W. (2017, November). Indexable Bayesian Personalized Ranking for Efficient Top-k Recommendation. In *Proceedings of the 2017 ACM on Conference on Information and Knowledge Management* (pp. 1389-1398). ACM.

**\[3\]** Le, D. D., & Lauw, H. W. (2020, February). Stochastically Robust Personalized Ranking for LSH Recommendation Retrieval. In *Proceedings of the 34th AAAI Conference on Artificial Intelligence* (AAAI-20).
Lines changed: 43 additions & 0 deletions
---
title: "Aloha, AAAI-2019"
date: "2019-03-09"
author: "Hady Lauw"
excerpt: "In January 2019, four members of Preferred.AI travelled to the AAAI-19 conference held in Honolulu, Hawaii to present 2 papers and 1 tutorial. As..."
featuredImage: "/uploads/2019/03/aaai19-hilton.jpg"
categories: ["Video", "Travel", "Presentation"]
tags: []
seoTitle: "Aloha, AAAI-2019 - Preferred.AI"
seoDescription: "In January 2019, four members of Preferred.AI travelled to the AAAI-19 conference held in Honolulu, Hawaii to present 2 papers and 1 tutorial. As..."
---

# Aloha, AAAI-2019

In January 2019, four members of Preferred.AI travelled to the AAAI-19 conference held in Honolulu, Hawaii to present 2 papers and 1 tutorial.

![](/uploads/2019/03/aaai19-acceptance.jpg)

As a country, Singapore held its own against much larger neighbors. With 122 submissions and 25 accepted papers, the country's success rate was a credible 20.5%.

On Jan 28, [Andrew](/team/andrew/) and [Hady](/team/hadylauw/) delivered a 3-hour tutorial on "[Recent Advances in Scalable Retrieval of Personalized Recommendations](/aaai19-tutorial/)". It emphasized the importance of retrieval efficiency for recommendation and covered the main strategies, such as approximate maximum inner product search, indexable representation learning, and discrete representations. We have made the [materials](https://github.com/PreferredAI/recommendation-retrieval) as well as the [video recording](https://www.youtube.com/playlist?list=PL291RJWFNQGL7MBEuBIDwMIQn8rX1Jloz) available.

![](/uploads/2019/03/aaai19-tutorial.jpg)

Andrew and Hady explored various strategies to increase the retrieval efficiency of recommender systems while maintaining accuracy

On Jan 30, [Tuan](/team/tuan/) presented the spotlight for our paper "[VistaNet: Visual Aspect Attention Network for Multimodal Sentiment Analysis](http://www.hadylauw.com/publications/aaai19a.pdf)", which showed the efficacy of review images in helping to identify the textual passages that are useful for sentiment analysis. The [implementation](https://github.com/PreferredAI/vista-net) is now available.

![](/uploads/2019/03/aaai19-vistanet.jpg)

Hady and Tuan at the poster session for VistaNet

On Jan 31, [Maksim](/team/maksim/) gave the spotlight on our paper "[CompareLDA: A Topic Model for Document Comparison](http://www.hadylauw.com/publications/aaai19b.pdf)", emphasizing that when comparison is a key property, a topic model supervised by pairwise comparisons, such as CompareLDA, is more effective. The [implementation](https://github.com/PreferredAI/compare-lda) is also now available.

![](/uploads/2019/03/aaai19-comparelda.jpg)

Maksim explaining how a topic model aligned to comparisons can reveal insightful topics about how entities are ranked with respect to one another

While the AAAI-19 program was interesting, the island of O'ahu also offered scenery as picturesque as any. [Maksim](/team/maksim/) captured the winter waves of O'ahu in the following stunning drone video.

During the conference downtime, we explored several attractions around the island. We invite you to share in our experiences with the following montage.

Mahalo!
