Notes of a Dabbler

Exploring OMPR with HiGHS solver

Notesofdabbler — Sun, 11 Sep 2022 00:00:00 GMT

There is a class of software for modeling optimization problems referred to as algebraic modeling systems which provide a unified interface to formulate optimization problems in a manner that is close to mathematical depiction and have the ability to link to different types of solvers (sparing the user from solver specific ways of formulating the problem). Both commercial and open source options are available. GAMS and AMPL are examples of commercial options. The popular open source options are JuMP in Julia and Pyomo in python. I have typically used Pyomo in Python but have explored using it from R. I recently became aware of algebraic modeling system in R provided by OMPR package developed by Dirk Schumacher.

There are commercial and open-source options available for solvers also. For a class of optimization problems referred to as Mixed Integer Linear Programs (MILP), the commercial solvers such as CPLEX, and GUROBI perform significantly better than open source solvers such as glpk, and CBC. A new open-source solver HiGHS has been developed recently that has generated quite a bit of buzz and by different accounts looks like a promising option. There is now a highs package in R that can call the HiGHS solver.

In this blog, I wanted to explore using OMPR modeling system with HiGHS solver by using it to solve a few examples of LP/MILP problems.

Example 1: Example from highs package

Here I want to just describe the example in mathematical notation and show how OMPR model is close to mathematical notation. The full details of this example are in this location.

Example Problem in highs package

OMPR model

mdl = MIPModel() %>%
      add_variable(x0, lb = 0, ub = 4, type = "continuous") %>%
      add_variable(x1, lb = 1, type = "continuous") %>%
      set_objective(x0+x1+3, sense = "min") %>%
      add_constraint(x1 <= 7) %>%
      add_constraint(x0 + 2*x1 <= 15) %>%
      add_constraint(x0 + 2*x1 >= 5) %>%
      add_constraint(3*x0 + 2*x1 >= 6)

Since OMPR can directly call HiGHS optimizer, we can solve the model and get solution as shown below.

# solve model
s = mdl %>% solve_model(highs_optimizer())

# get solution
s$status
s$objective_value
s$solution

Solving the above problem results in an objective value of 5.75 and solution of (0.5, 2.25)

Example 2: Transportation Problem

This example discusses a transporation problem from GAMS model library where the goal is to find the minimum cost way to meet market demand with available plant capacity. We just show how the OMPR package can handle variables involving indices using this example. The full description of this example is in this location.

Mathematical Formulation
Model build using OMPR

where

is the quantity to be shipped from plant to market (decision variable)
Objective (a) is to minimize shipping cost
Constraint (b) ensures that total supply from a plant is below capacity
Constraint (c) ensures that demand for each market is met.

np = length(plants)
nm = length(mkts)
# create ompr model
mdl = MIPModel() %>%
  add_variable(x[i, j], i=1:np, j=1:nm, type = "continuous",lb = 0) %>%
  # objective: min cost
  set_objective(sum_over(cost(i, j) * x[i, j], i = 1:np, j = 1:nm), sense = "min") %>% 
  # supply from each plant is below capacity
  add_constraint(sum_over(x[i, j], j = 1:nm) <= cap[i], i = 1:np) %>%  
  # supply to each market meets demand
  add_constraint(sum_over(x[i, j], i = 1:np) >= dem[j], j = 1:nm)

The figure on the left show the supply network (plants on top and markets below with numbers being capacity for plants and demand for markets). The figure on the right shows the solution where Chicago market is supplied by Seattle plant and San Diego plant supplies both New York and Topeka markets.

Network Information

Solution

Example 3: Map Coloring Problem

This example discusses a map coloring problem where the goal is to use the minimum number of colors so that no two adjacent states in the US map have the same color. In this example also, I am just showing the mathematical formulation and OMPR model. The full description of this example is in this location.

Mathematical Formulation
Model build using OMPR

where:

if color is used, if state is colored with color .
Objective (a) is to minimize the number of colors used
Constraint (b) ensures that each state gets some color
Constraint (c) ensures that if state and are adjacent, they don’t get the same color.

# OMPR model
ns = nrow(nodes_df)
nc = 4
edge_str = edge_df %>% mutate(edge_str = glue("{fromid}_{toid}")) %>% pull(edge_str)
mdl = MIPModel()
mdl = mdl %>% add_variable(x[i, c], i = 1:ns, c = 1:nc, type = "integer", lb = 0, ub = 1)
mdl = mdl %>% add_variable(y[c], c = 1:nc, type = "integer", lb = 0, ub = 1)
mdl = mdl %>% set_objective(sum_over(y[c], c=1:nc), sense = "min")
mdl = mdl %>% add_constraint(sum_over(x[i, c], c = 1:nc) == 1, i = 1:ns)
mdl = mdl %>% add_constraint(x[i, c] + x[j, c] <= y[c], i = 1:ns, j = 1:ns, c = 1:nc, glue("{i}_{j}") %in% edge_str)

Solving this problem give the following map coloring

Using Pyomo from R through the magic of Reticulate

Notesofdabbler — Wed, 01 Jul 2020 00:00:00 GMT

Pyomo is a python based open-source package for modeling optimization problems. It makes it easy to represent optimization problems and can send it to different solvers (both open-source and commercial) to solve the problem and return the results in python. The advantage of pyomo compared to commercial software such as GAMS and AMPL is the ability to code using standard python syntax (with some modifications for pyomo constructs). Another open source package for modeling optimization problems is JuMP in Julia language.

My goal in this blog is to see how far I can get in terms of using Pyomo from R using the reticulate package. The simplest option would be to develop the model in pyomo and call it from R using reticulate. However, it still requires writing the pyomo model in python. I want to use reticulate to write the pyomo model using R. The details of the blog post (along with code) are in this location.

Summary

Here I covered two examples to show how to develop a pyomo model from R using the reticulate package. While it might still be easier to develop the pyomo model in python (since it was meant to be that way), I found that it is possible to develop pyomo models in R also fairly easily albeit with some modifications (some maybe less elegant compred to the python counterpart). It may still be better to develop more involved pyomo models in python but reticulate offers a way to develop simple to intermediate levels models directly in R. I am summarizing key learnings:

Need to overload arithmetic operators to enable things like addition etc. between pyomo objects
Use the option convert = FALSE to retain pyomo objects as python objects potentially avoid issues that are hard to troubleshoot.
Lack of list comprehension in R makes some of the constraint specifications more verbose but still works.
Need to be careful about indexing (sometimes need to explicitly specify a tuple and sometimes not)

Proofs without Words using gganimate

Notesofdabbler — Sun, 26 Apr 2020 00:00:00 GMT

I recently watched the 2 part workshop (part 1, part 2) on ggplot2 and extensions given by Thomas Lin Pedersen. First of, it was really nice of Thomas to give the close to 4 hour workshop for the benefit of the community. I personally learnt a lot from it. I wanted to try out gganimate extension that was covered during the workshop.

There are several resources on the web that show animations/illustrations of proofs of mathematical identities and theorems without words (or close to it). I wanted to take a few of those examples and use gganimate to recreate the illustration. This was a fun way for me to try out gganimate.

Example 1:

This example is taken from AoPS Online and the result is that sum of first odd numbers equals .

The gganimate version of the proof (using the method in AoPS Online) is shown below (R code, html file)

Example 2:

This example is also taken from AoPS Online and the result is:

The gganimate version of the proof (using the method in AoPS Online) is shown below ( R code, html file):

Example 3

This example from AoPS Online illustrates the result

The gganimate version of the proof (using the method in AoPS Online) is shown below ( R code, html file):

Example 4

According to Pythagoras theorem, where , , are sides of a right angled triangle (with being the side opposite angle)

There was an illustration of the proof of pythogoras theorem in a video from echalk.

The gganimate version of the proof is shown below ( R code, html file)

In summary, it was great to use gganimate for these animations since it does all the magic with making transitions work nicely.

Keeping up with Tidyverse Functions using Tidy Tuesday Screencasts

Notesofdabbler — Tue, 06 Aug 2019 00:00:00 GMT

David Robinson has done several screencasts where he analyzes a Tidy Tuesday dataset live. I have listened to a few of them and found them very interesting and instructive. As I don’t use R on a daily basis, I have not kept up with what the latest is in Tidyverse. So when I listened to his screencasts, I learnt functions that I was not aware of. Since I sometimes forget which function I learnt, I wanted to extract all the functions used in the screencasts so that it is easier for me to refer to the ones that I am not aware of but should learn.

The approach I took is:

Get all the Rmd analysis files from the screencast github repo.
Extract the list of libraries and functions used in each .Rmd file
Plot frequencies of function use and review functions that I am not aware of

The html file with all the code and results is in this location. The R file used to generate the html file is here.

The plot below shows the how many analyses used a particular package.

The top library as tidyverse is to be expected. It is interesting that lubridate is second. I can see that broom is used quite a bit since after exploratory analysis in the screencast, David explores some models. There are several packages that I was not aware of but I will probably look up the following: widyr, fuzzyjoin, glue, janitor, patchwork and the context in which they were used in the screencast.

The plot below shows the number of functions used from each package.

As expected, most used functions are from ggplot2, dplyr, tidyr since there is lot of exploratory analysis and visualization of data in the screencasts.

The next series of plots shows the individual functions used from the packages.

Based on the above figures, I am listing below some functions that I was not aware of and should learn

count function in dplyr as a easier way to count for each group or sum a variable for each group.
geom_col function in ggplot2 for bar graphs
I became aware of forcats package for working with factors. fct_reorder and fct_lump from the package were used frequently.
tidyr functions - nest/unnest, crossing, separate_rows
I realized that I know only a few functions in stringr and should learn more about several functions that were used in the screencast.

Fastai Collaborative Filtering with R and Reticulate

Notesofdabbler — Sun, 01 Apr 2018 00:00:00 GMT

Jeremy Howard and Rachel Thomas are founders of fast.ai whose aim is to make deep learning accessible to all. They offer a course called Practical Deep Learning for Coders (Part 1). The last session, taught by Jeremy, was in Fall 2017 and the videos were released early January 2018. Their approach is top down by showing different applications first as black boxes followed by progressive peeling of the black box to teach the details of how things work. The course uses python and they have developed a python library fastai that is a wrapper around PyTorch.

I wanted to learn reticulate by trying to create a R version of one of the python notebooks from that class. The class covers the topic of collaborative filtering in lecture 5 and lecture 6. The dataset used is a sample of movielens dataset where about ~670 users have rated ~9000 movies. The objective is to develop a model to predict the rating that a user will give for a particular movie.

The Jupyter notebook for this topic is divided into 2 portions:

In the first half, the model is developed using just high level fastai functions. The R notebook for the first half is located here.
In the second half, the model is developed from scratch and 3 different types of models are discussed going from matrix factorization type model to deep learning type models. The R notebook for the second half is located here.

Since the first half involved mainly python functions from fastai library, it seemed like a good use case for reticulate since we could use reticulate just for model development and use R functions for other pre and post processing tasks. The second half involved model building from scratch. In pyTorch, custom models need to be written as python classes. While it was still possible to use reticulate in this case, this may not be the ideal use case since it might be better for somebody developing custom models to do the whole work in python. But once they wrap it into a python package, it is easier to use from R. Overall, reticulate was great to work with and it made it very easy to translate a python function to an equivalent R function. It is a great addition to the R packages.

Exploring Instacart Dataset with PCA

Notesofdabbler — Mon, 22 May 2017 00:00:00 GMT

Recently, Instacart released a dataset of ~3 million orders made by ~200,000 users at different days of week and times of day. There is also an ongoing Kaggle competition to predict which products a user will buy again. My goal here is more modest where I just wanted to explore the dataset to find patterns of purchasing behaviour by hour of day, day of week and number of days prior to current order. An example of this kind of analysis is also shown in their blog. Here I wanted to explore if I can find such kind of patters by using the very common and popular dimension reduction technique - Principal Component Analysis (PCA). There are several great resources that introduce PCA if you are not familiar with PCA. One of the resources is the set of video lectures on machine learning by Prof. Hastie and Prof. Tibshirani.

The general approach that I have followed is:

Do principal component analysis on the data (each row is a product, each column is a time period (hour of day, day of week or number of days prior to current order))
Review the loading plots of first two principal components to see purchase patterns
Identify top 20 products that have high scores in either first or the second principal component
Check the purchasing pattern by checking the average number of orders for the products that were identified as having top scores in one of the principal components.

Spoiler Alert: Since my analysis is basic, don’t be disappointed if there are no big Aha moments (there will be none). But I think it is still fun to see how we can extract such information directly from data.

I downloaded the data from the following link. The data dictionary is in the following link. The full code with results is in the following location.

Below are some basic info on the datasets

The number of users are ~200,000.
The number of orders are ~3.4M. The number of products are ~50K or which ~5K account for 80% of total orders

PCA to find patterns of purchase by hour of day

The goal here is to find products with different patterns of purchase timing by hour of day with PCA. Dataset for PCA has for each product (rows), the percentage of product orders at each hour of day (column). Since all the data is in percentages, I didn’t do any further scaling of data.

The plot of cumulative variance shows that first component accounts for 44% of variance, first two account for 58% and first 3 account for 67% of variance.

Next, we will look at the first two loadings since first 2 components account for 58% of variance.

First principal component loading PC1 indicates a pattern of either higher percentage of purcahses in the morning or evening. The second principal component loading indicates a pattern where there is higher purchase around 11am and 4pm. To check which product items follow these patterns, we look at products that either have high scores or low scores on a principal component. So here we take the top 20 and bottom 20 products in terms of their scores on PC1. The actual pattern still may not quite match the loading plot since the overall pattern is a combination of all principal component loadings.

Below is the table that lists the actual products that are in top and bottom scores of PC1. Ice cream purchases tend to occur more in the evening. Items like granola bars, krispie treats, apples are purchased more in the morning.