This document explores the technical back-end design for SC2 Training Grounds. It also functions as the development environment for sc_training, a prototype Python library that demonstrates parts of a high-level implementation of this design.

Note: this prototype is primarily meant as a tool for reflective practice. This reflection is part of my PhD research project, "Encouraging Play Experience Expansion Through Machine-Learning-Based Recommendations: User-Experience Design considerations in the use of Machine-Learning-Based Recommendation System for Video games".

Executive Summary

sc_training is a python library developed to document, organise and communicate the design of SC2 Training Grounds' computer model. I developed this library for the following purposes:

To review the technical challenges associated with creating a solution such as SC2 Training Grounds.
To assess the possibilities and limitations of the resources I could use to develop this solution. Therefore, allowing me to design a more realistic and grounded user experience for the application.
To provide a proof-of-concept that tests the technical feasibility of the project.

In this case, I used literate and iterative programming to develop the library while providing a narrative explanation and documentation of this process (Knuth, 1984; Ramsey, 1994; Literate Programming, 2014; Kery et al., 2018). This methodology allows rapid testing and exploration of ideas while integrating them into a functioning program (Howard, 2019). Hence, the library is simultaneously the program's documentation and the program itself.

Accessing the Library's Files and Documentation

Note: a pdf version of this document is attached as Appendix C of my PhD project. This format is included for convenience. However, I remind the reader that, in this format, the library loses part of its original functions as an interactive artefact. Readers can download all the files described below from the following GitHub repository.

To facilitate the use of literate programming, I developed this library using I used nbdev. The use of this tool means that this library is rendered in various ways to accommodate different interests.

First, since nbdev uses interactive notebooks (see Jupyter notebooks) as a development environment, the library was developed and is originally contained in this format. These interactive documents intertwine textual elements that describe the ideas that support its components and the source code that defines its functions (see Jupyter/IPython Quick Start Guide for a guide on how to set up the Jupyter and IPython libraries to use these documents).
Second, the library is also exported as a traditional python library. Readers can find a functioning version of all the library's modules in the repository's sc_training folder, which they can use to replicate and verify any examples and experiments explored in this deliverable.
Third, these notebooks produce a website that serves as the library's Interactive Documentation. This documentation omits many technical details focusing on the descriptions of the development process and the library's structure.

Setup

Before going into the document itself, it is important to clarify some aspects regarding their composition.

Because of the nature of the documents, each notebook is exported as a module of the sc_training library.

Meanwhile, since each notebook works as a separate module, I have to include some setup elements at the beginning of each document. Since this process is almost identical in most cases, I will cover it here so that I can hide it in the module's documentation.

Note: Remember that any elements I hide in these documents can still be reviewed and verified in the library’s development notebooks and the library’s exported source code.

The development of all modules starts with two elements: importing the module's dependencies and loading the development tests and sample data.

Importing the module's dependencies means invoking several libraries used by each module. The following is an example of these imports.

# development.

# These are some of the libraries of Python's standard library 
# I use to support different functionalities. 
from pathlib import Path
from pprint import pprint
from dataclasses import dataclass, astuple, field
from datetime import datetime
from typing import *

# I use the following libries to load and manage data files. 
import csv
import json

# Pymongo allows me to interact with a local MongoDB client to 
# store and manage my document-based database.
import pymongo

# The following libraries offer various utilities for efficient
# data and numeric manipulation, and for data visualisation.
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

import sklearn # Depending on the module I use different modules of
               # scikit-learn library.


# sc2reader facilitates data extraction from replay files
import sc2reader

Additionally, to develop each module, I need to load some StarCraft II replay files (marked with the .SC2Replay extension). The game generates these files to store all relevant configuration and game-play information necessary to rebuild and reproduce a match.

The replays used to test and develop this project are extracted from publicly available datasets. In no case do I access any personal or private information of the users or players. The following code illustrates how they are loaded using sc2reader and collected into variables that store sc2reader.resources.Replay objects for later processing. I store this data in the repository's test_replays folder.

Tip: One can load one replay at a time using the sc2replays.load_replay(<str_file_path>) method.

rps_path = Path(Path.cwd()/'test_replays') \
           if Path('test_replays').exists() \
           else Path('../test_replays')
game_path = str(rps_path/'Jagannatha LE.SC2Replay')
single_replay = sc2reader.load_replay(game_path)
single_replay

<sc2reader.resources.Replay at 0x1af38d6d610>

Tip: Alternatively, one can use the sc2replays.load_replays() method, which returns a generator of replay objects.

replays = sc2reader.load_replays(str(rps_path))
replays

<generator object SC2Factory.load_all at 0x000001AF3BA9BB30>

Beyond these two initial setup elements, some modules also load data files that store helpful information to organise and configure the modules and their functions. These data files are stored in the library's data folder. The code below illustrates how I load some of these files.

data_path = Path(Path.cwd()/'data') \
            if Path('data').exists() \
            else Path('../../data')

with open(data_path/'unit_names.csv') as f:
    file_reader = csv.reader(f)
    unit_names = next(file_reader)
    
with open(data_path/'changes_names.csv') as f:
    file_reader = csv.reader(f)
    change_names = next(file_reader)
    
with open(data_path/'army_list.json') as f:
    race_armies = json.load(f)

with open(data_path/'buildings_list.json') as f:
    race_buildings = json.load(f)

with open(data_path/'upgrades.json') as f:
    race_upgrades = json.load(f)

Exportable functions

In each module, I include a section where I compile all the exportable functions that respond to each module's challenges and requirements. Once exported, other modules can call these functions by importing the module that exported them in the first place.

For example, in the following code, I demonstrate how to call the get_replay_info function from the summarise_rpl module (see <<3 - Summarising Replays>>).

from sc_training.ingest.summarise_rpl import *

#Note that get_replay_info is defined in the summarise_rpl module.
replay_data = get_replay_info(single_replay)
print(replay_data)

File path:                   c:\Users\david\Documents\phdcode\sc_training\test_replays\Jagannatha LE.SC2Replay 
File name:                   Jagannatha LE.SC2Replay 
Date (datetime.datetime):    2021-02-24 02:53:06 
Duration (seconds):          590 
Game type:                   1v1 
Game release:                5.0.6.83830 
Map:                         Jagannatha LE 
Game category:               Ladder 
winner:                      2 
players:                     [(1, 'HDEspino', 'Protoss', 'Loss'), (2, 'MxChrisxM', 'Terran', 'Win')]

Another example would be the use of the ingest's sub-package inventory_replays function to gather to collect the data from a replay batch into a database. This database is setup using a config.json file which MUST BE CREATED AND STORED IN THE PROJECT'S DATA FOLDER (see load_configurations and set_up_db in <<9 - The main ingest module>>).

from sc_training.ingest import *

inventory_replays()

Inventorying replays at: test_replays\TestProfilerBatch in database TEST_library
Load complete.
153 files processed
0 files loaded
4 files ignored
149 files alredy existed

Once this function runs correctly and a batch of replays is processed and its data is stored in the database specified in the config.json file, the user can call the build_player_race_profiles to extract the race profiles.

Note: that this function only creates a race profile if the player has at least five replays played with that race in the processed batch. In the sample batch only one player fulfills this requirement, that is why only one profile is created per race.

from sc_training import *

build_player_race_profiles()

Accessing: TEST_library
1 users found in database
Generating Player Profiles
Created the following profiles
Protoss: 1
Zerg: 1
Terran: 1

References

Howard, J. (2019) nbdev: use Jupyter Notebooks for everything fast.ai, fast.ai. Available at: https://www.fast.ai/2019/12/02/nbdev/ (Accessed: 8 July 2021).
Kery, M. B. et al. (2018) 'The Story in the Notebook', in Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. New York, NY, USA: ACM, pp. 1-11. doi: 10.1145/3173574.3173748.
Knuth, D. E. (1984) 'Literate Programming', The Computer Journal, 27(2), pp. 97-111. doi: 10.1093/comjnl/27.2.97.
Literate Programming (2014) WikiWiki. Available at: https://wiki.c2.com/?LiterateProgramming (Accessed: 6 July 2021).
Ramsey, N. (1994) 'Literate programming simplified', IEEE Software, 11(5), pp. 97-105. doi: 10.1109/52.311070.

SC2 Training Grounds - Technical Specifications Document

Executive Summary

Accessing the Library's Files and Documentation

Setup

Exportable functions

References