Author: Pete Scully PhD (UK)

Find my work on Research, Technology & Science... Past highlights include @PyCon, Data Science @Lexr.AI, Agri-Tech @MSU-TH, 3D Reconstruction Light Stages for Plants (Phenotypic/Genomic Mappings) @AU @NPPC-UK, Cybernetics, Social Science + Behavioural Research + Grant Writing @MU-TH, Model Explainability, IoT Apps for Zombies, Self-Healing AI + Cyber Security @Airbus, COVID-19 Thailand's Provincial Public Data Analysis, PlayfulCoding.EU, Training SW Engineers @MorganStanley, Commercial Projects + Start-ups. To discuss or work together, I'm on email: pmdscully-at-gmail-com.

How to create a “Split-Bars” plot in Python with matplotlib?

May 20, 2024May 20, 2024 by Pete Scully PhD (UK), posted in code/science, TIL

In the datawrapper.de visualizations service a "Split-Bars" plot is available, the screenshot below is an example. The data visualization is already very informative and clearly visualizes the tabular data. However, I would like to recreate this kind of visualization plot type programmatically in Python; i.e. using matplotlib (or similar library). As far as I searched, … Continue reading How to create a “Split-Bars” plot in Python with matplotlib?

On Estimating Thailand’s COVID-19 Infections and Infection Fatality Ratio (IFR)

May 10, 2021July 21, 2021 by Pete Scully PhD (UK), posted in code/science, research, TIL

This post records the method used to create an estimate for Thailand's Infected Cases nationally and by province, by using the IFR metric, quantities of deaths in Thailand, combined with known provincial demographics to estimate Thailand's (unreleased) COVID-19 related death ages. Infected Cases differ from actual Recorded Cases in that they offer a view on whether more cases (i.e. asymptomatic) are likely to be found. This work was sparked by Matt Greenfield and Dylan Jay's discussion. Obviously, there's too much to fit in a tweet. In the article, we go through (1) how readers can directly calculate their own estimates for Number of Infected Cases from our data, followed by (2) method taken to reach the results, then (3) discussion, limitation and conclusions. There are plenty of caveats described in this article, so take a look at these before drawing conclusions.

Road Network – Agent Modelling

February 26, 2021 by Pete Scully PhD (UK), posted in code/science

A quick blog update to play with a neat agent simulation tool integrated into OpenStreetMap mapping tools with vehicle driver behaviour modeling. SMARTS - Scalable Microscopic Adaptive Road Traffic Simulator (University of Melbourne) Image credit: SMARTS - University of Melbourne Project: https://projects.eng.unimelb.edu.au/smarts/ to keep up to date (2013-2023).Citation paper and the location of the GitHub … Continue reading Road Network – Agent Modelling

Extension of Python Dict .get() – Lookup with Similarity for Built-in Libraries

February 11, 2021July 6, 2023 by Pete Scully PhD (UK), posted in code/science

This is a prospective extension to Python dict .get() that solves a common problem in data applications. The bold proposal asks whether to include such an implementation in the core language or in a library, across languages used for data processing. See what you think.. Background & Why? These days we have more data-oriented code … Continue reading Extension of Python Dict .get() – Lookup with Similarity for Built-in Libraries

Thailand Province Border Adjacency Dataset/Code

February 8, 2021February 8, 2021 by Pete Scully PhD (UK), posted in agri-tech-thailand, code/science, research, software-eng-dev

A quick update post to help get my latest project's new dataset more readily indexed on Google search, etc. (Feb 8th 2021) I've recently been working on risk assessment for COVID-19 in our 2nd wave. To create an email alert per province (taking account of local regional data) I needed to join provincial data together. … Continue reading Thailand Province Border Adjacency Dataset/Code

Towards COVID-19 Wave Risk Assessment Tool for BKK Residents: Results so far…

January 25, 2021January 26, 2021 by Pete Scully PhD (UK), posted in code/science

Dated 25th Jan 2021 (A) New Cases for Bangkok and Nearby Provinces: All data collected from Daily COVID-19 report, Thailand information [Daily COVID-19 cases reported]Data Service: https://opendata.data.go.th/dataset/covid-19-dailyLast Updated: 24 มกราคม 2564 0 New Cases in กรุงเทพมหานคร / Bangkok on 2021-01-25:25 New Cases in กรุงเทพมหานคร / Bangkok on 2021-01-24:5 Key Clusters with 21 Cases in กรุงเทพมหานคร … Continue reading Towards COVID-19 Wave Risk Assessment Tool for BKK Residents: Results so far…

ON MEASURING MACHINE LEARNING MODELS AGAINST CONCRETE BUSINESS OBJECTIVES

August 25, 2020May 30, 2021 by Pete Scully PhD (UK), posted in book-review, code/science, machinelearning, research

REVIEW NOTES: DATA SCIENCE FOR BUSINESS BY PROVOST & FAWCETT: CHAPTER 7 I enjoyed reading this chapter. It's insightful and well explained with detailed examples, diagrams and graphics, on a few data science topics that correspond directly to conventional scientific research in computer science. That makes me happy, because these are crucial points, yet rarely … Continue reading ON MEASURING MACHINE LEARNING MODELS AGAINST CONCRETE BUSINESS OBJECTIVES

TIL: Fixing ‘file not found’ dependency libraries in Linux

July 29, 2020July 31, 2020 by Pete Scully PhD (UK), posted in TIL

In Ubuntu (Debian/CentOS, and the like) apt is our go to CLI application package installer. It handles everything in a single iconic command that every Linux user knows: sudo apt install <packageName> Sometimes, and I still don't get why or when, a package's shared library (dependency) is not installed. For example, this happened today for … Continue reading TIL: Fixing ‘file not found’ dependency libraries in Linux

On Measuring the Senior, In Senior Software Engineering Roles

May 19, 2020May 23, 2021 by Pete Scully PhD (UK), posted in code/science, software-eng-dev

Labelling "Senior", "Mid" and "Junior" roles of software engineers comes up from time to time in the developer and programmer forums. While I'm not a fan of labels for people or groups of people - Seniority and Skill/Knowledge/Ability Levels get to me because they are so ambiguous. So it is down to us to contribute and discuss to reach a clear definition. A truth of seniority, across all genres, is group-wide effect. It's leadership, it's empathy, it's improving the individuals and the group as a whole for the group's common interest. It's a positive improvement. But what does that mean for Developers and Software Engineers?

Adding to the Conversation on Data Science Training: Looking into the Future

May 16, 2020May 16, 2020 by Pete Scully PhD (UK), posted in code/science

Conversation on Data Science Training: Looking into the Future -- While I have been hesitant to define the structure of data science training, and I am biased towards (as I have) a Comp Sci background. I conversely follow the doctrine that "comp sci is without purpose with no application", owing to my position of "domain knowledge is where the value is generated" (societal, financial, etc). If setting boundaries and principles in Data Science is required, I think it's the view of the future that must be settled first....