Welcome to my blog. I plan to use this site as a record of my work and my ideas. My current interests include machine learning and data science, natural resource exploration (especially geophysical methods), geostatistics, GIS, and remote sensing.
Recent Posts
Hugo Tips Part 2
Hugo Tips: Part 2 I only have 3 tips this time, but they are important:
Always always always remember to run env HUGO_ENV="production" hugo (or the equivalent on your system) rather than just plain hugo. It is possible that running hugo will write the files in development mode, which puts NOCRAWL, NOINDEX tags on every page. These tags will prevent search engines from crawling your site. For Windows, it should be set HUGO_ENV=production.
read more
Well Logs Part 2
This is a continuation of my first post on the SEG Facies Classification competition.
In the previous post I left off before implementing a baseline learner to see what results we might expect.
Baseline Learner After Witten, Frank, and Hall’s Data Mining, I use depth-1 decision stumps and shallow decision trees as baseline learners. Decision trees are easily interpretable once plotted. They also give us a sense of feature importance, since the greedy algorithm will split the more informative features earlier in the tree.
read more
Buffalo Civic Innovation Challenge: Introduction
I recently competed in the Buffalo Civic Innovation Eco Challenge. We finished “in the money”, earning a prize from ESRI for best use of mapping technology. The competition was covered in the local paper!
While it is neat to get my name in the Buffalo News, the writeup did not include any detail on our project. I built a web application, Lead Free Buffalo, that has some useful information on lead contamination of drinking water and a couple visualizations of some lead testing performed across the city.
read more