Category: Data Analytics
Can the power of GPT-4 be leveraged to build a high-performance API? In this article we will instruct GPT-4 to build a high-performance API in GoLang using Protocol Buffers …
At the beginning of this month, at AMIS, I attended a Special Interest Group (SIG) meeting about GitHub Copilot, given by one of my colleagues. As we do so …
In the rapidly evolving world of artificial intelligence, language models stand at the forefront of technological innovation and ethical debate. Among these, OpenAI’s suite of models, including the renowned …
Quickly creating a geo-map illustration for an article. Showing off PDOK as geo data source, QGIS as geo data inspector and editor and Datawrapper to create the actual map. …
In this article I take a custom GeoJSON file that defines some 20 custom locations – the office locations for the Conclusion ecosystem – and process it in the …
PDOK – Publieke Data op Kaart – is a Dutch data service that offers 230+ geo data sets on many different aspects of the public space in The Netherlands. …
Leaflet is a great JavaScript library for creating map visualizations in web applications. Leaflet is created with that specific purpose in mind. It caters for most of the things …
Using QGIS to gather, visualize, analyze, edit, process and export Geo Data is fun. I only discovered QGIS fairly recently. And now of course I see it popping up …
In a recent investigation I stumbled on two valuable things: the free QGIS desktop tool that allows inspection, transformation and analysis of Geo data the ProRail ArcGIS Map Server …
My challenge: show Dutch rail tracks on a map. This challenge is composed of several steps: find a dataset that contains geographic data on Dutch railways convert that data …
The combination of GeoJSON data sets and the JavaScript d3.js for SVG visualizations is quite valuable for creating rich and interactive visualizations of data that is related to geography …
In several recent articles on Data Visualization using a Thematic World Map with color shades assigned to countries based on their value for a specific property in a world …
This article is a sequel to my introduction into World Map visualization (using d3, SVG and JavaScript). The previous article introduced the visualization of country specific data on a …
Data associated with countries is fairly common. Presenting such data in the form of world map that shows the countries and presents the associated data through a color is …
With the use of ChatGPT, anyone has access to vast domain knowledge. Even a generalist like me (my colleague would call me a dummy) can act as an expert …
Platys is a tool created by Guido Schmutz, architect and data specialist at Accenture and frequent teacher at various universities. In order to be able to quickly create an …
Apache Superset is an open source end user tool for analyzing and visualizing data. Dozens of chart types are available out of the box, dashboards can be created, finetuned …
Apache Superset is an open source platform for data exploration and visualization. It can be described as an open, free alternative for Microsoft PowerBI, Tableau, Qlik and Oracle Analytics …
ChatGPT is a bit of a “tongue twister” so I will speak of Cheppy. AMIS has a long history of spotting, exploring, embracing and rolling out new concepts and …
In our daily work we are dealing with data from many sources. Data in CSV files, from Cloud APIs, in mail servers, configuration files, Terraform plans, in logging systems, …
This brief article shows you the quickest way to trying out Jupyter Notebooks. It will not try to persuade you that you should try them out. You probably know …
TL;DR: Oracle offers a new cloud native, serverless service on OCI for data processing and ETL/ELT, called Data Integration. It seems a new incarnation of Oracle Data Integrator or …
Data is really important to any organization. Data tells us what the organization is doing, and where it is going. And how it can improve quality and efficiency of …
I have time series data in my Pandas Data Frame. And I want to present an aggregation of the data by day of the way in an orderly fashion …
One of the relatively new features available with Oracle Autonomous Data Warehouse is Oracle Machine Learning Notebook. The description on Oracle’s tutorial site states: “An Oracle Machine Learning notebook …
It is such a small thing. That you can look for in the docs, no Stackoverflow and in many blog articles. After I have used groupby on a Data …
The CBS (the Dutch Centraal Bureau Statistiek) keeps track of many thing in The Netherlands. And shares many of its data sets as open data, typically in the form …
In a number of recent articles, I have described how to provision an instance of Oracle Data Warehouse Cloud in Oracle Cloud’s Always Free tier. I have also described …
Oracle Cloud now offers the Always Free Tier that comes with an always free Autonomous Data Warehouse (up to 20 GB data storage) as well as an free Autonomous …
Data Sets are often too small. We do not have all data that we need in order to interpret, explain, visualize or use for training a meaningful model. However, …
Earlier this week, I presented a workshop on Data Analytics. I wanted to provide each of the participants with a fully prepared environment, right on everyone’s own laptop (and …
Even though many languages share the same or a very similar alphabet, the use of letters in documents written in these languages is quite distinct. The letter ” e” …
In this article, I analyze the race that took place in stage 14 of the 2019 Tour de France in a Jupyter Notebook using Python, Pandas and Plotly and …
In this article, I will show how to analyze the performance of Steven Kruijswijk during stage 14 of the 2019 Tour de France in a Jupyter Notebook using Python, …
This year’s Tour de France was quite a spectacle. Great performances, exciting stages, unexpected events: it had it all. Analyzing the race events as they unfolded during the stages …
Correlation is a powerful thing. When two metrics rise and fall in a similar way, surely that cannot be just coincidence. It has to be meaningful in some way. …
If data is available from REST APIs, Jupyter Notebooks are a fine vehicle for retrieving that data and storing it in a meaningful, processable format. This article introduces an …
Oracle OpenWorld and CodeOne 2018 are two co-located conferences that took place in October 2018. Some 2000 sessions presented by over 2500 presenters form the core of these conferences. …
Apache Drill is a schema-free SQL query engine. Drill supports a variety of NoSQL databases and file systems, including HBase, MongoDB, MapR-DB, HDFS, MapR-FS, Amazon S3, Azure Blob Storage, …
Data Wrangling is a crucial stage in the data science workflow. Or in any workflow that starts from raw data and hopes to achieve business insight – and perhaps …