Category: Data Analytics
Two weeks ago I went to PyData Amsterdam which kicked off with a tutorial day. These tutorials included a workshop on creating multi-modal LLM agents, advanced web-scraping techniques and …
Explore key lessons in implementing a Retrieval-Augmented Generation (RAG) system, balancing innovation with practicality for enhanced AI responses.
Running Large Language Models (LLMs) locally offers enhanced privacy, independence, cost effectiveness, and unrestricted use. This guide covers tools like LM Studio and Ollama for setup and hands-on learning.
Can the power of GPT-4 be leveraged to build a high-performance API? In this article we will instruct GPT-4 to build a high-performance API in GoLang using Protocol Buffers …
At the beginning of this month, at AMIS, I attended a Special Interest Group (SIG) meeting about GitHub Copilot, given by one of my colleagues. As we do so …
In the rapidly evolving world of artificial intelligence, language models stand at the forefront of technological innovation and ethical debate. Among these, OpenAI’s suite of models, including the renowned …
Quickly creating a geo-map illustration for an article. Showing off PDOK as geo data source, QGIS as geo data inspector and editor and Datawrapper to create the actual map. …
In this article I take a custom GeoJSON file that defines some 20 custom locations – the office locations for the Conclusion ecosystem – and process it in the …
PDOK – Publieke Data op Kaart – is a Dutch data service that offers 230+ geo data sets on many different aspects of the public space in The Netherlands. …
Leaflet is a great JavaScript library for creating map visualizations in web applications. Leaflet is created with that specific purpose in mind. It caters for most of the things …
Using QGIS to gather, visualize, analyze, edit, process and export Geo Data is fun. I only discovered QGIS fairly recently. And now of course I see it popping up …
In a recent investigation I stumbled on two valuable things: the free QGIS desktop tool that allows inspection, transformation and analysis of Geo data the ProRail ArcGIS Map Server …
My challenge: show Dutch rail tracks on a map. This challenge is composed of several steps: find a dataset that contains geographic data on Dutch railways convert that data …
The combination of GeoJSON data sets and the JavaScript d3.js for SVG visualizations is quite valuable for creating rich and interactive visualizations of data that is related to geography …
In several recent articles on Data Visualization using a Thematic World Map with color shades assigned to countries based on their value for a specific property in a world …
This article is a sequel to my introduction into World Map visualization (using d3, SVG and JavaScript). The previous article introduced the visualization of country specific data on a …
Data associated with countries is fairly common. Presenting such data in the form of world map that shows the countries and presents the associated data through a color is …
With the use of ChatGPT, anyone has access to vast domain knowledge. Even a generalist like me (my colleague would call me a dummy) can act as an expert …
Platys is a tool created by Guido Schmutz, architect and data specialist at Accenture and frequent teacher at various universities. In order to be able to quickly create an …
Apache Superset is an open source end user tool for analyzing and visualizing data. Dozens of chart types are available out of the box, dashboards can be created, finetuned …
Apache Superset is an open source platform for data exploration and visualization. It can be described as an open, free alternative for Microsoft PowerBI, Tableau, Qlik and Oracle Analytics …
ChatGPT is a bit of a “tongue twister” so I will speak of Cheppy. AMIS has a long history of spotting, exploring, embracing and rolling out new concepts and …
In our daily work we are dealing with data from many sources. Data in CSV files, from Cloud APIs, in mail servers, configuration files, Terraform plans, in logging systems, …
This brief article shows you the quickest way to trying out Jupyter Notebooks. It will not try to persuade you that you should try them out. You probably know …
TL;DR: Oracle offers a new cloud native, serverless service on OCI for data processing and ETL/ELT, called Data Integration. It seems a new incarnation of Oracle Data Integrator or …
Data is really important to any organization. Data tells us what the organization is doing, and where it is going. And how it can improve quality and efficiency of …
I have time series data in my Pandas Data Frame. And I want to present an aggregation of the data by day of the way in an orderly fashion …
One of the relatively new features available with Oracle Autonomous Data Warehouse is Oracle Machine Learning Notebook. The description on Oracle’s tutorial site states: “An Oracle Machine Learning notebook …
It is such a small thing. That you can look for in the docs, no Stackoverflow and in many blog articles. After I have used groupby on a Data …
The CBS (the Dutch Centraal Bureau Statistiek) keeps track of many thing in The Netherlands. And shares many of its data sets as open data, typically in the form …
In a number of recent articles, I have described how to provision an instance of Oracle Data Warehouse Cloud in Oracle Cloud’s Always Free tier. I have also described …
Oracle Cloud now offers the Always Free Tier that comes with an always free Autonomous Data Warehouse (up to 20 GB data storage) as well as an free Autonomous …
Data Sets are often too small. We do not have all data that we need in order to interpret, explain, visualize or use for training a meaningful model. However, …
Earlier this week, I presented a workshop on Data Analytics. I wanted to provide each of the participants with a fully prepared environment, right on everyone’s own laptop (and …
Even though many languages share the same or a very similar alphabet, the use of letters in documents written in these languages is quite distinct. The letter ” e” …
In this article, I analyze the race that took place in stage 14 of the 2019 Tour de France in a Jupyter Notebook using Python, Pandas and Plotly and …
In this article, I will show how to analyze the performance of Steven Kruijswijk during stage 14 of the 2019 Tour de France in a Jupyter Notebook using Python, …
This year’s Tour de France was quite a spectacle. Great performances, exciting stages, unexpected events: it had it all. Analyzing the race events as they unfolded during the stages …
Correlation is a powerful thing. When two metrics rise and fall in a similar way, surely that cannot be just coincidence. It has to be meaningful in some way. …
If data is available from REST APIs, Jupyter Notebooks are a fine vehicle for retrieving that data and storing it in a meaningful, processable format. This article introduces an …