
Among the many things AI has enabled, it has now made possible an entirely new method of interacting with our computers.
Introduction
I have long believed that we are on the precipice of changing the fundamental way in which humans will interact with computers. We are stepping into an age where we are going to simply be able to tell our computers the goals we want to achieve and they will figure out the steps in order to achieve it and then execute it for us. But this powerful idea is founded on the notion that your computer will know what there is to know about you, including your preferences, your data and whatever else you can supply it. But that brings up many new questions regarding privacy, data ownership and data residency. But don’t worry, I’m not here just to whine, (though I will), but I also come bearing a solution and even ways for you to participate in it.
Background
To begin to understand the importance of what we’re currently living through, it is important for me to provide you a bit of background about three aspects of this industry.
First, I want to point out the evolution of computing from the perspective of how much effort it takes to have your computer do what you need it to do, the effort has been reducing over time. We’ve gone from writing our instructions on punch cards to writing programs using almost english sounding programming languages like BASIC or Python. After that, the evolution plateaued until the birth of LLMs. Now, we can speak to them in natural language and have them do things for us.
The second progression that I want you to pay attention to is the number of things a particular program is capable of doing. You see, at first you needed to employ programmers in your office to write you custom programs because there was just no other way to having your company use the efficiency that computers could bring to your organisation. Post that phase, applications were available to people to just download, install and use. Now app developers who were competing for the user’s dollars, either built apps to serve a niche audience or started building super-apps that could do everything. The latter group quickly ran into the limitations of the Graphical User Interface (GUI) because there was only so many things you can pack into a screen and still be user-friendly. This was the case, again, until LLMs. Now you had computers that could do anything, even write programs to do specific things.
Thirdly, apps have been very specific solutions to a problem. As such, they’ve been very brittle as you could only ask the app to do certain things and that’s it. Even when speech technology was applied, you still spent time learning the phrases that you could use a la, “Hey Google, turn my lights on in the living room”. If you said that phrase differently than it had been setup with, for example, if you said, “Hey, it’s too dark in the living room” – you couldn’t expect your app to understand that phrase. This hurdle again has been overcome by LLM’s because they can understand your meaning past your words. For example, you could say “I’m thirsty”, “I’m parched” or even be euphemistic like “My mouth feels like the Sahara desert right now” and it would still get the meaning you want to convey. From a UX perspective, this ability for technology to fit into the life of human beings seamless is what I find most powerful.
So we are at the threshold of these three progressions having come to a head at once. This means that not only are our computers going to be super capable going forward, it’s going to be incredibly simple to use and no one has to spend any time even learning them. Can you imagine what an unlock this is for society? And this is the first time in history that the benefits of this technology is not just going to those of us who grew up with technology in our lives and we’ve found it comfortable because we spent the time learning them, it’s going to be a massive unlock for those that were so far left behind the technological divide or even the english-language. These include people like my parents who find their smartphones challenging to use.
This, is huge.
Context is Everything
For AI systems to go from just being effective on the baseline (based on what it’s learnt about humans as a whole from the internet) to doing things for you the way you prefer it be done requires understanding your context. They need to be able to study your behaviours and patterns in order to be able to emulate them. This may mean giving the AI access to not just your emails, calendars, contacts, photos but even allow it to watch your health data, medical records, your passwords, etc.
That’s when they can go to a place where they can understand that you prefer to say “clutch” over “cool” or like to order from this particular Chinese restaurant and the other one is there as a backup or that you prefer your screen font size large because you have trouble reading or even to cancel all your appointments automatically if you woke up unwell one day. It can figure all this out without you ever having to tell it these things explicitly.
But AI isn’t just going to be a useful assistant to have, it will soon become essential. We live in a world where information is being produced at unprecedented rates and we feel compelled to keep up. But we are reaching the limits of human capabilities and that’s just going to continue to be the case. There will be hundreds of decisions that we can delegate to AI that’s working on our behalf, the least of which is deciding which email is spam and which isn’t. If you want to participate in an accelerating world, having an AI assistant will become a necessity in the same way having a smartphone today is seen as a necessity.
So, the question is, where do you get your AI assistant.
Shop Around
If there’s one thing we’ve learnt from the mistakes made by the social media decade is that if the service is free, you are the product. We’ve also seen the many ways data about users have been used against them. To most readers of this blog, I don’t need to list the kinds of things these large technology companies learn about us just through the ways in which we shop or even the intonations of our voice. We’ve seen how effective and accurate they have been, including stories about products being suggested to dads of teenagers who were pregnant, even before the dad knew about that pregnancy!
I would never trust these corporations with my context data, especially when that data could reveal infinitely more information than my social media profile ever could.
Let me be clear, I don’t think these corporations are necessarily evil. I find it hard to imagine anyone waking up every morning with the sole intention to destroy other people’s lives. But the incentives of corporations are definitely not aligned to serving the user. They are beholden to their shareholders and are mandated with generating profits for them. This mechanism doesn’t allow anyone to see the user as anything more than a resource to derive value from. If value can be generated by selling their data to the highest bidder, so be it.
Given this equation, let’s suppose you go ask a question about a certain type of cancer to the AI assistant from one of these large corporations or submitted a scan to this assistant with the intention of getting a second opinion. What if that AI assistant saw that and realised that your insurance company may be willing to pay a huge amount for that information because they could loose a lot more if they continue to cover you if you fell ill. Do you think the assistant would be programmed to act on your behalf or to generate a profit for the corporation?
The good news is that we don’t need to rely on these corporations. The open source world has realised these pitfalls and have been hard at work developing technologies that will ensure that this kind of a future doesn’t need to befall us. I also realise how important these efforts are and have been contributing my time and efforts to such projects as well.
The Solution is Simple
While these big companies that own the frontier AI models want to portray themselves as the only option for this sort of intelligence, don’t be fooled. Know that you can have absolutely great stuff available to you without any of them in the picture. You can retain complete ownership of your data while still getting all the convenience. You can do all this locally, in the privacy of your own computer. You don’t have to learn prompting strategies either.
You can install Ollama and Move 37 on your machine and you’re good to go! All of it is completely free and you can even check and audit the code if you’re so inclined. Granted it’s a little bit of work to get this installed right now, but they are all making things easier all the time.
If you’d like to see a demo of this, I’d be happy to create one. If you want to find out how to install this also, I can create another video about it. But for now, here’s a very quick walkthrough.
[COMING SOON]
As mentioned, these two projects are available for free, from these locations. So go check them out:
Conclusion
If you believe in an open future where individuals retain control over their own destinies, you can start using these technologies for yourself. But if you feel like you’d want to contribute more and be a part of these movements, there are many ways you can participate depending on your skills.
If you’re technically inclined:
- Move37 is looking for help in testing, verifying the code completeness and various other items listed in their roadmap.
- UX designers can propose new ways to organise the frontend to make things easier and more efficient.
- The installation process could use a lot of simplifying and that’s something that you can contribute to, especially on Move37.
If you’re not an engineer:
- Use and provide feedback and spot bugs
- Spread the word and get more people to realise the problem and understand that there are solutions
- Create videos talking about the projects
- Add new recipes – the steps for using various applications
- Suggest new ideas and features that can be built
- Star the projects on GitHub so that they get more visibility
- Just donate to the projects if you’ve got the money, they can use it.
But whatever you choose, thanks for being a part of this! Until next time, ciao!