Category: Uncategorized

  • I want to know where I’m spending my money. Can I?

    Why? Because I want to budget so that I can spend more where I can and cut down where I have to.

    Simple, right?

    Before we start, let’s go through the lay of the land. I have one bank account (HDFC) & two credit cards (Amazon ICICI & OneCard) from where I spend.

    I invest through Zerodha, but tracking investments is a whole new ballgame and I don’t want to get into that.

    Now about spends. Pretty simple setup, whenever salary hits my DBS account (which I’m trying to retire), I transfer it to HDFC and run all transactions out of the HDFC. Depending on offers, I spend money on the credit cards if it makes sense & usually pay the bills in under 20-25 days.

    About the transaction mediums

    • UPI for day to day: 80% of my bank statement is UPI spends (I’ve crunched numbers btw). Range from >5 to < 5000 INR usually.
    • NEFT & RTGS: for cross account transfer
    • ACH (Automated clearing house transactions): For all the auto deducting subscriptions, mutual fund transfers etc.. volume wise these are less transactions but amount wise they are significant.

    First try: Account aggregator

    Account aggregator in principal is awesome, one singular place where all account information is available.

    But that’s the catch, it is available. But not to you. It is available to companies to profile and sell their business.

    I have nothing against it by the way, making money is important. I would have loved if some company offers a paid API to pull transaction data so that I can setup everyday sync with the budgeting platform of choice.

    This is not a groundbreaking idea, Plaid is a company which does it in the US, a lot of other apps like YNAB use for syncing transactions. But no company in India provides this as a service. And interestingly no company is solving for budgeting yet in India. Why not?

    Since I don’t have access to my own data neither through API nor through CSV exports in the Account Aggregator ecosystem, on to the next try.

    Second try: Bank statement parsing

    Every bank worth its salt gives a CSV export, Right?

    Not really (for example Jupiter money does not), but some banks do. For example HDFC does!

    You can download the last five years bank statement in multiple formats ranging from CSV to excel to MS Money..?

    Anyways, we need to download “delimited”, which is “delimited” by a comma. Nice. But on downloading you see a txt file…? How did that happen?

    Well because HDFC only said it’ll share a delimited file, it did not say its a csv file. Right? so the text file is actually a csv file. You’ll just have to rename the extension.

    On open, the first line is empty.. that’s not a deal breaker. But then you look at the timestamp, which is impossible because there is no time stamp in the export file. Only a date. So technically there is no ordering of transactions.

    I don’t understand the rationality of these technical decisions for some reason. I’m sure the server is reading off a database, I’m sure it is storing the timestamp for reconciliation exercises. Why not share it? Is it because non tech people get scared by looking at timestamps? I don’t honestly know, anyways moving on

    Now that the CSV piece is sorted, I quickly wrote a python script which parses and transforms the data into a simpler CSV. And then added another step to pipe it into Actual budget. My budgeting apps of choice.

    I wanted to develop a pipeline where I can forward the bank statement to a telegram app & it parses the files and stores it in the budgeting app of choice. Because the whole notion of doing this every week on a desktop seems hectic to me.

    So I open the HDFC mobile app, and go to the statement section.

    On clicking “Request for a statement”, I realize that HDFC mobile does not have the option to generate a “delimited” statement. Only excel & PDF. WHY?

    Alright.

    I’ll write another parser for excel then, since its the only thing available for both mobile and desktop. Not a big deal, since the core logic is the same.

    Side quest: Extracting data from narrations

    See, Banks don’t give you what you want. They give you whatever they feel like. In a statement, it would be nice If I had columns as such

    • Timestamp
    • Payee
    • Credit/Debit amount
    • Mode
    • Payee identifier (could be bank account : IFSC, UPI handle or none for something like ACH)
    • Reference ID (Something along the lines of UTR)
    • Narration (for description/notes)

    All of which is nicely laid out in their database I’m sure.

    Payee context is important because If I have 50 transactions of swiggy, I need some way of saying that all of them are from me, the individual. To swiggy, the company without doing sub-string match kung-fu. I cannot really budget if I don’t know whom I’m paying to through my month.

    But no, life can’t be that simple in Banking.

    Banks just share

    • Date
    • Narration
    • Amount
    • Ref Number (UTR ID)

    and that’s it! Now its my job to do sub-string match kung-fu and extract data!

    What the **ck! (I was saying “what the heck” there by the way, messed up mind you got)

    There is a way though, at least for HDFC narrations all UPI narrations start with UPI and delimited by – have the payee name, IFSC and narration. Similarly for NEFT and RTGS.

    Sidenote, if you think the Ref number will be unique for all transactions. That’s not true. Some reversed transactions can have the same UTR (Unique transaction reference). One more thing to mention, if you have a mutual fund auto deduction or auto sweep setup for FD, then the reference ID can either be 0 (on CSV) or 00000000000 (on excel). They could have just let it blank, why have a variable length string of zeros?

    Anyways. Moving on..

    After dealing with all this, I wrote a parser in python through which works pretty well. So now I have payee information along with the narration. This later on becomes very useful since I can say that everything from Swiggy is food in my budgeting app.

    But then, 80% of my transactions are UPI, right?

    And all of them are small size payments done to local vendors. Normal people who don’t have a commercial entity… How do I know that I paid 250 to an auto guy for travel when 20 days back if the narration says “sidappa@ybl”?

    Another side quest: Adding category context to UPI transactions

    One solution is to type out some context in the UPI transaction notes section. say for example, enter “travel” when you are paying for auto and “groceries” when you are buying some vegetables on the street.

    But the overhead of typing is pretty irritating.

    So I thought I will make a react native app in which you can scan a QR code, put the amount, click on a category icon and click next. Meanwhile this app will generate a brand new UPI link with all this context and open it in your preferred UPI app of choice, with the description and amount pre-filled.

    I’ve added one extra click to your user experience, but you are permanently putting context about transactions into your bank statements. That context is golden for budgeting.

    I’ve actually built this out in about 2-3 hours, I’ll make it open source very soon. Generated an APK, checked it on a friends phone. It works! But it does not work all the time. Only merchant payments seem to work..

    So basically, I scanned a QR code through the app. It made a new app link (something along the lines of paytm://pay...) and this opens paytm. The description & amount is as expected and right, but then after entering the PIN, the transaction fails stating “Risk reasons”.

    On further research, I found out there is a parameter called signature. And it needs to be generated via authorized sources. normal links will not work.

    Apparently, this is a known issue. And as of now without integrating with phonepe, cashfree or the likes, there is no way to go about opening UPI apps from my app. 🙁

    I cannot even type the context manually since these QRs sometimes come with default transaction note, which cannot be over-ridden.

    My current setup

    Every weekend I block 15 minutes to

    • download the “delimited” bank statement from HDFC website, rename it to csv
    • use my script to extract payees where-ever possible, and generate a clean version of CSV with columns of my choice
    • send it through a data importer so that they get populated in my budgeting app of choice

    With that, every month end I know how much I spent where. Also every once in a while I look at how much I’ve allocated for myself this month and see if I’m exceeding that. If yes, I cut down.

    So the setup works, but it sucks. There is no seamlessness at all. It’s riddled with manual operations. I don’t like it, but we don’t have Plaid or something similar in India.

    My ideal setup is a platform that allows me to do all this in an app, right from setting budget per category per month. Allowing UPI transactions with tagging, rules to classify transactions & reporting.

    Until then, this is our best bet.

    Conclusion

    Thanks for coming along the journey, if you liked this post feel free to share it with your friends too. If you feel like there is a better way to go about it, message on telegram!

    That’s about it for this time, hopefully there will be volume two which is not so long and much more cleaner 😉

  • Building for Ourselves: The Beauty of Personal Software

    Chefs come back home and cook for their family and friends. They look forward to the cooking on the weekend, the “specials” with great excitement. It is a place where there is love & gratitude. No pressure to deliver award winning recipes, just a good meal with jokes and banter. I think there is something beautiful in that.

    A lot of Software engineers seem to have gotten into this mentality where things have almost always become about scale. Scale in terms of users & scale in terms of feature set.

    Once in a while it is good to just build for ourselves, to build for friends and family. Something which is personal, predictable, customized. Something which is minimal & clean. Not after money, not after popularity & fame, not after hype cycles. Just for us.

    With the state of things in open source and AI, it’s much more easier to build home grown software. Engineers should get back to building things for themselves & friends. There is no guilt in building something which is just being used by a handful of people. And it is a testament to the beauty of the mature structure and ecosystems we have around us.

    Inspired by Robert Sloan’s beautifully written post called “An app can be home-cooked meal”

  • Extracting data from Indian Bank Statement exports for importing into Budgeting apps

    I’m into budgeting and tracking expenses. I think a lot about personal finance, have written my thoughts earlier too. But the thing is that all tracking operates on data. And getting data out of the silo in a structured format is pretty hard.

    Account aggregator is supposed to be solving for this. But each vendor in turn again locks data inside their silo and make their app more compelling. Until and unless we build plug and play pipelines we can never really do well in terms of developing personal finance management apps which make sense I think.

    I’ve recently started using Actual Budget for budgeting for me and my fiancee. Its a very nifty software which can be self hosted. Now the challenge is to get data from the bank to Actual.

    Figuring out how to get data into Actual

    Here is the idea. HDFC thankfully gives a CSV export option of their bank statement. If I can figure out how to get this data into actual using a library like ActualPy, then everything is sorted.

    Note that I’m using HDFC account, based on your bank these details can change (or not!)

    Parsing the CSV file

    So step one is CSV parsing. Some catches during the process.

    • Some lines in the CSV file have more than the required 7 columns. usually because the narration has a comma in it.
    • Some times, the ref number / UTR field will have zero. A lot of software relies on a unqiue identifier. In this case, its better to combine narration, date and hope it stays unique. There is technically a possibility it won’t though.

    There is one catch, Actual works heavily on payee information.

    Parsing Payee Information from the narration

    What does this mean? Here is a sample narration from the bank statement

    UPI-ZOMATO LTD-ZOMATO-ORDER@PAYTM-PYTM0123456-401537805904-ZOMATO PAYMENT                                                

    This seems gibberish, but this is loaded with information. Delimited by `-`, the first segment is the mode of the payment. the second is the payee name, third is the UPI ID, fourth is the IFSC, fifth is something I did not really get. I’m guessing the last is the note sent to the UPI network.

    There seems to be a catch. Sometimes to fit in information, the payee name is truncated. Not really sure in what exact circumstances the bank takes a call to truncate the payee name. Need to look at more data and figure that out. Similarly, NEFT & RTGS also follow a similar structure. So, parsing the payee information is actually fairly convenient.

    With this, more than 80% of the transactions would be covered. But then there are other important ones

    ACH narrations, ACH stands for automated clearing house. If you happen to have a recurring eNACH, or a mutual fund has to post returns back to your account you’ll incur a ACH line item in your bank statement. ACH narrations are interesting because I’ve seen multiple variations in my bank statement itself. Usually it is ACH followed by a C if its a credit or D if its a debit. then after a hyphen, there is text explaining the transaction followed by an ID of sorts (I guess?). Need to look into this more

    Here are some examples

    ACH C- SYMPHONY LIMITED FV-SYM0INT02024W
    ACH C- ITC LIMITED-2643579
    ACH D- INDIAN CLEARING CORP-H5GGPYWT2R6R
    ACH C- IRFC LIMITED-TAX FRE-2418947
    .ACH DEBIT RETURN CHARGES 230124 230124-MIR2403472992640
    ACH C- LTIMIN INT 2024 25-354851

    Currently I’m not setting a payee for these transactions in my personal setup.

    Finally, The Actual Budget Importer

    This is a streamlit app currently hosted in my scripts repo. It takes in csv bank statement of HDFC & posts data to your actual server.

    A simple, no BS application which takes in a csv file, the account you are targeting in the server and just posts transactions.

    It takes care of de duplication, payee parsing and other quirks regarding commas in narration etc..


    With this, data in actual is finally making sense. Now budgeting on top is going to be simple finally!

  • Lessons learnt while developing apps with Streamlit

    Streamlit is a framework in Python meant to turn data scripts into web apps in minutes. If you think about it, many apps are fundamentally data scripts first and web apps next. There are many scripts I have which would work better as simple web UIs.

    I’ve been exploring how to get from idea to deployment quickly for a while. Of course, the answer is going to be slightly nuanced, but this approach works well for many simple and/or internal apps.

    I recently launched CleanMail, which was actually a simple CLI tool back in 2019 called mail-sanitizer. However, it was using Google API credentials and was fairly inaccessible. Now I’ve moved it into a Streamlit app and deployed it on my Coolify instance, and people are using it frequently. I’ve also started migrating my scripts over to Streamlit since it’ll be easier to manage.

    Here are some lessons I’ve learned while developing apps in Streamlit:

    Embrace Forms

    State management is hard, especially when there are no frameworks to rely on (like Redux, etc.). There is a lot of temptation to have independent elements in the UI and to manage state by hand. This will almost always end up being a bad idea.

    Forms also control re-renders, and manual state management usually ends up being a pain in terms of the dreaded “Running” state, which kills the UX for the end user.

    Minimize State to Manage

    Forms help with this, but since there isn’t a lot of tooling around Streamlit right now, it’s important to minimize potential data present in state. Managing state and especially dependent variables in state is very prone to bugs.

    Make Use of Fragments

    Re-renders are time-consuming and sometimes irritating. Making use of the st.fragments API means that we can control the effects of state with more granularity. This becomes very important as the app scales up in logic.

    If you haven’t used the st.fragments API, please do read about it. It’s quite handy.

    LLMs for UI

    Once all the core logic is in place, asking Aider to come up with an initial draft of UI is extremely useful. The first draft is surprisingly good and almost always close to what you would want in the end version.

    Missing Components in the Ecosystem

    Streamlit is awesome for a particular set of apps, but there are some critical components missing in my opinion. These are ordered by importance in my perspective

    1. Library for easily installing analytics like plausible, clarity etc..
    2. Libraries to simplify state management
  • Review – Open Street Maps

    FYI, When I say OSM here, I mean the ecosystem of apps along with the core project. Not just open street maps website.

    Let me start off by saying that Open Street Maps is a fantastic project helping a lot of people & organizations day in and day out. There is no doubt about that at all.

    This is intended to be a review as a normal user & viewed from an angle of it replacing google maps eventually. I do understand OSM is a database first, but for the ecosystem to mature we’ll need a lively data source.

    I’ve also been contributing to OpenStreetMaps, all the while pondering over how useful my contributions are (very similar thoughts to this post). As a contributor, to drive contributions it would be great to see a wall of fame for OSM.

    But anyways, here are my thoughts on the ecosystem of OSM as a consumer looking for an alternative to google maps which uses OSM

    If you are an iOS user, I would recommend using Organic Maps. Other alternatives include OsmAnd & Maps.me. If you are into editing the maps, I would recommend EveryDoor or Go maps!!

    Extremely sparse Point of Interest data

    The primary use-case as a maps user for me is to look up places like cafes, restaurants, pharmacies, ATMs, hospitals, parks etc..

    Depending on where you are, this data is drastically out of date. Even popular places & parks are completely missing. This is mainly because OSM is community driven & there is not a lot of community in a particular place. This sucks from an end user perspective because this is the whole point of a maps for them. Roadways are also equally, if not more important but OSM seems to have good data around this.

    Search needs a lot of improvement

    This is probably the Achilles’ heel of OSM. In no app did I see a nice search bar with autocomplete ordered by relative distance popping up.

    For example, from Andhra pradesh. If I’m searching for Catamaran (A cafe in allepy, because I just want to check open hours and call them up for some questions)

    • Organic maps: first result is Catamarca in Argentina
    • OsmAnd: no results
    • Maps.me : first result is Catamarca
    • Nominatim (the search powering openstreetmap.org) : One hit, the cafe. Bingo!

    Some more examples, everyone is used to searching for “Cafes near me” or “ATMs near me”. None of the above apps show any ATMs around me if I run the same query.

    This right here is what drives me nuts, consumer apps rely on search. Having a bad search engine is the single worst problem to have.

    Missing user reviews, ratings with photos

    This is not currently in OSM’s purview, understandably so. But there should be some integrations for apps like Organic Maps so that the quality of the place is evident. Say for example for restaurants, cafes etc..

    A good web UI

    A lot of times, I plan for trips and look up places on web. There is no good web UI for browsing OSM as far as I know.

    Conclusion

    I really want OSM to take off, I ran into OSM after knowing that Namma Yatri uses it extensively. Open data is core infrastructure and I strongly believe this should be open source.

    But if someone in the ecosystem does not solve for this, we’ll never see the full potential of open mapping systems.

    Talk is cheap, solutions?

    Thanks for reading this far, I hope you see all this in good faith. I have a membership for OSM and now have a recurring donation. That’s my contribution on the financial side. Now let’s talk about the data side of things.

    I believe more POI data is extremely important for more OSM adoption. I’ll start tackling that set of problems. Once the POI is setup, apps like every door, street complete and many others will start showing it on the app and people can start populating more granular data.

    Here are some things I’m planning to work on or started working on

    • OSMRocket (alpha): a small app to take in plain text info on lat long, open hours, amenity from people and convert it to OSM tags setup. I’ve used this to add more than 50 places in my city. I found it pretty useful.
    • Scrape places with more than 100 reviews from google maps and make sure they are present on OSM. I have a script for this already, will furnish this as a project soon.
    • A good self host able search engine with proximity for OSM. Similar to OSM nomination. something which supports plain text also so that atms near me has results and is lightweight. Maybe host it and give it as a free service to organic maps etc..
    • An expo app which lets people click locate, speak into the mic and generate a osm card for poi using AI assistance. When pressed okay, adds it to a stack. Which will be later committed as a change set into OSM. A natural extension of OSM rocket.

    What would be really be beneficial to the project. Most of the ideas here need funding, but solving anything at scale requires some capital investment

    • Google’s exhaustiveness comes from the vehicles which go around with high accuracy gps attached to 360 degree cameras and LIDAR for depth sensing. This data is later used to track roads, point of interest data and many other things. This is probably the only way to add seed data at scale. Once this exercise is done, people can later enrich data further. This is a high capital investment, but once data is in place it’ll prove to be very useful
  • Prompts I expect to work in a voice assistant

    Apple intelligence is slowly rolling out, meanwhile people are building their own versions of voice assistants similar to Alexa & Siri. I think its a good time to think about what prompts we can expect to work in a basic voice assistant, this might serve as a good test case repository to see how voice assistants perform.

    This is excluding very basic prompts like

    • set a timer for 15 minutes
    • remind me to switch off the geyser after fifteen minutes

    and many more. These are already well supported by most of the voice assistants out there.

    Prompts

    • Set an alarm three hours before the flight departure tomorrow
      • Should read the calendar, figure out that a meeting invite for a flight is set to 9am in the morning. Set an alarm at 6AM.
    • Mark schedule visit to dentist as complete
      • Should query reminders, find out what’s the reminder about the dentist and mark it complete

  • Speed Typing

    Speed typing is the idea of working and improving on typing speed. It is said that an average person types around 40 words per minute (or WPM for short).

    The reason I want to type fast is because I write a lot of blogs/notes. I don’t mind programming with a slower WPM since anyways that’s not the bottleneck. But during blogging, I would prefer if I can type at the pace of my thoughts.

    People say typing speed does not matter in programming. I think that is not entirely true, if you are above 60 wpm I think you are good. If not, typing should not be a hindrance to think and iterate for programming is what my I think.

    The journey to 100 WPM

    My current baseline is around 50-70 WPM, depending on the tool. On KeyBR, I was told “Your all time average speed beats 50.19% of all other people. Average speed: 51.7 wpm”

    Now on how to improve from here

    Tools to practice with

    There are a couple of categories here

    Games

    The most fun ones

    • ZType: This blew my mind. You basically type words and on a word press, one enemy ship goes down. So basically words are ammo. 🤯
    • TypeRacer: Fun game to play against friends

    Practice tools

    • MonkeyType: My current favorite. Great UI, lot of customization. Highly recommended.
    • KeyBR: Ruthless, long winded. probably the most useful since it says which keys I suffer with.
  • Protocols – Health

    Heads up: Heavily WIP. Still figuring it out.

    • Get a full body health checkup done every six months. Observe trends on all key metrics.
    • Go get a routine ENT & dental checkup every year
    • Exercise at least for 2% of all time
  • Guide: Setting Up Home Assistant with HomeKit Bridge while using RunTipi & Tailscale

    Here is the gist of how all this works together

    • Home assistant needs to run on the local network accessible so that devices can ping and communicate with it
    • Homekit integration will be using multicast DNS over 21063 (mDNS for short) to establish a homekit bridge between the server on the network & iPhone.
    • Since both Tipi & HA run in docker, we’ll need to install mDNS repeater on Tipi and use it to forward mDNS packets from HA.

    It might look like a lot, but I promise it’ll be pretty straightforward. Feel free to contact me on twitter if you run into any issues!

    Step 1: Configuring Home Assistant

    First, we need to make sure Home Assistant is properly configured to work with HomeKit:

    1. Open your Home Assistant configuration file (configuration.yml this is situated at runtipi/app-data/homeassistant-1/data/config/configuration.yaml).
    2. Add the following HomeKit configuration:
    homekit:
      name: Bridge
      port: 21063
      advertise_ip: "192.168.0.101"  # Replace with your Home Assistant IP
    1. Save the file and restart Home Assistant.

    Step 2: Setting Up mDNS Repeater

    To ensure proper communication between HomeKit and Home Assistant, we need to set up mDNS repeater:

    1. Identify your network interfaces by running docker exec <container_id> ifconfig , here the container ID is the ID of the docker container of Home assistant
    2. run route -n to figure out what’s the network name of the IP address you got above
    3. Install mDNS repeater app from runtipi appstore. Start the mDNS repeater with mDNS repeater app with the host interface (e.g., enp1s0) and the Docker network interface (e.g., br-fd25fefeed1f).

    Step 3: Configuring RunTipi

    Now, let’s configure RunTipi to expose the necessary ports:

    1. Navigate to your RunTipi configuration directory: ~/runtipi/user-config/homeassistant-1/
    2. Edit the docker-compose.yml file:
    services:
      homeassistant-1:
        dns:
          - ${DNS_IP}
        ports:
          - 21063:21063
    1. Save the file and restart the Home Assistant app on RunTipi.

    Step 4: Setting Up HomeKit Integration

    1. In Home Assistant, go to Configuration > Integrations.
    2. Add a new integration and search for “HomeKit”.
    3. Follow the prompts to set up the HomeKit integration.
    4. Once complete, you’ll see a QR code or a pairing code.

    Step 5: Pairing with iOS Device

    1. Open the Home app on your iOS device.
    2. Tap the “+” button to add a new accessory.
    3. Scan the QR code or manually enter the pairing code provided by Home Assistant.
    4. Wait for the connection to be established. This may take a moment.

    Troubleshooting

    If you encounter issues:

    1. Ensure all ports are correctly forwarded and not blocked by firewalls.
    2. Double-check that the mDNS repeater is running correctly.
    3. Verify that the HomeKit integration in Home Assistant is using the correct port (21063).
    4. If using Tailscale or other VPN solutions, ensure they’re not interfering with local network discovery.

    Conclusion

    By following these steps, you should now have Home Assistant successfully integrated with HomeKit through RunTipi. This setup allows you to control your smart home devices using Apple’s Home app and Siri, providing a seamless experience across your Apple devices.

    Special mention & Massive shout-out to jigsawfr from Runtipi discord for helping me set it up for the first time!

  • One time purchase licenses for software

    Initially software was bought and sold in CDs. If you bought a copy of windows 7, you’ll get a windows key along with the CD. And you could use the key along with the CD and you owned the software. No recurring subscriptions etc.. Later on if Windows 8 comes up, you would have to go and get the upgrade.

    The reason this is great is because there is one simple predictable pricing structure & clear ownership.

    This is fundamentally different from leasing software with subscriptions. Any software without significant recurring cost should IMO offer a single time license. They can always say that the upgrade needs to paid for. But the version of the software you bought is with yours. Even if the company dies.

    This is a list of software vendors who support this kind of payment options. They either offer a way for me to buy software and own that version of it or do a lifetime purchase for the service itself.

    Note to self: maybe make this a website later on?

    Pushover

    Description: Send notifications to your devices with web hooks. Basically hosted ntfy/gotify service.

    Plan: One purchase per device/platform priced at 5$ last I checked. Lifetime access. A steal if you ask me.

    Alfred

    Description: The Quintessential quick launcher for Mac.

    Plan: One purchase per version.

    MixPost

    Description: Cross platform social media manager

    Plan: One time purchase of 299$ for the pro. lifetime license.

  • On why companies should self host

    Consider the following use cases, and a hypothetical scenario of what would happen normally in an organization which does not solve these explicitly

    1. A developer wants to convert sensitive doc from base64 to plain text
      • people go to public websites & paste PII data due to ignorance
    2. People want to shorten links and create bookmarks. for example company.links/mis should take you to the data analytics platform
      • everyone creates there own set of bookmarks, and migrations to new domains are essentially messages on public channels etc..
    3. Dashboard of all internal services
      • bookmarks again
    4. People want to chat using ChatGPT / Claude
      • they user either personal accounts with their own configuration
    5. Marketing wants to transcribe a un-released, probably sensitive video and extract text
      • upload to some website which offers a free tier
    6. Convert & Join PDFs
      • some random online tools
    7. Share files between devices
      • personal whatsapp or telegram would be used

    There are many other use cases where employees default to their own preferred method without really thinking about data security

    Instead if the company opts to run a small server under a VPN, then for each of these problems, there is a nice solution which is data sec friendly

    Here is how it would look like in a world where the company uses something like runTipi to self host inside a VPN

    1. A developer wants to convert sensitive doc from base64 to plain text
    2. People want to shorten links and create bookmarks. for example company.links/mis should take you to the data analytics platform
    3. Dashboard of all internal services
    4. People want to chat using ChatGPT / Claude
    5. Marketing wants to transcribe a un-released, probably sensitive video and extract text
    6. Convert & Join PDFs
    7. Share files between devices
      • Something along the lines of Send

    The organization can have its own universe of self hosted & well designed tooling which solves these problems really well.

    More importantly, self hosting is deemed to be too scary. I disagree. Its actually very much doable with the current compute speeds and docker based backups.

    Companies should give it a honest shot at a small scale to start with and see how it goes.

  • Torrenting on Tipi

    Let’s say you want to download Sintel, the creative common licensed movie made in Blender from torrents.

    There are a couple of options for downloading torrents on your Tipi. We’ll use Transmission for now.

    Understanding the folder structure

    Tipi mounts folder /media/torrents for transmission to write into. Usually people prefer incomplete downloads to be placed in /media/torrents/incomplete & /media/torrents/complete for complete downloads.

    Permissions

    Make sure ~/runtipi/media/torrents is owned by user 1000. For this, ssh into your server and do ls -al ~/runtipi/media and see if the owner is user 1000. If not, run

    sudo chown -R 1000 ~/runtipi/media/torrents/

    The above should be automated in the install step on Tipi later on, this is temporary.

    Setting good defaults

    Go to settings in transmission

    Set Download to /media/torrents/complete & temporary folder to /media/torrents/incomplete.

    This will make sure torrents download to the folders shared with Tipi.

    You can later move the file to ~/runtipi/media/data/movies or ~/runtipi/media/data/music using an app like filebrowser.

    That’s it! Apps like Jellyfin have access to ~/runtipi/media and you should be able to watch it on your Tipi instance.

  • On Going from Idea to Deploy

    How much time does it take for you to go from imagining a decent idea to having a functional bare bones proof of concept web app?

    Right now, for me it’s around two days of intermittent work. Here’s why

    My current stack is

    • Next js for the back end and SSR
    • mongodb as the database
    • Tailwind as the css framework
    • Firebase as the auth provider
    • Prisma as the orm most of the time (or mongoose if I’m with mongodb)
    • Some kind of redux alternative if needed

    And the list goes on.

    I waste the most amount of time on

    • Authentication: the firebase/ auth0 integrations of the world
    • Deciding on the db
    • Deciding on the front end framework
    • Spending a stupid amount of time on styling and css libraries

    This is pretty disappointing. Idea to go live should be as soon as possible so that iteration speed is great. What’s the point of having insane tech at our disposal if we still struggle to do basic crud apps slowly?

    How to move faster on side projects?

    There are three common answers to this from the community I hear

    1. Php and friends: larvel seems to be pretty good
    2. Ruby on Rails : one of the most beloved frameworks for developing web apps. Apparently once you get it, there is no going out.
    3. Django: aimed at perfectionists with deadlines. Delivers what it promises absolutely.
    (more…)
  • Review – Self hosted photo management apps

    There are only two options, in my opinion, that absolutely stand out right now. One is called PhotoPrism, and the other is called Immich.

    As I mentioned before, I run a modest Chrome Box as a home server. I’ve always wanted to move away from iCloud because it becomes expensive rather quickly, but I couldn’t find a solution that also offered a great user experience.

    I initially started with Immich, ran it once, and it crashed my server. I then migrated to PhotoPrism, which was more efficient, but the UI wasn’t great.

    After a few days, I switched back to Immich. I hoped I could make it work by disabling many options, and I was right. After completely disabling machine learning and video transcoding, Immich seems to be okay, although it still consumes a significant amount of CPU and heats the server.

    But as of now, there’s nothing else like it on the market, and this is the best we have. I hope they optimize the performance and eventually provide tips on disabling heavy features during installation, which would make the onboarding process easier. To be fair, Immich isn’t even at version 1.0 yet—it’s still considered beta software. For a beta, it’s extremely well-polished, so kudos for that.

    My recommendation would be to go with Immich, but carefully go through the settings to ensure you’re disabling any unnecessary or resource-intensive features.

    The rest of the article is a more formal analysis of the pros and cons.

    PhotoPrism

    Pros:

    • Does not disturb the external library, making it very easy to migrate and maintain.
    • Machine learning for face detection.

    Cons:

    • No Android or iOS app.
    • The UI could be more polished.
    • User management UI is not included in the free version.

    Immich

    Pros:

    • Excellent Android and iOS apps.
    • Beautiful UI.
    • Machine learning for face detection, etc.
    • Implementation of configuration for custom use cases.
    • Great administration UI, including views on jobs running, active, waiting, etc.

    Cons:

    • If you run a lighter home server and forget to disable machine learning and video transcoding features, it will push the CPU to 100% consistently. Some feature flag management at the setup level would be helpful to prevent this.
  • Rabbit holes

    I have a (unhealthy) habit of jumping into rabbit holes and spending days or sometimes months obsessing over a topic which is entirely unrelated to my day to day & probably is useless. But, its pretty damn good fun.

    So, here are the rabbit holes I jumped into

  • Review – CasaOS for self hosting

    CasaOS is yet another project to assist people into self hosting. But this one gets a whole lot of things right.

    What it gets right

    • The setup process is smooth, with a machine running tailscale & CasaOS. Most of the heavylifting for self hosting a home sever is done. The UI for apps is beautifully done.
    • Gets the UI right for volume management. Adding and removing volumes is beautifully done. Assumes you understand docker compose though. Nice balance of power and flexibility.
    • Minimal but well curated app store
    • Support for third party app stores
    • Very well made file manager
    • Supports network shares with samba inherently. Very powerful.
    • Can cap ram usage and cpu usage per application. A very useful feature since some applications like Jellyfin, photoprism can potentially take over the entire ram.

    What it does not get right

    • Entire backups are not obvious, duplicati is recommended. But that only backs up selected folders. I would prefer something along the lines of time machine in MacOS. This is actually a deal-breaker, but I don’t think there is any project out there which figured this out either.
    • Hosting to public domains is not inherently obvious. It would be nice if it supports
    • Communicating between apps is non – intutive. I’ve still not figured out how to get sonarr to talk to transmission. Or how I can configure Homarr & make it talk to jellyfin etc..
    • Network shares is great, but need polishing. Currently its without auth and no way to tweak permissions. Probably advanced and might be potentially out of scope for the project, but it does have the idea of making a folder network share-able. Would be nice if there is an app just to interface with samba, so that CasaOS just interfaces with the app for network sharing capabilities.

    Verdict – Recommended

    If you are starting out on your self hosting journey, definitely recommend checking out CasaOS. Among Umbrel, Tipi & CasaOS, CasaOS definitely gets most things right.

  • Experiment: Running a home server on a chrome box

    Status: Running

    I’m new to the self hosting server game. And in retrospect, this is kind of embarrassing to not foresee that this would not work TBH. Let’s start with the story.

    My friend has a old Mini PC lying around which was unused. Since I was currently running Tipi on my Thinkpad & kind of abusing it. I wanted to move to a more sustainable solution. So, I asked for it temporarily to see if I can use it as a home server while I figure out what hardware to buy. Or if it really fits the bill, probably buy it.

    On arrival I realized it was a chrome box. Damn it.

    Chrome box’s and chrome books are meant to be usually starter level compute boxes. Initially built for Students & Teaching organizations. Hence the doubts. But then this comes with an i5 4th gen or something along that lines. That’s a powerful enough CPU. So I kept my hopes high and kept going.

    First mistake: Deciding to run fedora workstation

    On my thinkpad, I run tipi on fedora. seems to work alright. So, I thought why not. I was lazy and did not want to experiment.

    A non-server OS comes with a couple of problems. Starting with a full graphical install. Power saving modes. By default, disabled SSH. sleep and suspend, login on boot, etc..

    Although all of these were manageable, it made more sense to just move to a server based operating system.

    So I moved to fedora server instead.

    The weird issue of needing a functional display to boot up

    SSH was setup on install, the os was pretty good. Everything was sorted on the software front. But interestingly, the system would not boot on power if the HDMI cable was not connected to a display.

    This was very weird, because the power was on and the system just did not start. After a lot of googling and some AI assistance, I realised that chrome boxes might have a setting to prevent stale starts, so they had this feature of not switching on the system if the display was not connected. I would have to deep dive into the bios settings to figure out if I could turn this off, but I thought I can live with this for a while and start using the server by manually switching it on every time it switches off.

    This was also okay because I had power back up at my place, so once it was on, it rarely switches off.

    Heating issues

    After installing the server and installing immich. I observed that the server randomly just goes down.

    This was unanticipated, after which I installed net data and started looking at temperatures. To see if it was a potential heating issue.

    As expected, the CPU was running around 10 3°C. Which is pretty high and naturally, the system was just shutting down.

    Chromebooks are not designed for servers. They don’t have great cooling systems. This is obvious, but for some reason, I was optimistic and thought I could wing it.

    I could add more cooling into the picture, probably add a fan, et cetera, but I still need to figure out the bios set up.

    Conclusion

    This is just the beginning of the set up. I have just installed image hosting application. I’m not yet installed Jellyfin, which is my media streaming service of choice. That would be more CPU intensive and probably even GPU intensive if it is present.

    If with one application, the system is just heating up and shutting down, there is not a lot of hope in deploying the entire stack.

    I have now returned the CPU. Time to be on a lookout for probably a better mini PC.

    Learnt about setting up

    • Server operating systems
    • Started looking into network attached storage
    • Learnt a lot about cooling and temperature monitoring, et cetera
    • Learnt about the limitations of using tipi and customs volumes

    All in all an interesting experiment of around 5 hours. Need to be on the lookout for the next server now.

    The journey continues…

  • On the power of communities

    A bit of a story, I recently wanted to buy a mini pc for my home server. That’s a 20+k investment, even if I buy it in second hand. While taking to a friend, I realized that one more common friend had a pc lying around which was unused. After asking him about it, he quickly offered to give it to me since it was lying unused.

    A couple of interesting things happened here. I saved a significant amount of money and time, which would have gone into researching and purchasing the pc. My friend got rid of some old unused hardware which was lying around. We effectively contained the carbon footprint of the community too since I did not purchase yet one another pc. I potentially prevented a computer from entering the landfill. That’s a pretty damn good outcome!

    On communities

    We are social animals, we thrive in communities/tribes. Each and every one of us is potentially part of multiple communities. Family community, workplace community, your friends who you play football with etc..

    There are resources scattered in these communities which are wildly under utilised..

    Economics of scale is a very powerful fact of life. With volume, almost always price comes down. If communities are formed, a lot of things organically start working out.

    Use cases

    • Sharing subscriptions like iCloud, google one, YouTube, prime video etc..
    • Car pooling across groups like apartment groups & office groups
    • Collation of community media and resources like videos, music etc.. For example, uploading pics of an event, sharing videos after an event. Uploading performances for everyone in the community to check out etc..
    • Communication like posting events & updates, requesting referrals etc.
    • Listings for leasing for or for not money and managing exchanges. For example, I have a raspberry pi at home which I’m not using which I’ll happily lend to a junior at office if he wants to experiment on IoT.

    Types

    Communities in my opinion are of multiple types by the nature of entry

    • By property: you’ll be a part of a community by property of being a part of a family. Or by property of being employed at a place.
    • By trust: you and your friends will be a part of the community

    Dynamics are quite different in these communities. For example, you might comfortably give away your bike to a friend for a week, but will think twice to give the same bike to a coworker in your office.

    Some financially profitable use cases to experiment with

    • Backups: backblaze seems to offer a terabyte worth of data storage on hot s3 compatible medium for around 7 dollars a month. For one person, this might be overkill, but for a family of four it makes sense.
    • YouTube and iCloud has family plans which could be shared and are fairly cheap per member.

    Closeknit: an experiment

    Looks like discovery is one of the most important bottleneck. Closeknit aims to be a discovery and management platform for items and subscriptions.

    With Closeknit, you can join communities and discover under utilized resources.

    Let’s see how this goes.

  • Review – Tipi for self hosting

    What it gets right?

    • Install is very smooth
    • auto HTTPs for all public domains using let’s encrypt & traefik
    • local domains for self hosting out of the box
    • great UI for installing and managing apps
    • app level backups on the UI. Really big deal since not a lot of products in this space have it out of the box including CasaOS.

    What it does not get right?

    • No UI for automated app backups & No Maintenance window setups for complete server backups

    Observations

    Once I get used to the tool, I lose the advantage of having a fresh eye. Hence documenting observations here if I find anything weird.

    • Uninstall is not obvious. The expectation is to stop the app & then uninstall. Although that makes perfect sense, it would be nice to just have a button for uninstall.
    • tipi.local does not work as advertised. Not sure why too.
      • there was documentation here, did not stumble into it cause it said “Local SSL certs” but I read it as “documentation on enabling https://tipi.local”, not how to setup tipi.local. but anyways, the docs were fantastic and folks over at discord were super helpful. Made it work after following the docs.
    • connecting between services is not intuitive.
      • You just use your IP:port, it would be nice if there was a button to copy the URL from the service on the dashboard
    • mounting volumes is do-able, but no UI
      • nitpick, user-config does exist. But it should be done via code. UI around adding volumes would be really nice.
  • Project notes – Citadel

    For context, read the self hosting dream post

    The philosophy / idea is to provide an interface to self hosting applications similar to phones. Just run the OS, hit install and you’re ready to start using the app.


    Literature review

    I do not want to re-invent the wheel. If something along these lines already exists, I want to use it.

    Caprover / Dokku & friends

    Aim to be solving for developers interested in self hosting. Serve as an alternative to companies like vercel. Although there is significant overlap in scope, do not really solve for a simple day to day customer who is interested in hosting some services.

    Umbrel & CasaOS

    Solves for hosting at home. Does not intend to be on the public internet, although you can achieve that by cloudflare tunnels. Extremely well thought out systems IMO. If they solve for backups & domain management as a part of their stack. Citadel might not be needed.

    Tipi

    longer review here

    Apart from minor nitpics on UI, bang on on what I planned. unfortunately I found this after phase one, if not I would have just jumped on discord and asked questions on how I can just help.


    Phases

    Phase one

    A CLI to setup a single node on something like a raspberry pi, old laptop or desktop or a server rented online. We’ll start by supporting Ubuntu for now. It should allow for

    • Setup: user creation, installing docker if it already does not exist. Setup firewall if it doesn’t exist. Configure to open only required ports. docker network creation. App Store setup.
    • List: list all available apps
    • Install: install an app. run pre install and post install hooks if needed. pre populate app data if needed. Spin up the docker compose project.
    • Update: update the docker compose file. Re start the docker compose stack with the new version.
    • Remove: spin down the docker compose stack. Delete app data if specified. Delete the installed app.
    • Domain add: add another base domain to the application.
    • Domain remove: remove a domain mapping

    Problems and Design decisions

    with corresponding considered solutions & solution picked

    User Authentication & Default passwords

    Both google & iOS thrive on using their own authentication stack (google sign in & apple sign in). Also, apps on the phone do not have to worry about others accessing it since they are protected by the phone lock itself.

    Potential solutions:

    1. maybe block the UI behind a simple HTTP auth? Horrible idea, apps can chose to have complex multiple user management. This will either be two layers or one half implemented layer.
    2. Have only apps which have sign in with say apple, firefox or some auth provider in the auth store? Adoption will be next to impossible. Not a good idea.
    3. during on boarding, let the user give a really long default password and use that as the default password for everything. meanwhile clearly calling out that the password needs to be changed post log in? Sucks UX wise, but works and is a good middle ground. The password has to be stored in plain text somewhere in the env variables. Security wise probably a bad idea.
    4. How does umbrel solve it? maybe borrow that solution? It seems to have default passwords in the codebase. Also umbrel is not trying to be a public facing server. It’s more a personal server.

    right now, going with option 3

    User input for setup (like domain names etc..)

    If I’m hosting a WordPress website, I should get to choose the domain it lives in. How to get this information during app install smoothly?

    During setup, ask as little questions as possible.

    Domain management

    how to make this seamless?

    For example

    • the user is setting up their wordpress blog, I want to ask what the subdomain should be. should it be blog.citadel.bharatkalluri.com or notes.citadel.bharatkalluri.com?
    • what if the user wants to add another domain along with the default one?

    File permission management

    To run docker at user level (`1000`), all the app data folders must be created to run at `1000`. Which means that, the app data folders need to be created upfront, permissions need to be set in the app data folder so that when the container actually executes, it has the privileges to actually add/edit/delete files in the folder.

    That means that before running the container, we’ll have to first set up folders in app data with the right owner. Which means that app data needs to actually be pre-populated.

    Solution: every app will also come with a folder for app_data. the contents of this folder will be copied into the `$CITADEL_APP_DATA` folder as it is & then docker compose up is run. this “seed” app data folders will already have the right permissions setup. This is sorted now.


    Apps I would like on citadel

    All the apps which a phone has built in like

    • File manager
    • App Store

    These will be un-installable. Basically the default apps.

    and some more like

    • WordPress
    • Memos note taking
    • pihole
    • Jellyfin: auto discovery might or might not work
    • Tailscale on the host system: setup during onboarding
    • Duplicati
    • Transmission
    • A monitoring tool
    • Statping
    • Arrr stack & Jellyseerr