This year will be on rebuilding and growth. After a challenging 2025, I am looking forward to a fresh start in 2026.
After living in 4 separate cultures - Singaporean, American, Australian and Dutch, I have a privileged perspective on cultures, lives, countries.
Granted they are all a snapshot in time, formed through my own personal lens and limited by my sphere of comprehension at that time,
I do feel I have a broader perspective than most. I am exceedingly thankful for the life I have led so far, I have taken on a lot,
challenged a lot and overcome a lot. Moving forward, I want to think about which direction I want to take the next 5 yrs.
Chatbot Deployment Update: 15 Jan tweaks
I spent a few days rounding back to this and thinking on how I can improve it. I am still learning about AI but I feel that the chatbot gives limited examples of my work/portfolio, and also doesn’t do enough filtering of irrelevant questions and naturally accepts or rejects what questions it can answer, what it is trained on. The current problems:
- configuration issues (k=5 is too low, no chunking)
- filter design (too permissive for personal questions when there is little personal data in dataset)
- prompt emgineering (summarization towards personal skills)
- lack of testing (no established baseline)
- multiple LLMs (anthropic never trains on api data, openai stopped since march 2023 and ollama is 100% private)
Currently, the costs for the current set up is really low (was my goal). 79 mdx files, about 600kb of text, 6818 lines = 145k tokens at 580kb / 4 chars per tokens It fits well in a single LLM conext window for Claude 2.5 Sonnet ad Haiku, will need chunking for gpt-4o-mini. Cost per query is $0.00077, assuming 1k querues: $0.77 per month.
If I ran this through as a document to an LLM (much alike notebook LM): Without Prompt Caching:
- GPT-4o-mini: $0.022/query → $22/month (28x more)
- GPT-4o: $0.364/query → $364/month (473x more)
- Claude Haiku: $0.117/query → $117/month (152x more)
- Claude Sonnet: $0.437/query → $437/month (567x more)
With Prompt Caching (Anthropic 90% discount, OpenAI 50% discount):
- GPT-4o-mini: $0.011/query → $11/month (14x more)
- Claude Haiku: $0.012/query → $12/month (15x more)
- Claude Sonnet: $0.044/query → $44/month (57x more)
Accuracy Analysis
Would it be more accurate? YES, significantly:
✅ Zero retrieval failures - Every document always available ✅ Better “list all” queries - LLM sees everything, can’t miss projects ✅ Cross-document reasoning - Can compare/synthesize across all content ✅ No chunking artifacts - Full context, no split information ✅ No scoring bias - No keyword ranking affecting visibility
Potential downsides:
⚠️ Lost in the middle - With 145k tokens, some attention dilution ⚠️ Latency - Processing 145k tokens slower than 5k tokens (~2-4 seconds vs less than 1 second) ⚠️ Scalability limit - Won’t work if content grows to 500k+ tokens ⚠️ Cache invalidation - New content requires full re-cache
If i want a hybrid approach with the two:
- 20% of queries are “list all” type: 200 * $0.012 = $2.40 (“What are your list of …”, “What are all your… ” questions)
- 80% of queries are specific: 800 * $0.00077 = $0.62
- Total: ~$3/month (4x current cost, but solves both problems)
The flow chart would be a multi-agent system that looks like this:

Original Multi-Agent Design
| Component | Cost per Query | Monthly (1k queries) |
|---|---|---|
| Filter Agent | $0.0002 | $0.20 |
| Classifier Agent | $0.0002 | $0.20 |
| Tag Agent | $0.0005 | $0.50 |
| FAISS Agent | $0.00077 | $0.77 |
| Relevance Scorer (LLM) | $0.004 | $4.00 ⚠️ |
| Response LLM | $0.0003 | $0.30 |
| Behavioral (Claude Sonnet) | $0.015 | $3.00 ⚠️ |
| Total (Skills 80%, Behavioral 20%) | - | $8.00 |
Optimized Multi-Agent (Cheap)
| Component | Cost per Query | Monthly (1k queries) |
|---|---|---|
| Filter+Classifier (combined) | $0.0002 | $0.20 ✅ |
| Tag Agent | $0.0005 | $0.50 |
| FAISS Agent | $0.00077 | $0.77 |
| Heuristic Scorer (no LLM) | $0 | $0 ✅ |
| Response LLM | $0.0003 | $0.30 |
| Behavioral (GPT-4o-mini) | $0.004 | $0.80 ✅ |
| Total (Skills 80%, Behavioral 20%) | - | ~$2.40 ✅ |
Savings: $5.60/month (70% reduction!)
How Behavioral Analysis Works
You asked: “how to draw connections and analysis about a person through blogs”
Strategy:
- Load blog posts (recent 20 posts, ~60k tokens)
- Extract themes (first pass with Claude):
- Leadership examples
- Problem-solving approaches
- Values and motivations
- Lessons learned
- Synthesize answer (second pass):
- Use STAR format (Situation, Task, Action, Result)
- Base on actual blog content
- Show self-awareness and growth
Example: Q: “How do you handle technical disagreements?”
Behavioral Agent: → Scans blogs for conflict resolution stories → Finds: “In my post about ManaBurn architecture decisions…” → Extracts pattern: “I prefer data-driven discussions + prototypes” → Synthesizes: “I approach technical disagreements by [STAR response based on actual blog examples]”
Call me cheap but $8/mth is more than I want to pay. Reiterating. More tomorrow.
Personal Goals for 2026
Physical
Getting into peak form in terms of strength, weight-bearing workouts, flexibility and cardio.
Mental
Onboarding a meditation practice, journaling practice, and reading habit.
Finding a comfortable groove that allows for reflection and growth.
Professional
Looking for an evironment that challenges me, focuses on healthy growth.