[ { "title": "Follow the Hackers", "url": "https://endler.dev/2024/follow-the-hackers/", "body": " This post offers personal insights on predicting tech trends by observing hackers. It’s based on subjective observations and should be read with critical thinking. If you’re looking for meticulously researched, data-driven analyses, you might want to check out an actual research paper instead! A good way to predict which technologies will be important in the future is to see what hackers are passionate about today. Defining “Hacker” I’m using the term “hacker” in the spirit of the Hacker Ethic, as described by authors like Steven Levy and Pekka Himanen. In this context, a hacker is someone who: Follows their passion and seeks self-fulfillment through technology Creates something beneficial for the wider community Values freedom, cooperation, and voluntary work Challenges traditional work ethics with a focus on creativity and sharing These folks are a small subset of the population, but they have some traits that make them excellent predictors of the future: They care deeply about their field, such as programming They’re passionate about the things they believe in They’d use something even if no one else cared about it They work at the cutting edge, so they need the best tools to do their job They have strong opinions about what’s good and what’s bad, and can back up their views with facts They don’t care about investors, quarterly earnings reports, or politics. They purely focus on the technology’s value. Catalysts For Success And Red Flags Of course, not every hyped technology makes it big. Remember NFTs or Web3? One key difference is that hackers weren’t truly passionate about these technologies – non-tech people were. Another red flag is when the technology’s benefits are hard to explain. If hardcore tech people can’t explain the benefits to you, that’s a bad sign. Instead, there might be other motivations at play! Usually, it’s people trying to make money from technology. They call themselves Investors, “Serial Entrepreneurs”, or “Thought Leaders”. You can easily spot them on LinkedIn, because they update their public profile with a fresh set of buzzwords every 6 months. There are a few legit ones, but most are opportunists who couldn’t explain the technology if their life depended on it. All they care about is profit. The hackers? They don’t care what you think about them. They’ve got nothing to sell you. They’re too busy building cool stuff! A question hackers care about is “who owns the platform”? If it’s companies, they always have an agenda. Invest a ton of time and effort, and they might just lock you out and profit from your work. Hackers don’t like that. Therefore, all the winning ideas I mentioned are open source. If a technology isn’t, that’s a major red flag when evaluating its future potential. It’s not even optional anymore – it’s pretty much mandatory. But there’s another reason why open source is a catalyst for success: Initially, open source projects start as minimally functional versions without user-friendly documentation. They might be tough to set up, but the core idea is there. If people stick with it despite the lack of hand-holding, you know it’s solving a real problem and that’s a sign of a winning idea. You Still Need Patience You probably know I’m all in on Rust. After all, I make my living as a Rust consultant. It took Rust over a decade of development to gain any real traction in the industry. It’s been a slow but steady climb. 
It can take a while for the general public to catch up with a trend before a technology is ready for prime time. Sometimes it’s a decade or more, especially for core technology like new programming languages or databases. That’s just the time it takes for technology to mature. That’s why I usually advise founders to stay a bit behind the cutting edge when adopting new tech. The industry needs time to catch up, and big companies need specialized tools to integrate new tech into their development process. But it’s a calculated risk because the writing is on the wall. What Can You Learn From This? Hackers are already living in the future. You can use that to your advantage. Ask 10 tech enthusiasts what new things they’re excited about, and you’ll get a good idea of what’s going to be important in a few years. Unfortunately, business people don’t talk to hackers regularly. That’s a missed opportunity, because these folks love nothing more than talking about their passions. If you’re selling to developers (and you probably shouldn’t), the key is to really listen to what they’re saying and then follow their lead. " }, { "title": "Be Simple", "url": "https://endler.dev/2024/be-simple/", "body": "Last night I realized that my life is very simple. That’s not by chance, but by conscious effort. Life becomes complex all by itself if you do nothing about it. One day you’ll wake up and you have a mortgage, 10 on-demand subscriptions, 20 insurances, 1000 open browser tabs, a demanding job and a dog. And when you realize it, you wonder how you got there. I keep my life simple because I know my time is limited. Time and health are my best proxies for happiness. Simple Doesn’t Mean Boring Quite the contrary: give me enough time and I find ways to entertain myself. My friends might disagree, but I consider myself to be an introvert. I like to spend time on my own to explore and learn. There hasn’t been a boring moment in a long time. If life was more complex, that would take away my time, but time is the resource I can’t replenish, so I protect it. How? Mostly by saying NO. No streaming subscriptions. No Disney+, no Netflix. I rarely watch TV anyway. No gym memberships. Just run in the park. No Instagram or TikTok, but part of that is getting older. My shoes are 6 years old. So is my wardrobe. No meetings if possible. I’m the guy who sits at his desk for 8 hours straight, only getting up for bathroom breaks twice. No property; I’m a happy tenant. No commute; I work remotely. Great home cinema setup you have there. Thanks for inviting me over! At home, I don’t even have external speakers. You’re planning a trip to the Bahamas? Enjoy! Send me a photo. Regarding technology, that means: Even though I’m a programmer, I only have a single screen. I evaluate tools, but I keep the number of tools limited. My editor doesn’t have a debugger. No Notion or Obsidian if a text file + git is enough. I limit the number of browser tabs with an extension. Simple Doesn’t Mean Minimalistic My goal is not to have as few possessions as possible – I own a lot – but to lead a simple life. I’ll happily buy things if they make my life simpler. The last big life improvements were a robot vacuum cleaner (four years ago) and an automated cat litter box (two years ago). If I decide to buy something, I make sure it’s the absolute best I can afford. (My rule is actually “don’t buy crap”.) For example, I spent a ridiculous amount on the best laptop I could buy. 
It’s my daily driver that I spend most of my time with, so it needs to be an absolute workhorse. My work is also compute-intensive, so I saw the purchase as justified. I always pay the price in full. No lease, no monthly payments. I have to use services for work, but I prefer monthly payments over yearly subscriptions, even if they are 30% more expensive. The fact that I can cancel at any time is more important to me. I know that when I buy something, it demands my attention. Maintenance is not fun. Even though I like the idea of owning something, I probably don’t truly own it. That makes me the worst consumer possible. I keep things in my Amazon basket forever. From time to time I look at the items, and when I enjoy seeing them in my basket… I keep them there. The rest, I just delete. This way, I get the “feeling” of owning things without spending any money. Simple Doesn’t Mean Convenient To live with such a person is not easy. We have regular discussions about “investing” money into things that I’m skeptical about. It takes me ages to reach a conclusion. Vacation planning is definitely one of my weaknesses. I am very well aware that my approach is not perfect at times. I am okay with making these sacrifices for protecting my time and therefore my happiness. Recently, I got a few emails from people telling me that my newsletter subscription is broken. I’m aware of that. My newsletter provider shut down. I won’t fix it. It turns out that people find other ways to follow me; either on Mastodon or via RSS. I also don’t have a comment section – Reddit or HN work just fine. Even though there are a few folks on my old newsletter list, I never got around to sending many emails. I don’t particularly enjoy writing a newsletter, so it might be best that it finally broke. I will probably remove the signup box. It would be nice to have it “just work”, but the next best thing is to not have it at all. Simple Means Letting Go Perhaps another way to explain it is the midwit theme: I try to stay on the left side of this curve as much as I can. I’m aware that there are “smarter” ways to do things, but I don’t want to dedicate time to learning about them. I only dedicate time to things that matter to me and that I want to go really deep into. The meme shows simple approaches on both ends, with a complicated phase in between. Getting to the right side of the spectrum takes lots of effort, and I’ve only made that journey a few times in my life. For the rest, knowing there’s an awkward complicated phase in between keeps me happily on the simple side. It’s fine. Simple Means Focused Greatness comes from dedicating time to the things that matter. The most productive people I know are focused. Yes, there’s a creative process and they allow themselves to be creative, but they do so in a very constrained environment: their office. While others chase trends, they do the thing they’re always doing. They put in the hours. It’s way easier to be focused when life is simple. When there’s no room for distractions and complexity. I find that constraints help as well. Technology is one major source of distraction. Some of the best stories were written with a typewriter. In itself, it’s a very limited environment, but it takes away all the distractions and lets you focus on the task at hand. I find that inspiring, liberating. That’s why I like constraints. When I give presentations, I wonder what I’d write if I could only have 5 slides with 5 words each, or I could only use two colors, or only show images. 
It keeps me focused on the message. It’s simple. Simple is beautiful. Simple makes me happy. " }, { "title": "What to Write", "url": "https://endler.dev/2024/what-to-write/", "body": "People sometimes ask me how I come up with things to write. To me, it’s the same as asking how I come up with things to say. There’s always something to say. It might not be novel or interesting to most, but it’s important to me and hopefully to someone else. What people actually want to know is how to come up with something interesting to write. But why should that matter? What if people don’t find it interesting? Was it a waste of time? Why Write? There’s this funny thing which happens when you write for a while: you forget what excited you about writing in the first place. Instead, you find yourself chasing trends, trying to get more views, and build a following. Even if you’re aware that this is happening, it’s hard to stop. Your inner monologue tells you that what you’re writing isn’t good enough or that your readers won’t like it. Writing becomes a chore. Eventually, you stop writing. Somewhat tautologically, people come here exactly for one reason: to read what I write. If I make it about them, I have to guess what they want to hear, which kills the joy in writing, and also, in reading, as the content becomes predictable. Interesting Doesn’t Mean Novel. Just because someone else wrote about the same topic doesn’t mean it’s off-limits. There are a million love songs to disprove that. As it turns out, while they all revolve around the same topic, they’re all unique. They are personal, which is what makes them different. Some of these songs I like because I can relate to them. To me, that’s what makes them interesting: it’s the same story but told in a different way – a personal way. And that personal makes it new and that new makes it interesting. If You Take Away the Personal, You Take Away the Interesting Writing is a lot like that. I get to learn about how other people feel and how they think. It’s mostly an experience; shaped into words. It’s beautiful to think how writing is such a simple way to learn from the experiences of others. And how, with just a few words, you can emotionally connect with a stranger. It’s a very human experience. Often, what you leave out is more important than what you keep; the reader fills in the blanks. Eventually, a story starts a life of its own; when it gets shared; when it gets retold. It’s no longer the author’s story but the reader’s. It becomes part of lore. Who wrote it isn’t that important. I can’t tell who reads this and why should I care? Instead of trying to make other people enjoy my writing, I want to connect with people who like the same topics. Big difference. Knowing That Is Liberating It gives me confidence that I will never run out of things to write. At least not as long as I remember why I write. It’s liberating because I don’t have to chase the new. Instead, whatever it turns out to be is enough. At times, I’m as clueless as the reader to see where this leads me. Maybe someone else will find joy in it, maybe not. It doesn’t matter. What matters is what you think matters, and that’s what you should write about. " }, { "title": "Move Slow and Fix Things", "url": "https://endler.dev/2024/move-slow-and-fix-things/", "body": "Growing up as a kid in rural Bavaria, I always dreamed of moving to the US to run a startup. Many kids in my generation shared that dream. 
To me, it felt like the only way to combine my two greatest passions: writing code and building things. As I got older, I became disillusioned with the narrative surrounding Silicon Valley. The hockey stick growth, the VC money, the “get rich quick” mentality – it was all one big illusion. For a long time, I couldn’t put my finger on what exactly bothered me about it. Part of what made me increasingly uncomfortable was the glorification of hustle culture – the idea that you have to work yourself to the bone to make it big against all odds. The other part was the “winner takes all” mentality and the mindset that you have to “move fast and break things” to succeed. I don’t believe that has to be the case. As it turns out, I’ve always been drawn to the exact opposite: sustainable growth, robust solutions, and a long-term mindset. That’s why I’ve been contributing to open source for 15 years, why I only run small, bootstrapped businesses or non-profits, and why I focus on writing and knowledge sharing. Paul Graham and his VC buddies would have you believe that your ultimate goal as a founder should be to build a unicorn. But when I look at the Ubers, Facebooks, and Googles of this world, I see greed, gatekeeping, systemic exploitation, user tracking, excessive resource consumption, and lawsuits against competitors. These companies will do anything in their power to stay on top – even if it means bending the law or finding legal loopholes. What kind of role model is that? The Other Side Who’s on the “other side”? It’s the humble minority building small but meaningful things. These people advocate for privacy, develop civic tech, try to live within their means, move deliberately, and fix what’s broken. They fly under the radar because their success isn’t measured in dollars, and they lack big marketing budgets. Instead, they focus on their product, doing a lot with very little. I find that far more inspiring. It’s incredibly rewarding to build something people love that can provide you with a comfortable living. There’s no need to risk it all, drop out of college, work insane hours, and leave a trail of destruction in your wake. You can build something small and useful instead, without a venture capitalist breathing down your neck. It’s still hard work, but you’re leaving the campsite a little better than you found it. Note that moving slowly doesn’t mean you can’t make quick decisions. It’s just that the execution should be deliberate. Don’t wreak havoc along the way. Because the time to fix what you might break rarely comes. VCs are Not Your Friends In “Why to Not Not Start a Startup”, Paul Graham writes: So, paradoxically, if you’re too inexperienced to start a startup, what you should do is start one. That’s a way more efficient cure for inexperience than a normal job. In fact, getting a normal job may actually make you less able to start a startup, by turning you into a tame animal who thinks he needs an office to work in and a product manager to tell him what software to write. Let me be blunt: that’s nonsense. It paints a black and white picture of the world that simply doesn’t exist. You don’t have to choose between starting a startup and working a soul-crushing job as a “tame animal” in a cubicle. There’s a whole spectrum of possibilities in between! For instance, you could work for yourself or with a small team, making use of your creativity and coming up with your own ideas. Paul wants you to start a startup because he wants to fund you and profit from your hard work. 
His motives are purely egoistical. If you happen to hit the startup jackpot, Paul gets even richer and you might become wealthy too. If you don’t, you’re left with nothing while Paul, already rich, gets to write an essay about your failure. That’s a whole lot of risk for very little upside. You might wonder why I’m picking on Paul Graham so much. It’s because I once looked up to him and valued his essays. He represents a worldview I used to believe in, but now consider harmful. Most of his essays seem true on the surface, but dig deeper and you’ll find his claims are based on a narrow worldview and rarely supported by evidence. Misleading young, impressionable people is dangerous. Startup founders bear all the downside risk, while venture capitalists are well-insulated from failure. VCs spread their bets across numerous startups, ensuring they profit regardless of individual outcomes. For you, the founder, it’s an all-or-nothing gamble with your time, energy, and often your financial stability. Is All VC Money Bad? Of course not. But I’d argue it’s becoming less and less relevant in today’s world. Take building a software product, for example. You don’t need a fortune to get started anymore. There are website builders, cloud hosting solutions, and open source software at your fingertips. Why take on VC money when you’re just starting out? Some might argue that you profit from valuable networking opportunities and business advice along with the funding. But most of that information is freely available online these days. There’s an abundance of podcasts, videos, and books on the subject if you’re willing to learn. It’s trickier, of course, if you’re building a physical product. But even that has become much easier in recent years. Could you sell a 3D printed prototype before scaling up production? Or launch a Kickstarter campaign to fund your first batch of products? There are now print-on-demand services for t-shirts, mugs, posters, and books. Plus, there are plenty of brick-and-mortar stores you could approach with your business idea if you’re looking to collaborate. Let’s say you do become wildly successful and urgently need capital to scale. Maybe then VC money is one way to go. Or you could take out a loan. Either way, remember that money always comes with strings attached. Funding might force you to do things you’re not comfortable with, like compromising your users’ privacy or your own values. And even if not, you’ll constantly be pressured to find new avenues for growth. Wouldn’t it be nicer to focus on making your product better instead? That’s often far more rewarding. But often, if you spend enough time thinking through a problem, you might find a way to prove your concept at a smaller scale. There Is No Infinite Growth Ask yourself: What truly motivates you? Is it power, money, or fame? If so, there are more meaningful things in life. Your self-worth isn’t tied to building a unicorn. I don’t know who needs to hear this, but it’s perfectly fine to be 23 and not be a millionaire founder. If you’re a high school or college student dreaming of running a startup, know that there’s another path. Don’t sell out your moral compass for a quick buck. You don’t have to jump on the AI bandwagon just because it’s the flavor of the month. At least 144 out of 251 companies from the YC W24 batch are building products with “AI”. That’s 57% of the batch. How many of them are actually doing something meaningful with it? How many will still be around in 5 years? 
What lasting value do these companies bring to the world? We’re accumulating a massive amount of tech debt while rewarding short-term thinking and profit over sustainability. Sure, Paul will benefit if one of them hits it big. You can read all about it in his next essay. But maybe there’s another way. Maybe it’s okay to have a small business with a loyal user base that pays the bills. Maybe you don’t need to grow exponentially. Maybe you can move slow and fix things instead. " }, { "title": "Asking Better Questions", "url": "https://endler.dev/2024/asking-better-questions/", "body": "Recently, I realized that I mostly get paid to ask questions. As a consultant, advising companies As a podcast host In calls with potential clients The curious thing is that, like most people in a similar position, I never had any formal training in asking questions! I basically just wing it and try to get better over time. That got me thinking: What makes a good question? The other day, I reflected on that. Here’s what I came up with. 1. Good Questions Are Open-ended Recently, we did a survey about our podcast, and someone mentioned that some questions I asked the guests were “either-or” type of questions. Q: “Do you prefer dogs or cats?” A: “Dogs.” Ouch, not a very interesting conversation! A better one might have been “What’s your favorite pet?”. It allows for surprising answers. Q: “What’s your favorite pet?” A: “Tarantula!” We have to ask ourselves what we want out of the answer, which leads me to my second observation: 2. Never Ask A Question If You Don’t Care About The Answer Or to rephrase it, “Only ask things you care about.” If you don’t care about the other person’s answer, why even ask? Ask something else you care about instead! For example, when you ask someone if they prefer dogs or cats, what are you really asking? Do you wonder if the person is nice? Do you care about the logistics of owning a pet as a full-time employee? Or do you rather want to hear a certain answer from the other person that happens to fit your narrative? (The obvious correct answer is “cats”.) Be honest! Dig deeper into “why” you’re asking the question. If it’s one of the former two questions, get straight to the point: Q: “Are you a nice person?” A: … Q: “How do you manage having a pet if you have to work all day?” A: … It will lead to better answers. If it’s about your own agenda and you’re just looking for someone to give you the answer you’re hoping for (confirmation bias): don’t. Again, allow them to surprise you! Who knows? You might learn a thing or two about your preconceptions. Which leads to… 3. Good Questions Reveal Something About The Person Who Answers; Bad Questions About The Person Who Asks It’s very easy to slip into a role where you’re framing people, and that lets your bias speak more about you than the person you’re talking to. Be conscious about that so that you can avoid it when it happens. Q: “Why do you love X?” A: “I don’t.” Better: Q: “Tell me something you truly believe about X” A: “I believe that…” The more unique the answer, the more you learn about the person. So I would even say that a good question is one that reveals something unique about the person who answers. What is it that only they can say? What is it that they have unique insight into? Q: “As an expert in X, who has been in the field for 20 years, what is one thing that people always get wrong about X?” A: “People always think that X is about Y, but it’s really about Z.” 4. 
Good Questions Are Stacked On Top Of Each Other Do you know the Five Whys technique? It’s simple: Ask “why” five times to get to the root of the issue. Q: Are you happy with your job? A: No. Q: Why? A: It drains my energy. Q: Why? A: I have to do boring things. Q: Why? A: My boss thinks they must be done by someone. Q: Why? A: No one got around to automating them. Q: Why? A: We don’t have the skills to automate boring tasks. Aha! If they learn how to automate things, this might lead to better job happiness! That’s insightful. What’s nice is that it was a purely mechanical process. With every step, we dug deeper into the underlying question. At some point, the truth revealed itself. I love this technique. Good questions are built on top of each other. The questions themselves don’t have to be complicated. It can actually be the same question asked a few times in a row – even a child could do that. In fact, they do. A lot! This is how they learn about the world around them. As grown-ups, we should not unlearn this technique. “It’s rude to ask that.” “Don’t pry.” “Don’t be nosy.” We should relearn it! Ask follow-up questions to get to the root of things. But also! Hold the other person accountable. Q: “How can we fix poverty?” A: “I will do everything in my power to fix it.” Q: “But how?” A: “I will try countermeasures which were discussed with…” Q: “Can you give me a concrete example?” A: “…” When someone answers your question, ask yourself if the answer really covered everything you wanted to know. Often, the most interesting pieces are omitted. Sometimes on purpose. But this is the most revealing part, the part at the verge of uncertainty and insecurity and you have to uncover it to get to the heart of the matter! If you don’t do this, conversations stay shallow. Speaking of which… 5. Good Questions Run Deep “Why is the sky blue?” “How do people fall in love?” “Are you happy?” These are simple questions! But they touch on the very foundation of what we know, our perception of the world, and ourselves. The simpler the question, the deeper the answer. Answering with “I don’t know” is totally fine. The important part is to stay curious and to be genuinely interested in the answer. 6. Let The Winners Run And Cut Your Losses Short Sometimes, no matter what you try, there’s just nothing in a conversation. You might have ended up in that weird space where people are simply out of their depth and you turn circles. Cut the cord. Just acknowledge it and move on. Trained conversationalists do it all the time without anyone noticing. The conversation gets boring, so they just move on to the next topic. There doesn’t even have to be a transition. Q: “So, what do you do for a living?” A: “I’m a plumber.” Q: “Oh, interesting. So, what are your plans if you win?” A: … Once you notice that, you will see it everywhere. We do it all the time in our daily lives, too, for example on the phone. Just cut your losses, move on to the next topic. Similarly, if you notice that you both are really into a topic, just run with it. 7. Give People Space To Think Pauses are powerful. Ask your question and then… wait. Just wait. Don’t fill the silence with your own thoughts or insecurities. Let the question speak for itself. If you’re truly interested in a deeper thought, you need to give people time to unpack it - for you and for themselves. If you’re not willing to wait, you’ll miss out on the best part of the conversation. We are trained to give quick answers. 
If someone asks you “How’s it going?”, the expected answer is “Good, you?”. People are surprised when I take a moment to answer and give them a truly honest answer, which might be deeply personal. Similarly, let people give you their quick answer first. Then wait. Often, they will stop and follow up with a much more personal answer. Use pauses to your advantage. 8. Obvious Questions Can Be The Best Questions If you look close enough, which question to ask becomes obvious. “Why are we still working on this?” “Should we split up?” “How do you feel after the diagnosis?” You’re addressing the elephant in the room! But just because you have an obvious question, that doesn’t make it easy to ask! You might know that the question could hurt. Or that the answer is uncomfortable to handle. And yet, it’s still the right question to ask. What I noticed is that oftentimes multiple people have the same “obvious” question in mind. They are just too afraid to ask. Most people dance around the topic because they want to be polite. They don’t address problems head-on because it’s easier! The result is small talk. Especially if you get paid to ask questions, your job is to ask questions that no one else wants to ask. Often, politics, infighting, and hidden agendas make it very hard for people to break out of their role and ask the obvious question. If you’re not afraid to ask the obvious question, you will look like Houdini. Suddenly, an avalanche of follow-up questions gets unleashed. An honest, constructive conversation emerges. The thing is, just because you don’t ask the uncomfortable question, it doesn’t make the problem go away. In fact, you might make it worse in the long run. It’s easier to get it out of the way and move on! Q: Why are we still working on this? A: Actually, I have no idea. Let’s find something else. Q: Should we split up? A: Yes, I think so. Let’s talk about it. Q: How do you feel after the diagnosis? A: I’m scared. I don’t know what to do. But I’m glad you asked. The truth is hidden in plain sight. If no one dares to ask, these questions stay unanswered. The trick is to accept the answer for what it is. Don’t be mad or angry at people who honestly answer your question. Be mad or angry for not asking sooner. If You Don’t Understand The Answer, Ask Again In the past, I would often gloss over an answer and pretend I understood it. Turns out that it was a bad strategy: In the best case, I would have missed out on a great opportunity to learn something new. In the worst case, I wouldn’t have a good question to follow up with. Funnily, I’m rarely alone with this problem. There’s usually more than one confused person in the room. So don’t be afraid to refine your question to clarify any misunderstandings. Q: “Can you explain that differently?” A: … Q: “Am I correct in understanding that you mean…?” A: … Q: “Can you give me an example?” A: … There’s an old Chinese proverb that goes like this: He who asks a question is a fool for five minutes; he who does not ask a question remains a fool forever. Good Observations Yield Good Questions I noticed that many people who ask great questions have exceptional observation skills. They notice things that others don’t. Q: “Why are you still working here?” A: “I need the money.” Q: “Tough luck. What’s for lunch?” A: … vs. Q: “Why are you still working here?” A: “I need the money.” Q: “I noticed that you have this book on your desk. What’s that about?” A: “Oh, that’s my passion! I’m reading up on sales because I want to start my own business.” Q: “Nice! 
What is it about sales that you find so interesting?” A: … See how this tiny observation about a book on the desk led to a much deeper conversation? Suddenly, you’re talking about someone’s passion and dreams. You might inspire them to have more conversations with customers during work time to get some practice. If you want to improve your observation skills, come prepared. Learn more about the person you’re talking to. What are they passionate about? Who’s their role model? What are they doing in their free time? What are they reading? Which projects are they working on? If you come prepared, you’ll have an easier time asking good questions. Context makes good questions obvious. The Best Follow-up To An Answer Is A Question Okay, you got an answer. Now what? Many people follow up with a statement about themselves. “Oh, I also like that!” “Yes, I also did that!” “Let me tell you about my experience!” “You should do this!” This shows that you’re not interested in the other person, only in yourself. Instead, what if you could only follow up with a question? Q: “What’s your favorite sport?” A: “Table tennis.” Q: “How did you get into that?” A: “My dad played it when I was young.” Q: “What’s your favorite memory of playing table tennis with your dad?” A: “We used to play in the basement. It was so much fun!” Q: “What made it so much fun?” A: “I don’t know. It was just the two of us. It was our thing.” Q: “What did you learn from your dad about table tennis?” A: “He taught me how to serve. He was really good at it.” Q: “What’s the most important thing about serving in table tennis?” A: “You have to hit the ball at the right angle. Otherwise, it’s easy to return.” Q: “What’s the most difficult angle to return?” A: “The one that goes straight to the corner. It’s hard to reach.” See how this conversation unfolded? It’s like peeling an onion. You get deeper and deeper into the topic. You learn more about the person. You learn more about the topic. Step by step, you tap into someone else’s hard-earned wisdom. Look at all the things you learned in such a short time! Their favorite sport, their childhood memories, their relationship with their dad, how to serve in table tennis, the most difficult angle to return a serve. If you had followed up with “I like that too,” you would have missed out on all of this and probably never learned about it. All just because we kept asking questions instead of making statements. Think about all the people you know. Who do you like to talk to? I bet it’s the people who ask you questions and listen to your answers instead of talking about themselves all the time. Interestingly, it’s entirely selfish to ask questions. You learn more about the other person than they learn about you. You’re in control of the conversation. Summary I’m still not good at asking questions. I wrote this mostly as practice for myself. Maybe it helps someone else too. Here’s a summary of what I wrote: Good Questions Are Open-ended Never Ask A Question If You Don’t Care About The Answer Good Questions Reveal Something About The Person Who Answers; Bad Questions About The Person Who Asks Good Questions Are Stacked On Top Of Each Other Good Questions Run Deep Let The Winners Run And Cut Your Losses Short Give People Space To Think Obvious Questions Can Be The Best Questions If You Don’t Understand The Answer, Ask Again Good Observations Yield Good Questions The Best Follow-up To An Answer Is A Question Don’t be mistaken! Asking good questions is hard work! 
You have to be present, you have to listen, you have to reflect. It helps to take mental notes while you’re talking to someone. After a while, you will get better at spotting the patterns. Asking good questions is a skill that can be learned and improved upon. Asking more questions can’t hurt along the way. So, how do you ask better questions? " }, { "title": "The Dying Web", "url": "https://endler.dev/2024/the-dying-web/", "body": "I look left and right, and I’m the only one who still uses Firefox. At conferences and in coworking spaces, it’s always the same scene: people using some flavor of Chrome. Sometimes it’s Brave, sometimes Chromium, most of the time it’s just Google Chrome. I find that hilariously appalling. An entire generation grew up with access to great free tools and open standards, which helped them jumpstart their careers and gave them access to excellent technology for free. Now, the world’s largest websites are owned by the same company, which also owns the world’s most popular browser and search engine. Coincidentally, they are also the world’s largest advertising company. And people are wondering why they can’t block ads on YouTube anymore. We gave it all away for nothing. Let me be the first to admit that I too am not without sin. There was a weak moment about 15 years ago when browser performance became so unbearable on anything other than Chrome that it forced my hand to make the switch. And yes, for a while, life was good and websites loaded quickly. Reluctantly, I made the switch back to Firefox after a while, because open standards and privacy were more important than a few milliseconds of loading time. I could still understand why people would use Chrome, but I was happy with my choice. Then Firefox Quantum came around, and I told all my fellow developer friends about it. To me, it was the best browser on the market, and I was proud to be a Firefox user again. It was fast, snappy, and had a great add-on ecosystem. To my surprise, nobody cared. Bad Habits Die Hard Maybe people stayed with Chrome out of habit. Performance and privacy aside, I just don’t know how people can cope with Chrome’s limited customizability. It’s hilarious to watch people with 200 tabs named “G”, “Y”, or “X” struggle to find that one document they opened a week ago. In comparison, vertical tabs on Firefox with add-ons like Sidebery make Chrome look like a toy. Anyhow, Chrome. There was a time when I tried to educate people on the negative effects of browser monoculture. Okay, my mum didn’t get it, but I was more disappointed by my fellow devs. Everyone took the easy route and happily stayed on Uncle Google’s lap. At this point, I neither have the willpower nor the energy to fight back; it’s hopeless. It’s probably easier to get blood from a stone than to convince someone to switch back to Firefox. It’s so easy to switch, you won’t even lose any open tabs! Nobody Forces You to Use Chrome True, but the issues don’t stop at my front door. As an outsider, I need to live with the consequences of browser monoculture every day. Quite a few websites are unusable by now because they got “optimized for Chrome.” Microsoft Teams, for example, and the list is long. These websites fail for no good reason. There are positive examples, too. Zencastr, for example, used to be broken on Firefox, but they fixed it. Update: Zencastr is still broken on Firefox. Thanks to Randell from Mozilla for pointing that out. Their support page states that Chrome, Edge, or Brave are required. 
They stopped supporting Firefox in February 2021 as per this blog post. There’s an open ticket in the Mozilla Bugzilla tracking this issue. It’s currently blocked on some other issues, but there’s progress being made. The WebCodecs API, which might be related to this problem, is currently in beta and progressing through Mozilla’s release process. While it’s disappointing that Zencastr doesn’t work on Firefox yet, it’s encouraging to see that Mozilla is actively working on resolving the underlying issues. I also use Chrome for online calls, because tools like Jitsi don’t work well on Firefox. Maybe it’s because of Firefox’s WebRTC support? Or, maybe it’s because of Chrome: Pop Quiz: If a website wants to play out of different speakers on your system, what permission must it have? 1. Speaker-selection permission 2. Microphone permission If you answered 2, then chances are you know your WebRTC stuff well, but you’re probably on a Chromium browser. How could Google get free rein? Because everyone and their car stopped testing their stuff anywhere else. If everyone tweaks their site for Chrome, well, of course the site will work just fine on Chrome! We find ways around Chrome’s weird quirks. More users join the bandwagon because stuff “just works” and the vicious cycle continues. I can’t blame them. It’s easier to ride a horse in the direction it is going. But at what cost? *Elrond voice*: We’ve been down this road before. (Okay, I was there.) We called it the Browser Wars: Netscape vs Internet Explorer. Netscape lost and Microsoft ruled over the web with an iron fist. It wasn’t fun. We had more hacks around browser limitations than actual website functionality. Parents put their kids through college by working around browser bugs for money. Microsoft tried really hard to make life as miserable as possible for everybody: Internet Explorer has introduced an array of proprietary extensions to many of the standards, including HTML, CSS, and the DOM. This has resulted in several web pages that appear broken in standards-compliant web browsers and has introduced the need for a “quirks mode” to allow for rendering improper elements meant for Internet Explorer in these other browsers. — Wikipedia Essentially, they broke the web and we all warmed our hands on the dumpster fire. All we got in return was quirks mode. Google is smarter! They break the web, too, but they make you stand inside the fire. Why should I care about a browser? They are all the same anyways. …says the developer who gets tracked by Google every waking moment. Source: https://www.skeletonclaw.com/image/710734055173472257 You see, Chrome is reeeeally good at marketing. They say all the right things: We’re fast! We’re open source! We have the latest features! What they don’t tell you is that they control the narrative of the World Wide Web. They make you feel guilty for using adblockers and add weird nonstandard browser features because they can. Lately, the uBlock Origin team just threw in the towel and stopped supporting Chrome. But did anyone decide to jump ship? I get the feeling that by now people turn a blind eye to Google’s evil practices. But shouldn’t Brave, Edge, Opera, or Vivaldi be sufficient? Unfortunately not. They all use the same browser engine under the hood. Browser makers make mistakes, so this engine is not perfect. If it contains a bug and there’s no competition, that bug becomes the standard. Alternative browser engines need to implement the bug as well to support websites which depend on it. 
I Use Safari Congratulations, you switched from a browser controlled by a 2 trillion dollar company to a browser controlled by a 3 trillion dollar company. Oh, and it doesn’t run on Windows or Linux. Both Apple and Google would throw you under the bus if it made them more profit. Oh, and did you know that Google paid Apple 20 billion dollars in 2022 to be the default search engine on Safari? What Can I Do? If you’ve made it this far, do yourself a favor and spend a few minutes trying Firefox. Who knows? You might just like it. Try Firefox today, please? " }, { "title": "How To Sell To Developers", "url": "https://endler.dev/2024/selling-to-developers/", "body": "One of the hardest challenges I know is how to sell to developers. This is NOT an article for developers. Today, I want to write for non-developers whose job it is to sell to developers. My goal is to help you understand how they think. Developers Hate Being Sold To We tend to be a skeptical bunch! We’re not reachable through Google ads because we use ad blockers. We expect software to be free and open source, so we don’t usually pay for it. We subscribe to a handful of services that provide a lot of value for very little cost. Everything else we build ourselves. See the problem? How do you sell to these people? The Developer’s Dilemma Strangely enough, when developers think about starting a business, the first products they consider are often those aimed at other developers, despite there being many easier markets to sell to! We love building things so much that we try to sell the things we build to people just like us. Here’s my advice: Don’t build a product for developers. Seriously, don’t build a product for developers. Since you’ll likely ignore the first two pieces of advice, at least learn how to market to developers effectively. What You Need to Know About Marketing to Developers Here’s the bitter truth. To sell to developers… You will have to be present in the places where they hang out. You will have to invest a lot of time in community building. You will have to be extremely patient. There is no shortcut. What if you don’t have the time or the patience? See rule 1 above. My Developer Product Journey So Far You might ask, “Why should I listen to you? Have you ever sold a product to developers?” I’ll let you be the judge of that. Here are a few products that I have built and sold to developers: codeprints Homepage of CodePrints with my GitHub profile as a print This was a fun experiment during the pandemic. We sold posters of GitHub timelines. Our marketing strategy included posts on Reddit and Hacker News, and we sent free prints to a few “devfluencers.” The novelty effect and viral marketing worked well. Eventually, we sold the company to a developer agency because we didn’t want to deal with the logistics. Read more about CodePrints in this blog post. Lychee Homepage of Lychee’s documentation page I built this tool for myself because I needed it. Lychee is a command-line tool that checks for broken links in markdown and HTML files. It has become quite popular on GitHub, with companies like Google and Amazon using it. However, I’ve never made any money from it. I reached out to some companies to sponsor the project, but it was very hard to get a response or explain why they should sponsor a free tool. I wrote about making money with Open Source here. Analysis Tools Homepage of analysis-tools.dev This is a directory of tools for developers. 
It’s a side project that I started in 2015 (did I mention you need to be patient?), and it has grown to be a popular resource. Together with two friends, we sell sponsorships and ads on the site, which provides a nice side income. I think this project works because it’s not a product for developers but a product for people who build products for developers. Our customers are mostly developer advocates, developer relations folks, and marketing people who want to reach developers. Sponsoring on Analysis Tools is extremely cheap compared to other marketing channels like Google ads. On Google, you can easily spend thousands of dollars per month trying to reach developers. On Analysis Tools, you can reach them for a fraction of that cost. The basic tier is $100 per month, and for that, you get your logo on the site and in every repo. Thousands of developers visit the site every month. It’s basically a no-brainer to sponsor the site if you build a linter, a static code analyzer, or any other tool that developers use. Tools page of analysis-tools.dev Think of it this way: when was the last time these companies had the chance to reach thousands of developers for $100? What is the total cost of acquisition for a developer? How much would you pay for a developer to try out your tool? And still… Only 20% Of Companies Immediately Get It From the companies I’ve talked to, only around 20% get it immediately. There are perhaps 30% that need a little handholding. The other 50% see it as an ad placement and want to know the click-through rate. Let me repeat: It’s not about the click-through rate. It’s about the branding and the reach. You will never reach a critical audience if you don’t start investing in the developer community. This is not a one-time thing. You need a lot of exposure until you get noticed and developers start talking about your product. If you try to fake it, you won’t get far. Devs will avoid your product like the plague and tell their friends to do the same. Your main constraint is the attention of developers! These folks constantly get bombarded with millions of products and don’t have time to evaluate them all. They stopped actively looking for new tools a long time ago. They rely on a handful of trusted sources they follow. Building Trust Is The Only Way Getting into their circle of trust is hard, and for that reason, you need to be patient and invest a lot of time into relationship building. If you can afford it, hire a developer advocate. A good one is heavily invested in open source and knows how to write for developers. At the very least, you need to be present in the places where developers hang out. So reach out to open source maintainers in your niche and see if you can collaborate or sponsor their projects. Don’t waste cash on Google ads or other traditional marketing channels. Go where the developers are. I hope this article helped someone who is struggling to sell to developers. Perhaps one of you will reach out to some open source maintainers in their niche and sponsor their projects. That would be a great outcome! There’s a plus side to all of this: once you understand that you need to invest in the developer community, you will have a competitive advantage because most other companies don’t get it. 
" }, { "title": "Cursed Rust: Printing Things The Wrong Way", "url": "https://endler.dev/2023/cursed-rust/", "body": " document.addEventListener( DOMContentLoaded , function() lightEmbedInit(); ); There is a famous story about a physicist during an exam at the University of Copenhagen. The candidate was asked to describe how to determine a skyscraper’s height using a barometer. The student suggested dangling the barometer from the building’s roof using a string and then measuring the length of the string plus the barometer’s height. Although technically correct, the examiners were not amused. After a complaint and a reevaluation, the student offered various physics-based solutions, ranging from dropping the barometer and calculating the building’s height using the time of fall, to using the proportion between the lengths of the building’s shadow and that of the barometer to calculate the building’s height from the height of the barometer. He even humorously suggested simply asking the caretaker in exchange for the barometer. The physicist, as the legend goes, was Niels Bohr, who went on to receive a Nobel Prize in 1922. This story is also known as the barometer question. Why Is This Story Interesting? The question and its possible answers have an important didactic side effect: they convey to the learner that one can also get to the solution with unconventional methods — and that these methods are often more interesting than the canonical solution because they reveal something about the problem itself. There is virtue in learning from unconventional answers to conventional questions. To some extent, this fosters new ways of thinking and problem-solving, which is an essential part of innovation. Applying The Same Principle To Learning Rust One of the first examples in any book on learning Rust is the “Hello, world!” program. fn main() println!("Hello, world!"); It’s an easy way to test that your Rust installation is working correctly. However, we can also have some fun and turn the task on its head: let’s find ways to print “Hello, world!” without using println!. Let’s try to come up with as many unconventional solutions as possible. The weirder, the better! As you go through each of the solutions below, try to understand why they work and what you can learn from them. This started as a meme, but I decided to turn it into a full article after the post got a lot of attention. It goes without saying that you should never use any of these solutions in production code. Check out this enterprise-ready version of hello world instead. Solution 1: Desugaring println! use std::io::Write; write!(std::io::stdout().lock(), "Hello, world!"); This solution is interesting, because it shows that println! is just a macro that expands to a call to write! with a newline character appended to the string. The real code is much weirder. Search for print in this file if you want to be amazed. write! itself desugars to a call to write_fmt, which is a method of the Write trait. There is a real-world use case for this: if you want to print things really fast, you can lock stdout once and then use write!. This avoids the overhead of locking stdout for each call to println!. See this article on how to write a very fast version of yes with this trick. Solution 2: Iterating Over Characters "Hello, world!".chars().for_each(|c| print!(" ", c)); This shows that you can implement println! using Rust’s powerful iterators. Here we iterate over the characters of the string and print each one of them. 
Solution 2: Iterating Over Characters "Hello, world!".chars().for_each(|c| print!("{}", c)); This shows that you can implement println! using Rust’s powerful iterators. Here we iterate over the characters of the string and print each one of them. chars() returns an iterator over Unicode scalar values. Learn more about iterators here. Solution 3: Impl Display struct HelloWorld; impl std::fmt::Display for HelloWorld { fn fmt(&self, f: &mut std::fmt::Formatter<'_>) -> std::fmt::Result { write!(f, "Hello, world!") } } println!("{}", HelloWorld); This teaches us a little bit about how traits work in Rust: We define a struct that implements the Display trait, which allows us to print it using println!. In general, Display is intended to make more complex types printable, but it is also possible to implement it for a hardcoded string! Solution 4: Who Needs Display? How about we create our own trait instead of using Display? trait Println { fn println(&self); } impl Println for &str { fn println(&self) { print!("{}", self); } } "Hello, world!".println(); We can exploit the fact that we can name our trait methods however we want. In this example, we choose println, making it look like it is part of the standard library. This completely turns the println! macro on its head. Instead of passing a string as an argument, we call a method on the string itself! Solution 5: Who Needs println! When You Got panic!? panic!("Hello, world!"); There are other ways to print things in Rust than using println!. In this case, we use panic!, which prints the string (as a side effect) and immediately terminates the program. It works as long as we only want to print a single string… Solution 6: I ♥ Closures (|s: &str| print!("{}", s))("hello"); Rust allows you to call a closure directly after its definition. The closure is an anonymous function that takes a string slice as an argument and prints it. In practice, this can be useful for defining a closure that is only used once and for which you don’t want to come up with a name. Solution 7: C Style extern crate libc; use libc::{c_char, c_int}; use core::ffi::CStr; extern "C" { fn printf(fmt: *const c_char, ...) -> c_int; } fn main() { const HI: &CStr = match CStr::from_bytes_until_nul(b"hello\n\0") { Ok(x) => x, Err(_) => panic!(), }; unsafe { printf(HI.as_ptr()); } } You don’t even need to use Rust’s standard library to print things! This example shows how to call the C standard library’s printf function from Rust. It’s unsafe because we are using a raw pointer to pass the string to the function. This teaches us a little bit about how FFI works in Rust. Credit goes to /u/pinespear on Reddit and @brk@infosec.exchange. Solution 8: C++ Style We’re well into psychopath territory now… so let’s not stop here. If you try extremely hard, you can bend Rust to your will and make it look like C++. use std::fmt::Display; use std::ops::Shl; #[allow(non_camel_case_types)] struct cout; #[allow(non_camel_case_types)] struct endl; impl<T: Display> Shl<T> for cout { type Output = cout; fn shl(self, data: T) -> Self::Output { print!("{}", data); cout } } impl Shl<endl> for cout { type Output = (); fn shl(self, _: endl) -> Self::Output { println!(""); } } cout << "Hello World" << endl; The Shl trait is used to implement the << operator. The cout struct implements Shl for any type that implements Display, which allows us to print any printable type. The endl struct implements Shl for cout, which prints the newline character at the end. Credit goes to Wisha Wanichwecharungruang for this solution. Solution 9: Unadulterated Control With Assembly All of these high-level abstractions stand in the way of printing things efficiently. We have to take back control of our CPU. Assembly is the way. No more wasted cycles.
No hidden instructions. Pure, unadulterated performance. use std::arch::asm; const SYS_WRITE: usize = 1; const STDOUT: usize = 1; fn main() { #[cfg(not(target_arch = "x86_64"))] panic!("This only works on x86_64 machines!"); let phrase = "Hello, world!"; let bytes_written: usize; unsafe { asm!( "syscall", inout("rax") SYS_WRITE => bytes_written, inout("rdi") STDOUT => _, in("rsi") phrase.as_ptr(), in("rdx") phrase.len(), /* syscall clobbers these */ out("rcx") _, out("r11") _, ); } assert_eq!(bytes_written, phrase.len()); } (Rust Playground) If you’re wondering why we use Rust in the first place if all we do is call assembly code, you’re missing the point! This is about way more than just printing things. It is about freedom! Don’t tell me how I should use my CPU. Okaaay, it only works on x86_64 machines, but that’s a small sacrifice to make for freedom. Submitted by isaacthefallenapple. Solution 10: “Blazing Fast” Why did we pay a premium for all those CPU cores if we aren’t actually using them? Wasn’t fearless concurrency one of Rust’s promises? Let’s put those cores to good use! use std::sync::{Arc, Mutex}; use std::thread; use std::time::Duration; fn main() { let phrase = "hello world"; let phrase = Arc::new(Mutex::new(phrase.chars().collect::<Vec<_>>())); let mut handles = vec![]; for i in 0..phrase.lock().unwrap().len() { let phrase = Arc::clone(&phrase); let handle = thread::spawn(move || { thread::sleep(Duration::from_millis(((i + 1) * 100) as u64)); print!("{}", phrase.lock().unwrap()[i]); }); handles.push(handle); } for handle in handles { handle.join().unwrap(); } println!(); } Here, each character is printed in a separate thread. The threads are spawned in a loop, and each thread sleeps for a certain number of milliseconds before printing its character. This uses the full power of your CPU to print a string! It might not always consistently print the characters in the right order (hey, scheduling is hard!), but that’s a worthwhile trade-off for all the raw performance gains. Your Turn! If you’ve got more solutions, please send me a message. Also, if you liked this article, you might also enjoy the yearly obfuscated C code contest. Check out the previous winners here. If you were actually more intrigued by the barometer story, read Surely You’re Joking, Mr. Feynman!, a book by Richard Feynman, another famous physicist and Nobel Prize winner, who was known for his unconventional way of thinking. We should all strive to think outside the box and come up with unconventional solutions to problems. Who knows, maybe that’s the key to a deeper understanding of the problem itself? " }, { "title": "Deploy Rust Code Faster", "url": "https://endler.dev/2023/move-fast-rust/", "body": "I’ve come a long way in my tech journey, from dealing with bare metal servers to exploring the world of cloud computing. Initially, it seemed so straightforward – spin up a server, deploy a container, and you’re done. But as I delved deeper, I realized that infrastructure is not as simple as it appears. Cloud providers offer a multitude of tools, each with its own learning curve: Google Cloud / AWS, Kubernetes, Helm, Docker, Terraform, GitHub Actions. If you’re adventurous, you might even venture into managed Kubernetes services like EKS or GKE. It’s tempting: with just a few clicks, your application is ready to roll. But the reality hits when you start juggling monitoring, logging, security, scaling, and more. Soon, you find yourself unintentionally leading a DevOps team instead of focusing on your product.
You hire more staff to manage infrastructure while your competitors are shipping features and growing their user base.

My Frustration

The cloud promised to make infrastructure easy, but the array of tools and services can be overwhelming. Even if you don’t use them all, you must be aware of their existence and learn the basics. The result? Your focus on the product diminishes. I appreciate dealing with infrastructure, but I also love delivering a product. Sadly, many companies waste precious time and money on infrastructure, repeating the same mistakes. What if there was a way to eliminate infrastructure concerns altogether?

The Allure of Serverless

Serverless architecture seems promising – no servers, no containers, just pure business logic. However, it’s not without challenges: cold start times, Lambda size limitations, memory issues, long-running processes, debugging complexities, and a lack of local testing. Serverless has its merits for certain use cases, but for larger applications, you might still need some servers.

Platform-As-A-Service (PaaS)

Platforms like Heroku and Netlify introduced a third option – managed services that handle all infrastructure for you. No more infrastructure concerns; you simply push code, and it deploys. What’s great about these solutions is their deep integration with specific programming language ecosystems. I was looking for a platform tailored for Rust developers, aiming to provide a top-notch developer experience. I wanted deep integration with the Rust ecosystem (serde, sqlx, axum, …). A while ago, I came across Shuttle while trying to find ways to make my Rust development workflow a bit smoother. It’s a tool that kind of just fits into the existing Rust ecosystem, letting you use cargo as you normally would, but with some of the infrastructural heavy lifting taken out of the picture. Now, it’s not a magic wand that solves all problems, but what I appreciate about Shuttle is its simplicity. You’re not thrown into a completely new environment with a steep learning curve. Instead, you stick to your Rust code, and Shuttle is there in the background, helping manage some of the server-side complexities. So, in essence, it’s about sticking to what you know, while maybe making life a tad easier when it comes to deployment and server management. It’s not about a revolutionary change in how you code, but more about a subtle shift in managing the background processes that can sometimes be a bit of a headache.

My Shuttle Experience So Far

So far, I have built two smaller Rust services with Shuttle: Zerocal and Readable. Shuttle takes your Rust code and, with very few annotations, deploys it to the cloud. The developer experience is pretty close to ideal given that provisioning and deployment are usually the most painful parts of building a service. Instead, it’s just a matter of adding a few lines of code. See for yourself: the boilerplate just vanishes. What’s left is the business logic.

Zerocal - Stateless Calendar Magic

Zerocal was the first project I deployed on Shuttle. The principle was very simple yet innovative: encode calendar data directly into a URL. This means creating an event was as straightforward as:

curl "https://zerocal.shuttleapp.rs?start=2023-11-04+20:00&duration=3h&title=Birthday&description=paaarty"

This would return an iCal file that you can add to your calendar.
Here’s how you create an event in the browser: I tried building this project on Shuttle when they were still fixing some things and changing their APIs here and there. Even with these small issues, it was a good experience. In just a few minutes, my app was up and running. Here’s the code to start the service, including the axum routes:

#[shuttle_runtime::main]
async fn axum() -> shuttle_axum::ShuttleAxum {
    // just normal axum routes
    let router = Router::new()
        .route("/", get(calendar))
        .route("/", post(calendar));
    Ok(router.into())
}

I don’t really need Zerocal for myself anymore, so I’m hoping someone else might want to take it over. I think it could be really useful for sharing invites on places like GitHub or Discord. If you want to know more about Zerocal, you can read this detailed breakdown. I would also like to mention that someone else built a similar project inspired by Zerocal: kiwi by Mahesh Sundaram, written in Deno. This is a really cool outcome.

A Reader Mode For My E-Reader

My appreciation for Firefox’s reader view sparked the creation of a Reader Mode Proxy for a minimalist, JavaScript-free web reading experience, particularly tailored for e-readers. The intention was to transform verbose websites into a more digestible format for distraction-free reading. This project deeply reflected my personal preferences, as I like simple apps that solve a problem. With just a sprinkle of annotations, my code adapted smoothly to Shuttle’s environment. Initially, I had my own local mode, which allowed me to run the app on my machine for testing, but I found no need to maintain that because Shuttle’s own local mode works just as well. While developing the app, there were some bumps along the road. Service downtimes required some code revamping. Yet, Shuttle’s evolution simplified parts of my process, especially when it introduced native static file handling. Before, it looked like this:

#[shuttle_runtime::main]
async fn axum() -> shuttle_axum::ShuttleAxum {
    let router = Router::new()
        // Previously, I needed to manually serve static files
        .route(
            "/static/Crimson.woff2",
            get(|| async {
                static_content(
                    include_bytes!("../static/fonts/Crimson.woff2"),
                    HeaderValue::from_static("font/woff2"),
                )
            }),
        )
        .route(
            "/static/JetBrainsMono.woff2",
            get(|| async {
                static_content(
                    include_bytes!("../static/fonts/JetBrainsMono.woff2"),
                    HeaderValue::from_static("font/woff2"),
                )
            }),
        )
        .fallback(readable);
    Ok(router.into())
}

Now it’s just:

#[shuttle_runtime::main]
async fn axum() -> shuttle_axum::ShuttleAxum {
    let router = Router::new()
        .nest_service("/static", ServeDir::new(PathBuf::from("static")))
        .fallback(readable);
    Ok(router.into())
}

To understand the intricacies of this project, here’s a more comprehensive look.

Control and Safety

Initially, I was concerned that annotating my code for infrastructure would cause vendor lock-in. I wanted to retain full control over my project. Want to move away? The Shuttle macros get rid of the boilerplate, so I could just remove the two annotations I’ve added and get the original code back. Shuttle’s code is also open source, so I could even set up my own self-hosted instance — although I wouldn’t want to.
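To make that concrete, here is a rough sketch of what removing the annotations would leave you with: a plain axum app with a hand-written main. (This is my own illustration against a recent axum version, not the actual code of either project.)

use axum::Router;

async fn readable() -> &'static str {
    // placeholder handler for illustration
    "a readable page"
}

#[tokio::main]
async fn main() {
    // The same router as before, minus the Shuttle macro...
    let router = Router::new().fallback(readable);
    // ...plus the server setup that Shuttle would otherwise handle for you.
    let listener = tokio::net::TcpListener::bind("0.0.0.0:8000").await.unwrap();
    axum::serve(listener, router).await.unwrap();
}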
The True Cost of DIY Infrastructure

Infrastructure may seem easy on the surface, but maintaining it involves various complexities and costs. Updates, deployments, availability – it can be overwhelming. Each hour spent on these tasks carries both a direct and an opportunity cost. Infrastructure can be a maze, and Shuttle seems to fit well for those working with Rust. I’m thinking of trying out a larger project on Shuttle soon, now that I’ve got a decent understanding of what Shuttle can and can’t do. If you’re considering giving it a shot, it’s wise to check their pricing to ensure it aligns with your needs. Be mindful of the real cost of infrastructure! As I’ve mentioned before, it’s not just server costs, but a lot more. The biggest factor will probably be human labor for maintaining and debugging infrastructure, and that is expensive. If I were to use infrastructure as code, I’d spend many hours setting it up and many more maintaining it, which adds up quickly at today’s salaries. Even if it was just for a hobby project, it would not be worth the trouble for me. I’d much rather work on features than the code that runs it all. " }, { "title": "Little Helpers", "url": "https://endler.dev/2023/helpers/", "body": "Yesterday I couldn’t help but feel a sense of awe at all the conveniences modern life has to offer. A lot of the chores in our household are taken care of by little helpers: The dishwasher washes the dishes, the washing machine washes the clothes, and the robot vacuum cleaner cleans the floors. The refrigerator keeps our food cold, the microwave heats it up, and the oven cooks it. We take all of this for granted because the devices rarely fail, but it’s really amazing when you think about it. It’s only been a few decades since much of this was tedious, time-consuming, manual labor. I heard stories about how people used to watch the washing machine do its thing, just because it was entertaining to see the machine do their work for them. Growing up in the 90s and early 2000s, I remember when “smart home” was a buzzword, and now it’s a reality. Smart devices control the thermostat and soon the lights and the door locks in our apartment. Of course there were a bunch of stupid ideas that didn’t work out along the way. I remember when they tried to sell those “smart” fridges that would run a web browser and let you order groceries from the fridge. Who would want to do that? It’s so much easier to just order groceries online from your phone or computer. On the other hand, of all the people I talked to, I’ve never met anyone who regrets buying a vacuum robot. We recently got a cat and quickly automated all the tedious stuff. The litter box cleans itself, there’s a water fountain that keeps the water fresh, and soon we’ll get a food dispenser. That means we have more time to focus on the fun stuff, like playing with the cat. And yes, I fully realize that this convenience comes from an incredible position of privilege. A privileged position that we should never take for granted! Instead, we should be grateful for the little helpers that make our lives easier and make them more accessible to everyone. " }, { "title": "A Reader Mode Proxy for the Slow Web", "url": "https://endler.dev/2022/readable/", "body": " Reader showing an article in light and dark mode. tl;dr: I built a service that takes any article and creates a pleasant-to-read, printable version. It is similar to Reader View in Firefox/Safari, but also works on older browsers, can be shared and has a focus on beautiful typography. Try it here. The web used to be such a fun place. Nowadays? Meh. Trackers, ads, bloat, fullscreen popups, autoplaying videos… it’s all so exhausting.
I just want to read long-form posts without distractions, with a good cup of tea, the cat sleeping on the windowsill, and some light snow falling in front of the window.

The Slow Web

I’m a big fan of the Slow Web movement and of little sites that do one thing well. For reading long-form text clutter-free, I use Reader View in Firefox, and while it doesn’t always work and it’s not the prettiest, I like it. There are reader modes in other browsers as well, but some of them — like Chrome — hide it behind a feature flag. Other browsers, like the one on my eBook reader, don’t come with a reader mode at all, which leaves me with a subpar and slow browsing experience on my main device used for reading. So I built a reader mode as a service with a focus on beautiful typography, which works across all browsers. It’s very basic, but I use it to read articles on my older devices, and it could also make content more accessible in regions with low bandwidth or while travelling.

Building It

Lately I saw a post about circumflex, a Hacker News terminal client. The tool did a solid job at rendering website content, and I wondered if I could retrofit that into a proxy server. The Golang cleanup code is here:

func GetArticle(url string, title string, width int, indentationSymbol string) (string, error) {
    articleInRawHTML, httpErr := readability.FromURL(url, 5*time.Second)
    if httpErr != nil {
        return "", fmt.Errorf("could not fetch url: %w", httpErr)
    }
    // ...
}

They use go-readability, a port of Mozilla’s Readability. The Rust equivalent is readability, and it’s simple enough to use:

use readability::extractor;

fn main() -> Result<(), Box<dyn std::error::Error>> {
    let response = extractor::scrape("https://endler.dev/2022/readable")?;
    println!("{}", response.content);
    Ok(())
}

Before we write a full proxy server, let’s write a simple CLI tool that takes a URL and outputs a clean, readable HTML file.

use readability::extractor;
use std::fs::File;
use std::io::Write;

fn main() -> Result<(), Box<dyn std::error::Error>> {
    // read the URL from the command line
    let url = std::env::args().nth(1).expect("Please provide a URL");
    let response = extractor::scrape(&url)?;
    let mut file = File::create("index.html")?;
    file.write_all(response.content.as_bytes())?;
    Ok(())
}

The output already looked surprisingly good. Next, I added a simple HTML template to wrap the response content.

<!DOCTYPE html>
<html lang="en">
  <head>
    <meta charset="UTF-8" />
    <title>Document</title>
    <link rel="stylesheet" href="yue.css" />
    <style type="text/css">
      body {
        margin: 0;
        padding: 0.4em 1em 6em;
        background: #fff;
      }
      .yue {
        max-width: 650px;
        margin: 0 auto;
      }
    </style>
  </head>
  <body>
    <div class="yue">{content}</div>
  </body>
</html>

No need to use a full-blown template engine for now; we can just use str::replace to replace the {content} placeholder with the actual content. 😉
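In code, that boils down to a one-liner. (A sketch: the template path is mine; the template itself is the HTML file from above.)

// Load the template at compile time and fill in the extracted article.
// The path is hypothetical; adjust it to wherever the template lives.
let template = include_str!("../static/template.html");
let html = template.replace("{content}", &response.content);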
Proxy Setup

The proxy setup is super simple with shuttle. It’s my second project after zerocal, which is hosted on shuttle, and I’m very happy with how smooth the process is. 🚀 Let’s call the app readable:

cargo shuttle init --axum --name readable

This creates a small Axum app with a simple hello world route.

Roadblock No. 1: reqwest

When I integrated the readability crate into the project, I hit a minor roadblock. I used extractor::scrape just like above, and the proxy started locally. However, when I wanted to fetch a website from the proxy, I got an error:

thread 'tokio-runtime-worker' panicked at 'Cannot drop a runtime in a context where blocking is not allowed. This happens when a runtime is dropped from within an asynchronous context.'

This meant that I started a runtime inside a runtime. After checking the source code of the readability crate, I found that it builds a reqwest::blocking::Client and uses that to fetch the URL. After that request, the client is dropped, which causes the runtime to be shut down. I fixed this by using a reqwest::Client instead of the reqwest::blocking::Client.

// Before:
let client = reqwest::blocking::Client::new();
// After:
let client = reqwest::Client::new();

Now I had the content of the article, but I still needed to pass it to readability. Fortunately, they provide a function named extractor::extract that takes something that implements Read and returns the extracted content. However, reqwest::Response doesn’t implement Read (in contrast to reqwest::blocking::Response), so I needed to convert it to a readable type myself. Luckily, reqwest::Response has a bytes method that returns a Bytes object, and a byte slice implements Read, so I can use it to call extractor::extract.

let bytes = client.get(&url).send().await?.bytes().await?;
let response = extractor::extract(&mut bytes.as_ref(), &url)?;

Roadblock No. 2: Routing

The app didn’t crash anymore, but I still didn’t get any response. My router looked like this:

#[shuttle_service::main]
async fn axum() -> shuttle_service::ShuttleAxum {
    let router = Router::new().route("/:url", get(readable));
    let sync_wrapper = SyncWrapper::new(router);
    Ok(sync_wrapper)
}

Turns out that when I use /:url as the route, it doesn’t match the path /https://example.com, because : matches only a single segment up to the first slash. The solution was to use /*url instead, which is a wildcard route that matches all segments until the end.
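For reference, the one-character fix would look roughly like this (my sketch of the change described above, not a verbatim quote from the project):

// Before: ":url" stops at the first slash, so "/https://example.com" fails
let router = Router::new().route("/:url", get(readable));
// After: "*url" is a wildcard that captures everything, slashes included
let router = Router::new().route("/*url", get(readable));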
Typography and Layout

New York Times website (left) vs reader mode (right)

For my first prototype, I used a CSS framework called yue.css because it was the first thing I found which looked nice. For the final version, I ended up mimicking the style of Ruud van Asseldonk’s blog because it always reminded me of reading a well-typeset book. For fonts, I chose two of my favorites: Crimson Pro for the body text and JetBrains Mono for the code. Both are licensed under the SIL Open Font License 1.1. You can even use readable from the terminal:

lynx https://readable.shuttleapp.rs/https://en.wikipedia.org/wiki/Alan_Turing

Caveats

The proxy is far from perfect. It’s something I built in a few hours for my personal use. It doesn’t always produce valid HTML. JavaScript is not executed, so some websites don’t work properly. Some might say that’s a feature, not a bug. 😉 That is also true for websites with sophisticated paywalls or bot-detection. A workaround would be to use a headless browser like ScrapingBee or Browserless, but I didn’t want to add that complexity to the project. The readability library takes a lot of freedom in formatting the document however it pleases. It can sometimes produce weird results. For example, it loves to mangle code blocks.

Credits

I was not the first person to build a readability proxy. I found out about readable-proxy when I did my research, but the project seems to be abandoned. Nevertheless, it was nice to see that others had the same need. Thanks to Ruud van Asseldonk for open sourcing his blog. 🙏 His writing and documentation are always a great source of inspiration to me.

Conclusion

The browser on my old Kobo eBook reader using the readability proxy. In times when the most popular browser might kill off ad blockers, a little service for reading articles without ads or tracking can come in handy. I’m not saying you should send all your traffic through it, but it’s a nice tool to have in your toolbox for a rainy day, a warm drink, and a great article. ☕ Feel free to deploy your own instance of readable or use the one I’m hosting. The source code is available on GitHub. Maybe one of you wants to help me maintain it. " }, { "title": "zerocal - A Serverless Calendar App in Rust Running on shuttle.rs", "url": "https://endler.dev/2022/zerocal/", "body": " Every once in a while my buddies and I meet for dinner. I value these evenings, but the worst part is scheduling these events! We send out a message to the group. We wait for a response. We decide on a date. Someone sends out a calendar invite. Things finally happen. None of that is fun except for the dinner. Being the reasonable person you are, you would think: “Why don’t you just use a scheduling app?”. I have tried many of them. None of them are any good. They are all… too much! Just let me send out an invite and whoever wants can show up. I don’t want to have to create an account for your calendar/scheduling/whatever app. I don’t want to have to add my friends. I don’t want to have to add my friends’ friends. I don’t want to have to add my friends’ friends’ friends. You get the idea: I just want to send out an invite and get no response from you. The nerdy, introvert engineer’s solution 💡 What we definitely need is yet another calendar app which allows us to create events and send out an invite with a link to that event! You probably didn’t see that coming now, did you? Oh, and I don’t want to use Google Calendar to create the event because I don’t trust them. Like any reasonable person, I wanted a way to create calendar entries from my terminal. That’s how I pitched the idea to my buddies last time. The answer was: “I don’t know, sounds like a solution in search of a problem.” But you know what they say: Never ask a starfish for directions.

Show, don’t tell

That night I went home and built a website that would create a calendar entry from GET parameters. It allows you to create a calendar event from the convenience of your command line:

> curl "https://zerocal.shuttleapp.rs?start=2022-11-04+20:00&duration=3h&title=Birthday&description=paaarty"
BEGIN:VCALENDAR
VERSION:2.0
PRODID:ICALENDAR-RS
CALSCALE:GREGORIAN
BEGIN:VEVENT
DTSTAMP:20221002T123149Z
CLASS:CONFIDENTIAL
DESCRIPTION:paaarty
DTEND:20221002T133149Z
DTSTART:20221002T123149Z
SUMMARY:Birthday
UID:c99dd4bb-5c35-4d61-9c46-7a471de0e7f4
END:VEVENT
END:VCALENDAR

You can then save that to a file and open it with your calendar app.

> curl "https://zerocal.shuttleapp.rs?start=2022-11-04+20:00&duration=3h&title=Birthday&description=paaarty" > birthday.ics
> open birthday.ics

In a sense, it’s a “serverless calendar app”, haha. There is no state on the server; it just generates a calendar event on the fly and returns it.

How I built it

You probably noticed that the URL contains “shuttleapp.rs”. That’s because I’m using shuttle.rs to host the website. Shuttle is a hosting service for Rust projects, and I had wanted to try it out for a long time.
To initialize the project using the awesome axum web framework, I’ve used:

cargo install cargo-shuttle
cargo shuttle init --axum --name zerocal

and I was greeted with everything I needed to get started:

use axum::{routing::get, Router};
use sync_wrapper::SyncWrapper;

async fn hello_world() -> &'static str {
    "Hello, world!"
}

#[shuttle_service::main]
async fn axum() -> shuttle_service::ShuttleAxum {
    let router = Router::new().route("/hello", get(hello_world));
    let sync_wrapper = SyncWrapper::new(router);
    Ok(sync_wrapper)
}

Let’s quickly commit the changes:

git add .gitignore Cargo.toml src/
git commit -m "Hello World"

To deploy the code, I needed to sign up for a shuttle account. This can be done over at https://www.shuttle.rs/login. It will ask you to authorize it to access your Github account. Then:

cargo shuttle login

and finally:

cargo shuttle deploy

Now let’s head over to zerocal.shuttleapp.rs: Hello World! Deploying the first version took less than 5 minutes. Neat! We’re all set for our custom calendar app.

Writing the app

To create the calendar event, I used the icalendar crate (shout out to hoodie for creating this nice library!). iCalendar is a standard for creating calendar events that is supported by most calendar apps.

cargo add icalendar
cargo add chrono # For date and time parsing

Let’s create a demo calendar event:

let event = Event::new()
    .summary("test event")
    .description("here I have something really important to do")
    .starts(Utc::now())
    .ends(Utc::now() + Duration::days(1))
    .done();

Simple enough.

How to return a file!?

Now that we have a calendar event, we need to return it to the user. But how do we return it as a file? There’s an example of how to return a file dynamically in axum here.

async fn calendar() -> impl IntoResponse {
    let ical = Calendar::new()
        .push(
            // add an event
            Event::new()
                .summary("It works! 😀")
                .description("Meeting with the Rust community")
                .starts(Utc::now() + Duration::hours(1))
                .ends(Utc::now() + Duration::hours(2))
                .done(),
        )
        .done();
    CalendarResponse(ical)
}

Some interesting things to note here: Every calendar file is a collection of events, so we wrap the event in a Calendar object, which represents the collection. impl IntoResponse is a trait that allows us to return any type that implements it. CalendarResponse is a newtype wrapper around Calendar that implements IntoResponse. Here is the CalendarResponse implementation:

/// Newtype wrapper around Calendar for `IntoResponse` impl
#[derive(Debug)]
pub struct CalendarResponse(pub Calendar);

impl IntoResponse for CalendarResponse {
    fn into_response(self) -> Response {
        let mut res = Response::new(boxed(Full::from(self.0.to_string())));
        res.headers_mut().insert(
            header::CONTENT_TYPE,
            HeaderValue::from_static("text/calendar"),
        );
        res
    }
}

We just create a new Response object and set the Content-Type header to the correct MIME type for iCalendar files: text/calendar. Then we return the response.

Add date parsing

This part is a bit hacky, so feel free to glance over it. We need to parse the date and duration from the query string. I used dateparser, because it supports sooo many different date formats.
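To get a feel for the two parsing crates before reading the full handler below, here is a tiny standalone sketch (my own example values, not code from the post):

fn main() {
    // dateparser accepts many common formats and returns a chrono DateTime<Utc>
    let start = dateparser::parse("2022-11-04 20:00").unwrap();
    // humantime turns strings like "3h" into a std::time::Duration
    let duration = humantime::parse_duration("3h").unwrap();
    // convert to a chrono::Duration so we can add it to the start time
    let end = start + chrono::Duration::from_std(duration).unwrap();
    println!("{start} -> {end}");
}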
async fn calendar(Query(params): Query<HashMap<String, String>>) -> impl IntoResponse {
    let mut event = Event::new();
    event.class(Class::Confidential);

    if let Some(title) = params.get("title") {
        event.summary(title);
    } else {
        event.summary(DEFAULT_EVENT_TITLE);
    }

    if let Some(description) = params.get("description") {
        event.description(description);
    } else {
        event.description("Powered by zerocal.shuttleapp.rs");
    }

    if let Some(start) = params.get("start") {
        let start = dateparser::parse(start).unwrap();
        event.starts(start);
        if let Some(duration) = params.get("duration") {
            let duration = humantime::parse_duration(duration).unwrap();
            let duration = chrono::Duration::from_std(duration).unwrap();
            event.ends(start + duration);
        }
    }

    if let Some(end) = params.get("end") {
        let end = dateparser::parse(end).unwrap();
        event.ends(end);
        if let Some(duration) = params.get("duration") {
            if params.get("start").is_none() {
                let duration = humantime::parse_duration(duration).unwrap();
                let duration = chrono::Duration::from_std(duration).unwrap();
                event.starts(end - duration);
            }
        }
    }

    let ical = Calendar::new().push(event.done()).done();
    CalendarResponse(ical)
}

Would be nice to support more date formats like now and tomorrow, but I’ll leave that for another time. Let’s test it:

> cargo shuttle run # This starts a local dev server
> curl "127.0.0.1:8000?start=2022-11-04+20:00&duration=3h&title=Birthday&description=Party"

🤖 bleep bloop, calendar file created

Nice, it works! Opening it in the browser creates a new event in the calendar: Of course, it also works on Chrome, but you do support the open web, right? And for all the odd people who don’t use a terminal to create a calendar event, let’s also add a form to the website.

Add a form

<form>
  <table>
    <tr>
      <td><label for="title">Event Title</label></td>
      <td><input type="text" id="title" name="title" value="Birthday" /></td>
    </tr>
    <tr>
      <td><label for="desc">Description</label></td>
      <td><input type="text" id="desc" name="desc" value="Party" /></td>
    </tr>
    <tr>
      <td><label for="start">Start</label></td>
      <td><input type="datetime-local" id="start" name="start" /></td>
    </tr>
    <tr>
      <td><label for="end">End</label></td>
      <td><input type="datetime-local" id="end" name="end" /></td>
    </tr>
  </table>
</form>

I modified the calendar function a bit to return the form if the query string is empty:

async fn calendar(Query(params): Query<HashMap<String, String>>) -> impl IntoResponse {
    // if query is empty, show the form
    if params.is_empty() {
        return Response::builder()
            .status(200)
            .body(boxed(Full::from(include_str!("../static/index.html"))))
            .unwrap();
    }
    // ...
}

After some more tweaking, we got ourselves a nice little form in all of its web 1.0 glory: The form And that’s it! We now have a little web app that can create calendar events. Well, almost. We still need to deploy it.

Deploying

cargo shuttle deploy

Right, that’s all. It’s that easy. Thanks to the folks over at shuttle.rs for making this possible. The calendar app is now available at zerocal.shuttleapp.rs. Now I can finally send my friends a link to a calendar event for our next pub crawl. They’ll surely appreciate it. From zero to calendar in 100 lines of Rust. Boy, it feels good to be writing some plain HTML again. Building little apps never gets old. Check out the source code on GitHub and help me make it better! 🙏 Here are some ideas: ✅ Add location support (e.g. location=Berlin or location=https://zoom.us/test). Thanks to sigaloid. Add support for more human-readable date formats (e.g. now, tomorrow). Add support for recurring events.
Add support for timezones. Add Google calendar short-links (https://calendar.google.com/calendar/render?action=TEMPLATE&dates=20221003T224500Z%2F20221003T224500Z&details=&location=&text=). Add example bash command to create a calendar event from the command line. Shorten the URL (e.g. zerocal.shuttleapp.rs/2022-11-04T20:00/3h/Birthday/Party)? Check out the issue tracker and feel free to open a PR! " }, { "title": "The Uber of Poland", "url": "https://endler.dev/2021/uber-of-poland/", "body": "A few years ago I visited a friend in Gdańsk, Poland. As we explored the city, one thing I noticed was that cabs were relatively expensive and there was no Uber. Instead, most (young) people used a community-organized service called Night Riders. I couldn’t find anything about that service on the web, so I decided to write about it to preserve its history. Delightfully Low-Tech What fascinated me about Night Riders was the way the service operated — completely via WhatsApp: you post a message in a group chat and one of the free riders would reply with a 👍 emoji. With that, your ride was scheduled. You’d pay through PayPal or cash. In these days of venture-backed startups that need millions in capital before they turn a profit, this approach is decidedly antagonistic. Basically, Night Riders built on top of existing infrastructure instead of maintaining their own ride-hailing platform, sign-up process, or even website. The service would grow solely by word of mouth. Using existing infrastructure meant that it was extremely cheap to run and there were almost zero upfront costs without a single line of code to write. It simply solved the customer’s problem in the most straightforward way possible. Of course, there are legal issues regarding data protection, labor law or payment processing, but the important bit is that they had paying customers from day one. The rest is easier to solve than a lack of product market fit. In Defense of Clones Uber and Lyft can’t be everywhere from the start. While they expand their businesses, others have the ability to outpace them. There’s an Uber clone in China (DiDi), one in Africa and the Middle East (Careem) and basically one for every country in the world. The tech industry rarely talks about these Ubers of X, but they serve millions of customers. While they start as exact copies of their well-known counterparts, some of them end up offering better service thanks to their understanding of the local market. People always find a way With creativity, you can provide great service even without a big budget. The important part is to know which corners you can cut while staying true to your mission. If there’s a market, there’s a way. The Cubans have a word for it: resolver, which means “we’ll figure it out”. " }, { "title": "How Does The Unix `history` Command Work?", "url": "https://endler.dev/2021/history/", "body": " Source: Cozy attic created by vectorpouch and tux created by catalyststuff — freepik.com As the day is winding down, I have a good hour just to myself. Perfect time to listen to some Billie Joel (it’s either Billie Joel or Billie Eilish for me these days) and learn how the Unix history command works. Life is good. Learning what makes Unix tick is a bit of a hobby of mine. I covered yes, ls, and cat before. Don’t judge. How does history even work? Every command is tracked, so I see the last few commands on my machine when I run history. 
❯❯❯ history
8680 cd endler.dev
8682 cd content/2021
8683 mkdir history
8684 cd history
8685 vim index.md

Yeah, but how does it do that? The manpage on my mac is not really helpful — and I couldn’t find much about it elsewhere either. I found this article (it’s good etiquette nowadays to warn you that this is a Medium link) and it describes a bit of what’s going on. Every command is stored in $HISTFILE, which points to ~/.zsh_history for me.

❯❯❯ tail $HISTFILE
: 1586007759:0;cd endler.dev
: 1586007763:0;cd content/2021
: 1586007771:0;mkdir history
: 1586007772:0;cd history
: 1586007777:0;vim index.md
...

So let’s see. We got a : followed by a timestamp, followed by :0, then a separator (;), and finally the command itself. Each new command gets appended to the end of the file. Not too hard to recreate. Hold on, what’s that 0 about!? It turns out it’s the command duration, and the entire thing is called the extended history format:

: <beginning time>:<elapsed seconds>;<command>

(Depending on your settings, your file might look different.)
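To make the format concrete, here is a quick sketch of how one might parse such a line in Rust (my own toy code, not part of the original experiment, and not robust):

/// Parse a zsh extended-history line like ": 1586007759:0;cd endler.dev"
/// into (start time, elapsed seconds, command).
fn parse_history_line(line: &str) -> Option<(u64, u64, &str)> {
    let rest = line.strip_prefix(": ")?;
    let (meta, command) = rest.split_once(';')?;
    let (start, elapsed) = meta.split_once(':')?;
    Some((start.parse().ok()?, elapsed.parse().ok()?, command))
}

fn main() {
    let parsed = parse_history_line(": 1586007759:0;cd endler.dev");
    assert_eq!(parsed, Some((1586007759, 0, "cd endler.dev")));
}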
Hooking into history

But still, how does history really work? It must run some code whenever I execute a command — a hook of some sort! 💥 Swoooooosh 💥 Matthias from the future steps out of a blinding ball of light: Waaait! That’s not really how it works! It turns out that shells like bash and zsh don’t actually call a hook for history. Why should they? When history is a shell builtin, they can just track the commands internally. Thankfully my editor-in-chief and resident Unix neckbeard Simon Brüggen explained that to me — but only after I sent him the first draft for this article. 😓 As such, the next section is a bit like Lord of the Rings: a sympathetic but naive fellow on a questionable mission with no clue of what he’s getting himself into. In my defense, Lord of the Rings is also enjoyed primarily for its entertainment value, not its historical accuracy… and just like in this epic story, I promise we’ll get to the bottom of things in the end. I found add-zsh-hook and a usage example in atuin’s source code. I might not fully comprehend all that is written there, but I’m a man of action, and I can take a solid piece of work and tear it apart. It’s not much, but here’s what I got:

# Source this in your ~/.zshrc
autoload -U add-zsh-hook

_past_preexec() {
    echo "preexec"
}

_past_precmd() {
    echo "precmd"
}

add-zsh-hook preexec _past_preexec
add-zsh-hook precmd _past_precmd

This sets up two hooks: the first one gets called right before a command gets executed and the second one directly after. (I decided to call my little history replacement past. I like short names.) Okay, let’s tell zsh to totally run this file whenever we execute a command:

source src/shell/past.zsh

…aaaaaand

❯❯❯ date
preexec
Fri May 28 18:53:55 CEST 2021
precmd

It works! ✨ How exciting! ✨ Actually, I just remembered that I did the same thing for my little environment settings manager envy over two years ago, but hey! So what to do with our newly acquired power?

Let’s Run Some Rust Code

Here’s the thing: only preexec gets the “real” command. precmd gets nothing:

_past_preexec() {
    echo "preexec $@"
}

_past_precmd() {
    echo "precmd $@"
}

$@ means “show me what you got”, and here’s what it got:

❯❯❯ date
preexec date date date
Fri May 28 19:02:11 CEST 2021
precmd

Shouldn’t one “date” be enough? Hum… let’s look at the zsh documentation for preexec: If the history mechanism is active […], the string that the user typed is passed as the first argument, otherwise it is an empty string. The actual command that will be executed (including expanded aliases) is passed in two different forms: the second argument is a single-line, size-limited version of the command (with things like function bodies elided); the third argument contains the full text that is being executed. I don’t know about you, but the third argument should be all we ever need? 🤨 Checking…

❯❯❯ ls -l
preexec ls -l lsd -l lsd -l

(Shout out to lsd, the next-gen ls command) Alright, good enough. Let’s parse $3 with some Rust code and write it to our own history file.

use std::env;
use std::error::Error;
use std::fs::OpenOptions;
use std::io::Write;

const HISTORY_FILE: &str = "lol";

fn main() -> Result<(), Box<dyn Error>> {
    let mut history = OpenOptions::new()
        .create(true)
        .append(true)
        .open(HISTORY_FILE)?;

    if let Some(command) = env::args().nth(3) {
        writeln!(history, "{}", command)?;
    };
    Ok(())
}

❯❯❯ cargo run -- dummy dummy hello
❯❯❯ cargo run -- dummy dummy world
❯❯❯ cat lol
hello
world

We’re almost done — at least if we’re willing to cheat a bit. 😏 Let’s hardcode that format string:

use std::env;
use std::error::Error;
use std::fs::OpenOptions;
use std::io::Write;
use std::time::SystemTime;

const HISTORY_FILE: &str = "lol";

fn timestamp() -> Result<u64, Box<dyn Error>> {
    let n = SystemTime::now().duration_since(SystemTime::UNIX_EPOCH)?;
    Ok(n.as_secs())
}

fn main() -> Result<(), Box<dyn Error>> {
    let mut history = OpenOptions::new()
        .create(true)
        .append(true)
        .open(HISTORY_FILE)?;

    if let Some(command) = env::args().nth(3) {
        writeln!(history, ": {}:0;{}", timestamp()?, command)?;
    };
    Ok(())
}

Now, if we squint a little, it sorta kinda writes our command in my history format. (That part about the Unix timestamp was taken straight from the docs. Zero regrets.) Remember when I said that precmd gets nothing? I lied. In reality, you can read the exit code of the executed command (from $?). That’s very helpful, but we just agree to ignore that and never talk about it again. With this out of the way, our final past.zsh hooks file looks like that:

autoload -U add-zsh-hook

_past_preexec() {
    past $@
}

add-zsh-hook preexec _past_preexec

Now here comes the dangerous part! Step back while I replace the original history command with my own. Never try this at home. (Actually I’m exaggerating a bit. Feel free to try it. Worst thing that will happen is that you’ll lose a bit of history, but don’t sue me.) First, let’s change the path to the history file to my real one:

// You should read the $HISTFILE env var instead ;)
const HISTORY_FILE: &str = "/Users/mendler/.zhistory";

Then let’s install past:

❯❯❯ cargo install --path .
# bleep bloop...

After that, it’s ready to use. Let’s add that bad boy to my ~/.zshrc:

source "/Users/mendler/Code/private/past/src/shell/past.zsh"

And FINALLY we can test it. We open a new shell and run a few commands followed by history:

❯❯❯ date
...
❯❯❯ ls
...
❯❯❯ it works
...
❯❯❯ history
1011 date
1012 ls
1013 it works

✨ Yay. ✨ The source code for past is on Github.

How it really really works

Our experiment was a great success, but I since learned that reality is a bit different. “In early versions of Unix the history command was a separate program”, but most modern shells have history builtin. zsh tracks the history in its main run loop. Here are the important bits. (Assume all types are in scope.)
Eprog prog;

/* Main zsh run loop */
for (;;) {
    /* Init history */
    hbegin(1);

    if (!(prog = parse_event(ENDINPUT))) {
        /* Couldn't parse command. Stop history */
        hend(NULL);
        continue;
    }

    /* Store command in history */
    if (hend(prog)) {
        LinkList args;
        args = newlinklist();
        addlinknode(args, hist_ring->node.nam);
        addlinknode(args, dupstring(getjobtext(prog, NULL)));
        addlinknode(args, cmdstr = getpermtext(prog, NULL, 0));

        /* Here's the preexec hook that we used.
         * It gets passed all the args we saw earlier. */
        callhookfunc("preexec", args, 1, NULL);
    }

    /* Main routine for executing a command */
    execode(prog);
}

The history lines are kept in a hash, and also in a ring-buffer to prevent the history from getting too big. (See here.) That’s smart! Without the ring-buffer, a malicious user could just thrash the history with random commands until a buffer overflow is triggered. I never thought of that.

History time (see what I did there?)

The original history command was added to the Unix C shell (csh) in 1978. Here’s a link to the paper by Bill Joy (hey, another Bill!). He took inspiration from the REDO command in Interlisp. You can find its specification in the original Interlisp manual in section 8.7.

Lessons learned

Rebuild what you don’t understand. The history file is human-readable and pretty straightforward. The history command is a shell builtin, but we can use hooks to write our own. Fun fact: Did you know that in zsh, history is actually just an alias for fc -l? More info here or check out the source code. “What I cannot create, I do not understand” — Richard Feynman " }, { "title": "Your First Business Should Be A Spreadsheet", "url": "https://endler.dev/2021/first-business/", "body": "One of the best decisions I made in 2020 was to open my calendar to everyone. People book appointments to chat about open-source projects, content creation, and business ideas. When we discuss business ideas, the conversation often leans towards problems suited for startups, such as using artificial intelligence to find clothes that fit or building a crowdfunding platform on the blockchain. While these are exciting ideas, they require significant persistence and deep pockets. It might be easier and less risky to join an existing startup in that domain. In reality, most people are simply looking for something cool to work on and to make their customers happy. It turns out you don’t need to run a startup to achieve that (and you probably shouldn’t). Instead, starting a side project is less risky and can organically grow into a business over time. Often, the solution is right in front of them: hidden within an Excel spreadsheet on their computer. I Hate Excel I spend as little time in Excel as possible, only engaging with it when absolutely necessary. My focus is on getting tasks done quickly, not on layout or design; I’d rather pay someone to do that work for me. And this is precisely my point! The spreadsheets and lists you create to solve your own problems can also solve someone else’s. This represents a business opportunity! This approach has several advantages: 💪 It solves a real problem. 🥱 It’s mundane, so people might pay to avoid doing it themselves. ⚡️ It wastes no time on design or infrastructure, embodying the ultimate MVP. 🐢 It’s low-tech: no programming required. You can start with Notion and Super.so. 🐜 It targets a niche market: if there were an established service, you’d already be using it. Big corporations won’t compete with you. 🚀 It allows you to spend less time building and more time engaging with potential customers.
Examples A few years ago, while researching static code analysis tools, I compiled a list, pushed it to GitHub, and moved on. Fast forward, and that side project now generates revenue from sponsors and consulting gigs. Another example is a person who created a spreadsheet for remote work locations, shared it on Twitter, and then developed a website from it. The website is NomadList, and its creator, Pieter Levels, now earns $300k/year. “Instead of building a site first, I simply made [a] public Google spreadsheet to collect the first data and see if there’d be interest for this at all.” — Pieter Levels on how he created NomadList. I’ve left a spot for your story here. Now, refine that spreadsheet (or list), share it with your friends, iterate based on their feedback, and build your first business. " }, { "title": "Starting A Print-On-Demand Business As A Software Engineer", "url": "https://endler.dev/2021/codeprints/", "body": "One day I had the idea to make a print of my Github timeline. I liked the thought of bringing something “virtual” into the real world. 😄 So I called up my friend Wolfgang and we built codeprints. It’s my first “physical” product, so I decided to share my learnings. Felix Krause of fastlane fame was one of our first customers and we are very thankful for this tweet promoting our service, which gave us a huge traffic boost. Launching Is Hard, So Launch Early Even though I knew that launching early was vital, I still didn’t want to “commit” to the final design shortly before the planned go-live. There was always that last bug to fix or that little extra feature to implement. For example, I wanted to offer two designs/layouts: the classic Github contribution timeline and a graph-based design for repositories. In cases like that, it helps to have a co-founder. Wolfgang convinced me that multiple layouts were not needed for the MVP and that whatever we’d come up with would probably be wrong anyway without getting early user feedback. He was right. Without Wolfgang, the shop would probably still not be live today. We have a much clearer vision now of what people want to see, thanks to launching early. Turns out users were not really interested in the graph-based design after all, and it would have been a waste of time to create it. Lesson learned: Even if you know all the rules for building products, it’s different when applying them in practice for the first time. We’ll probably never be completely happy with the shop functionality, but it’s better to launch early and make incremental improvements later. Software Development Is Easy When we started, my main concern was software development. The frontend and the backend needed to be coded and work together. We didn’t want to run into Github rate-limiting issues in case there were many users on the site. I was also thinking a lot about which web frontend to use. Should we build it in Rust using Yew or better go with Gatsby? Turns out writing the code is the easy part. Being software engineers, it didn’t take us too long to implement the backend API and we quickly found a decent template for the frontend. Most of our time was spent thinking about the product, the user experience, financing, taxes, the shipping process, marketing, and integrating customer feedback. These were all things I had (and still have) little experience in. Wolfgang suggested to “just use Shopify and the default template” to get started quickly. In hindsight, it was the absolute right decision. 
I always thought Shopify was for simple mom-and-pop stores, but it turns out it’s highly customizable, integrates well with pretty much anything, and offers excellent tooling like themekit. Payments, refunds, discounts, customer analytics: it’s all built into the platform. It saved us sooo much development time. Lesson learned: There are many unknown unknowns — things we are neither aware of nor understand — when starting a project. Try to get to the root of the problem as soon as possible to save time and avoid the sunk cost fallacy. Users Expect Great UI/UX Giants like Amazon, Facebook, and Netflix have raised customer expectations for great UX. They spend millions polishing their websites and getting every detail right. As a result, their sites work just right for millions of customers and on every device. An indie shop does not have these resources. Nevertheless, many customers expect the same quality user experience as on other sites they use. Being on the other side of the fence for the first time, I learned how hard it is to build a user interface that works for 90% of the people. Every little detail — like the order of form fields — makes a huge difference. Get too many details wrong, and you lose a customer. Those things can only be found by watching real users use your product. I promise you, it will be eye-opening! Lesson learned: Watch potential customers use your service. It will be painful at first, but will improve the quality of your product. Use standard frameworks for shops if you can because they get many UI/UX details right out of the box. WooCommerce or Shopify come to mind. Building Products Means Being Pragmatic We have many ideas for future products. Many friends and customers tell us about potential features all the time, but the problem is how to prioritize them. Most ideas won’t work at scale: It’s tricky to find a supplier that has a product on offer, is cheap, ships worldwide, and has a working integration with your shop-system. So we have to regularly scrap product ideas, simply because our suppliers’ support is not there. On top of that, we run the business next to our day job and other responsibilities, so we need to make use of our time as efficiently as possible. Lesson learned: Making services look effortless is hard work. Time is your biggest constraint. You’ll have to say “no” more often than you can say “yes”. Due to the pandemic, codeprints was entirely built remotely. More people should give whereby a try. Getting Traction As A Small Business It has never been easier to launch a shop. Services like Shopify, Stripe, and a host of suppliers make starting out a breeze. On the other hand, there is a lot more competition now that the barrier to entry is so low. Thousands of services are constantly competing for our attention. On top of that, most customers just default to big platforms like Amazon, AliExpress, or eBay for their shopping needs these days, and search engines send a big chunk of the traffic there. Since our product is custom-made, we can not offer it on those bigger platforms. As an indie shop, we get most visitors through word of mouth, exceptional customer support, and advertising where developers hang out: Twitter, Reddit, HackerNews, Lobste.rs, and friends. It’s essential to focus on providing value on those platforms; a plain marketing post won’t get you any attention. 
Other platforms like LinkedIn, Facebook, ProductHunt, or IndieHackers could also work, but our target audience (OSS developers with an active Github profile) doesn’t hang out there that much. Lesson learned: Always know where your customers are and understand their needs. Finding A Niche Is Only Half The Job Common market wisdom is to find a niche and grow from within. With codeprints we definitely found our niche: the audience is very narrow but interested in our geeky products. There are 56 million developers on Github today; that’s a big target audience. Most profiles are not very active, though. To make a print look attractive, you’d have to consistently commit code over a long period of time — many years. If we assume that only 1% of devs are active, that limits our target audience to 560.000 users. That’s still big, but a much smaller market. Now, if only 1% of these people find the shop and order something (which would be quite a good ratio), we’re looking at 5.600 orders total. Not that much! In order to extend that audience, one could either increase the number of potential customers or focus on getting more of the existing potential customers on the page. In our case, we expanded by offering a one-year layout, reducing the required level of Github activity for a cool print. We are also working on making emptier profiles look more interesting and highlighting the value-producing part of open source contribution. Every contribution counts — no matter how tiny. Lesson learned: Make sure that your niche market is not too narrow so that you can make a sustainable business out of it. Early adopters like Orta Therox are incredibly precious when starting out. Not everybody has a rockstar profile like that, though (and that’s fine). Make User Feedback Actionable Initial customer feedback is precious. You should focus on every word these customers say as they believe in your product and want you to win. (They voted with their wallet after all.) Feedback from friends is helpful, too, but I usually apply a bigger filter to that. Not all of my friends are software developers, and while they all mean well, what they tell me might be different from what they mean. It’s like they are asking for faster horses when what they really want is a car. Feedback on social media can be… snarky at times; be prepared for that! Your job is to find the grain of truth in every statement and focus on constructive advice. For example, take this feedback we got: How lazy can someone be to pay €36 for this. You could turn it around to make it constructive: Can I get a cheaper version to print myself? And that is some valuable feedback. We could provide a downloadable version in the future! Lesson learned: It takes practice to extract actionable feedback from user input and make it fit your product vision. Summary 2020 was a crazy year. I helped launch two small side-businesses, codeprints and analysis-tools.dev. Both have an entirely different revenue model, but they have one thing in common: they were super fun to build! 🤩 It’s motivating to look back at those achievements sometimes… That print of 2020 pretty much encapsulates those feelings for me. (Note the greener spots in August and September, which is when we launched analysis-tools, and the days in December when we built codeprints.) My coding year in review using our new vertical layout. Here’s to building more products in 2021. Let me know if you found that post helpful and reach out if you have questions.
Oh and if you’re looking for a unique way to decorate your home office, why not get your own print from codeprints? 😊 P.S.: If you’re a product owner and you’re looking for a unique present for your team, get in contact and be the first to get an invite to a private beta. " }, { "title": "So You Want To Earn Money With Open Source", "url": "https://endler.dev/2021/oss-money/", "body": "I earned 0 Euros from maintaining OSS software for years, and I thought that’s the way things are. I finally looked into ways to monetize my projects last year, and in this talk I want to share what I learned so far. It didn’t make me rich (yet!), but I built my first sustainable side-project with analysis-tools.dev ✨. I’ll talk about this and other projects and the mistakes I made on the road towards sustainability. Related links and resources: Podcast by Caleb Porzio about why building a business around Github sponsors is so hard. The Changelog Podcast — It’s OK to make money from your open source with Zeno Rocha. Nadia Eghbal talking about maintenance of our essential infrastructure. She also wrote a book on that topic called Working in Public. Generating income from open source by Vadim Demedes, which explains different ways to monetize your projects based on various real-world examples. Find a full transcript of the talk below. (Sorry for the wall of text.) This is my talk about earning money with Open Source, which I gave at the Web Engineering Meetup Aachen at the end of 2020. The organizers gladly allowed me to share it on my YouTube channel. I’m basically trying to answer the question: “Why am I not making 100k on Github?”. I’m talking about finding corporate sponsors for myself and the long road towards sustainability of open-source maintenance. You might not even want to start: this is a talk for those people who have the mindset that it’s probably not worth it to spend that much effort on Open Source if it takes so long until you find success. Now, this talk turned out to be a little grim. I had this very motivational talk in mind, but in reality, it’s hard, and by hard, I mean it’s really hard. I just want to get this point across and maybe still motivate you to do it, but first: why am I entitled to talk about this? I’ve been doing Open Source for over 10 years now. This is a talk dedicated to my former self maybe 15 years ago. I work at trivago, which is a hotel search company based in Düsseldorf. I have a blog at endler.dev. Like everyone and their mom, I also have a YouTube channel. It’s called Hello, Rust! and I’m extremely active with one video every two years. Hence, you definitely want to subscribe to not miss any updates. But today, I want to talk about Open Source, and I have a very sophisticated outline with two points: my journey and revenue models. Let’s go back all the way to 2010. The world definitely looked a bit different back then. Github in 2010 This was Github, and I was a bit late to the game. I joined in January 2010, and by then, Github was already two years old, so my username was taken. I usually go by the handle mre on platforms, and I noticed that this handle was not used by anyone, so I just sent a mail to support and asked if I could have it, and then I got an answer from this guy saying “go for it.” It was Chris Wanstrath, who goes by the handle defunkt, and he’s the former CEO of Github, and at this point in time, I was hooked. I really liked the platform.
I really liked how they worked very hands-on with Open Source. I used it for some projects of mine; you can see in the screenshot that I uploaded my blog, for example, because they host it for free. It was built with Jekyll, and you just push it to their site. Then they statically generate it, and it’s done. It goes without saying that nothing has changed in the last 10 years because my blog more or less still looks like that. It’s not built with jQuery and Jekyll anymore, but with zola and Cloudflare Worker Sites, but it’s more or less the same thing. For preparing for this talk, I wanted to take a step back and see where I was coming from and where I am right now, and probably the best way to do it is to look up some statistics and see if the number of repositories over time would give me some insights. So I queried the Github API for that. You can see it’s pretty much a linear graph from 2010 all the way to 2020. Except for 2018, where I reached peak productivity, it seems, but oh well. In the end, it’s more or less a linear thing, and you might say you put some work in you get some feedback out, but in reality, it’s different. There is a compound effect. If we look at my number of stars over time, you can see that more or less it started very slowly, and now it’s sort of growing exponentially, so right now, we are at 25.000 stars across all projects. Another way to look at it would be the number of followers. That’s kind of a new metric to me, but I did look up some statistics from archive.org (because Github doesn’t have that information through their API), and again, it’s more or less exponential growth. You put some work in, but you get a compound effect of your work plus some interest out. This is not luck; it’s work. It means you know what you’re doing. At the same time, there’s the elephant in the room, and that is it’s just a pat on the back. We have earned zero dollars until now, and one question you might have is how do you monetize this effort. First off, is it an effort? Well, I don’t know about you, but I probably spend two or three hours on average per day on Open Source: thinking about Open Source and creating new projects, but also maintaining and code review, so it really is work, and it’s a lot of work, and you more or less do that for free. There’s nothing wrong with doing things for free and doing it as a hobby, but in this case, you are supposed to be working on whatever you like. Open Source is not like that; sometimes you have obligations, and you feel responsible for maybe helping people out, which is a big part of it. You do that next to your regular work, so it can really be a burden. If you don’t know by now, making this somehow valuable is hard, it’s really hard. I want to talk about some ways to build a proper revenue model from Open Source. It goes without saying that this should probably not be your first focus if you saw the graphs before, but once you reach a point where you want to get some revenue, you have a couple of options. I don’t want to talk about doing Open Source as part of your business, and I don’t want to talk about bigger companies and more significant support here. I want to focus on a couple things that everyone can do. Sponsoring [on Github] is one. Offer paid learning materials on top of your normal documentation. For example, you might have a video series that you ask for money. Sell merchandising like Mozilla does. 
Consulting next to your Open Source business. Services and plugins, like writing an ADFS plugin or high-availability functionality, are very common examples of paid features targeting enterprises. But let’s start with the basics. Let’s start with point number one, sponsoring. There are two types of sponsoring: the first one is individual donations. Individual sponsoring is what Github Sponsors is all about. If you want to earn money [with that model], you have to think about the funnel, and you have to think about how you capture people’s attention and how you monetize that. It starts with a product, [which] can be anything. From there, you generate interest, and this interest creates an audience, and that audience eventually might pay for your service, and this is actually the entire secret. It’s how you earn money with any product, and with Open Source, if you want to attract sponsors, you build a product people want. If you transfer that to Open Source, the product is maybe a repository, and the stars indicate the interest of the audience. The audience itself is made out of followers (personal followers or followers of a company), and those followers might or might not become sponsors in the end. Now, I know stars are a terrible metric for popularity because some people use stars differently than others. For example, some use them as bookmarks to check out projects later, others want to thank the developers for maybe putting in a lot of effort, and so on, but it’s a good first estimation. Now, think about the following. Think about the number of stars I have and the followers and the number of sponsors. Think about my “funnel” right now. I told you that I have 25.000 stars and roughly 1000 followers, and out of those, I have three sponsors, so the ratio between stars and sponsors is roughly 0.01%. That looks pretty grim. It means you need around 8.000 stars to attract a single supporter. I was wondering: “maybe it’s just me?”. Maybe the top 1000 Github maintainers did not have that problem. Well, it turns out it’s exactly the same pattern. If you take the top 1000 Github maintainers and look at their sponsors, it’s again a pretty grim picture. For example, looking at the median, you see 3421 followers per person and a median of zero sponsors. That’s zero percent if my math is correct. If you look at the average, you even have 5430 followers (because Linus Torvalds pushes that number up) and 2.8 sponsors on average, which is roughly 0.05%. That’s a bit more than I have, but it’s in the same ballpark. Now think about this: Github has 40 million users, so that means the top 1000 maintainers make up 0.0025% of the entire community. The median income of those maintainers on Github is basically zero. That in and of itself is maybe not the biggest problem, but keep in mind that the Github revenue of 2019 was 300 million dollars. I read this comment on Hacker News yesterday: I have sponsors on Github and rake in a cool two dollars per month. It’s obviously less after taxes, so I have to have a day job. So this is clearly not working. You have to think of different ways to monetize Open Source, or you just wait until Github Sponsors becomes more popular – whatever happens first. One way I just want to quickly touch on is the notion of sponsorware. It’s kind of a new concept, and some people haven’t heard of it before. I honestly really like it. Generally speaking, you create a project, and you keep it private. 
You talk about it on Twitter or any other platform, and you say: “hey, I’m building this, but if you want early access, you have to become a sponsor,” and once you reach a certain threshold of sponsors, or income, or whatever, you make the project public. The initial example that I showed you, where someone was earning 100k on Open Source, is from someone doing just that. He’s building products and services, talks about them, and then makes them open for everyone in the end. This has some advantages: first off, you get early feedback from people who really believe in your mission. Second, you don’t have to work for free all the time, and third, you might also create an audience and hype from your projects. The disadvantage is that if you are a hardcore Open Source or free software believer, this goes against your ethics. You want the software to be open to begin with, without any additional requirements. So you really have to make up your own mind about that. I tried, and I have an early access program, which I only share with sponsors. [My first sponsorware was a] tool for getting Github statistics. [The statistics from this talk were] created with that tool. I think you need a big audience to pull that off. The question is whether you want to put that much effort in, or whether you just want to make it open in the first place and think about other revenue models. However, I still think it’s a very interesting concept, and we might see it [more] in the future, so now you know what it looks like, and you have a name for it. Another one is corporate sponsoring. This is a double-edged sword because corporate sponsoring means that a company gives you money and sometimes wants something. They might want additional support, or they want a bug to be fixed, and more or less it feels like you are somehow beginning to work for them. Nevertheless, those companies put quite a big amount of money into Open Source these days. Looking at two big companies, Facebook and Google, they invested 177k and 845k respectively into Open Source over their lifetime on Open Collective, a platform for collecting those donations. That’s really great. We need more companies doing that, but also, as a little side note and maybe as a little rant, I believe that those companies are doing way too little. Facebook’s revenue last year was 70 billion, and Google had 160 billion, which is nothing to be ashamed of, so I really wonder if this is the most they can do. Of course, Google, for example, also donated to other projects like Mozilla, and they also organize meetups and so on. But do you really think that Facebook and Google would exist today if there had been no Python or web server or Linux back in the day when two Stanford students tried to build a search engine? Sometimes I feel that Fortune 500 companies really don’t understand how much they depend on Open Source and how many people depend on the few people who maintain critical parts of our infrastructure. I don’t think they invest nearly enough into Open Source. What a lot of people think is that Open Source works like the panel on the left, where you have a full room of engineers trying to figure out the best way to build a project. In reality, it’s more or less someone working late at night to fix bugs and doing it because they believe in it. The public perception is probably wrong; it’s a really small group of people who maintain critical infrastructure. Sometimes that can lead to very tricky situations. 
Two of my childhood heroes talked about it openly: Kenneth Reitz is the core maintainer of requests for Python, and antirez is the creator of Redis, a key-value store. So one is from front-end development and the other from back-end. They both talk about burnout here, because the burden of becoming an Open Source maintainer on a big scale can very much and very quickly lead to burnout. The internet never sleeps. You never go to sleep. You always get a ticket, a feature request, a pull request, an issue. You always have something to work on, and on top of that, you have to handle all your other responsibilities, so that can lead to burnout really quickly. There was one guy who I also respect deeply. His name is Mark Pilgrim. He is the author of Dive Into Python, and he once pulled a 410 and deleted everything [about him] on the internet. There’s actually a term for it: infocide, for “information suicide.” He got fed up with the ecosystem, and if you think about the Ruby community, you might remember _why, the author of the Poignant Guide to Ruby. He did kind of the same thing. Focusing on what antirez has said: “once I started to receive money to work at Redis, it was no longer possible for my ethics to have my past pattern, so I started to force myself to work on the normal schedules. This, for me, is a huge struggle for many years. At this point, moreover, I’m sure I’m doing less than I could, because of that, but this is how things work”. It sounds like he feels guilty for being forced into that work schedule and maybe not performing well enough. To me, those are signs of burnout, and of that love-hate relationship between Open Source and money. If you accept money, it becomes a job, but you’re not writing code most of the time. You’re writing talks, reviewing pull requests, looking at issues, answering questions on StackOverflow, discussing on Discord, marketing on YouTube or at conferences. When you become popular with Open Source, it feels like you have a choice between two options: one is depression and the other one is burnout. If your project does not become successful, then suddenly you think you’re a failure, a mistake. It has zero stars; nobody likes it. But if it becomes a success, then everyone likes it, and you get hugged to death. That’s a really unfortunate situation to be in, and you don’t want to be overwhelmed by those responsibilities. You have to set clear boundaries and pick your poison. You have to be careful if you accept companies as sponsors. I want to show you one example of how it can work and [point out] some risks. Earlier this year, I started working on a real project that I had been putting off for many years. You see, in December 2015, I started a list of static analysis tools on Github. Static analysis tools are just tools that help you improve your code, and it turns out that there are a lot of those tools. Just starting to collect them was the first step. I didn’t think much about it, but over time that became really popular. And you can see that this graph shows a more or less linear increase in stars over time. In 2018, I started really thinking hard about whether there was more than just a Github project here. I told many people that I had this idea of building something more from it. It really took someone else to push me over the finish line and convince me that this was worth it, and that was Jakub. 
He said, “why not build a website from it?” and over the course of maybe two weekends or so, we built a website. It’s built with Gatsby, but it really doesn’t matter. We just did it, and then we saw what happened to it. We render 500 tools right now, and the initial feedback was really great. People really seem to like it. We got a cool 720.000 requests on the first day, and over the next week or so, it more or less hit 1.5 million. That was great because suddenly people started getting interested in the project. So we started finding some sponsors. Those companies are special because they believe in your mission, but they also know how Open Source works. They don’t really expect you to advertise their tool. They want to sell to developers, so they want to be in the developers’ minds, saying: “Hey! You are a developer. We built this amazing tool; you might want to check it out!” but they also get seen as an Open Source company. I think that’s a win-win. I have to say it doesn’t always go that easily. Sometimes companies expect you to just be cheap advertising space. Then they jump off the moment they see you don’t get that many clicks, but others understand that they invest in something that maybe pays off a year or two from now. So I’m really thankful that some companies understand that mission. However, what companies want is different from what individuals want. Companies want an invoice. Companies want something tax-deductible. Companies want someone who keeps the lights on and is responsive via email, so you really have those obligations, and one platform that helps with that is Open Collective. They have a 501(c)(6) program for Open Source projects that acts as a fiscal host, which means they will do all the invoicing and officially be the maintainers. If you, as an Open Source maintainer or a contributor to a project, want to get [reimbursed for your work], you have to send an invoice to Open Collective. I think that’s the best of both worlds. Again, because it’s a very transparent process, companies are in the loop but don’t have to deal with all the financial stuff. But it also means that you have to really polish your public perception. Companies really want to know what they can get out of sponsoring you, and you have to make that very clear. Probably the most important page that you have is not your website; it’s your sponsors page on Github, where you describe the different tiers and what those tiers mean. So we have three tiers: One is targeted at smaller companies and freelancers. They just get exposure, and they get seen as an Open Source friendly tech company. That is a hundred dollars a month. We have a middle tier, a company sponsor, for maybe a bigger company. They get the badge, too, but they also get a blog post about a static analysis tool that they want to promote, and we make it transparent that this really is sponsored content. Finally, if you want to go all the way, you go for full content creation, which might be a video workshop, but we don’t have video workshop sponsors yet, so I cannot talk about that yet. I’d really like to try it, though, and it’s really cheap for what you get. Anyway, those are things that you can do today. Without really changing how you work on Open Source, you can set that up, and you just see how it goes. Maybe no one reacts, and that’s fine. Everything else on that list is kind of advanced. You need an audience, and so you should start with that. 
Paid learning material is something that we are planning to do with analysis-tools in the future with a video course. There are companies like Tailwind that do this impressively well, so you can learn from them. For merchandising, you have to have a brand. Hence, it’s not something that I could do, but someone like Mozilla or the Coding Train on YouTube could definitely do something like that. Consulting is always an option. Still, it’s also a lot more work and probably takes you away from what you really love, so it really becomes a job. You have to think about whether you want to do that or not. Enterprise services are very advanced [and interesting] for maybe the one percent of projects that can be run as a business and where you have special requirements. I have to say: start from the top and work your way down. Start to create an audience. It’s probably easier to build an audience on Twitter and then funnel it back to Github than the other way around. Oh, by the way, did I tell you it’s hard? I really don’t want to end on a low note. I really want to emphasize that I would do all of that again if I started today. I think there’s no better time to contribute to Open Source than today. Tomorrow will probably be an even better time, because suddenly way more people are interested, it’s way easier to set up projects, and you have all those free tools like VSCode and Github Actions, plus free hosting. It’s just amazing how much you can pull off with very little money involved. So you can try it. What’s the worst thing that can happen? No one cares? Well, okay, then you’re as good as me. But I have some tips for you if you want to start today. My first tip is: “do your homework.” Many people start with learning, and then they build things, and then they close the circle, but there’s one key piece missing here. Some people hate the word, but you learn to love it eventually. It’s called marketing. Marketing means a lot of things to a lot of people, but what it means to me is getting the word out, because no one else will if you don’t, and you are awesome; you just have to realize that. Maybe not everyone knows [about your project] right away, so you should really talk about it more. Maybe at conferences, maybe on Twitter, maybe you can just tell your friends. Maybe you can ask people to contribute and to support you. Somehow it’s frowned upon in the community, as if doing marketing meant you weren’t doing it for real, but I think that’s not true. I think that if smart, patient, and passionate people did marketing, then the world would be a better place, because I’m pretty sure the evil guys do marketing. So do your homework, but rest assured that being an Open Source maintainer means running a business, and you are the product. You have to think about why someone would want to sponsor you, because if you don’t come up with an answer for that, how should they know? Also, think about the funnel. How will people find you, for example? The best way for people to find you is probably starting a YouTube channel. There are easier ways, though. [First,] you can always help out in a different project, and you don’t even have to be a coder. If you are good with design, then I can tell you there are so many Open Source projects that need designers. It’s crazy. Maybe start by creating a logo for a small project and start getting some visibility. Another one is having fun. 
If you know that earning money is hard in Open Source, then that can also be liberating because it means you can experiment and you can be creative, and yeah, having fun is the most important thing, I guess. Second, build things you love because it’s your free time in the end. The chances that someone will find the project are pretty low, so it better be something that you’re really interested in. If you don’t believe in it, just move on to the next thing. It’s fine to drop a project that you don’t believe in anymore. No one will hold you accountable for that unless they are jerks, and you don’t want to be surrounded by jerks. Third, find friendly people because you really grow with your community. You want people who support your project and maybe eventually become maintainers to ease the burden, and that takes a lot of time, sometimes years, until you find one maintainer, so always be friendly and try to see things from their perspective. Go the extra mile if you can. For example, reintegrate the master branch into their pull request. Just do it for them. Say thanks twice if you’re unsure. Fourth is to grow an audience. Radical marketing is one way, but being approachable and being inclusive is another. You want to be the guy or the girl that people go to when they have a tough question or want to know how to get into Open Source. You want to be the person who helps them out with their first pull request. They will pay it back a thousand times. The most exciting people I have met so far are available for questions, and they don’t really ask for anything in return. You hold them very close and dear to your heart. When the time comes, you will remember those people. You will say, “this is an amazing person to work with; I can highly recommend them,” which is called a lead. Finally, be in it for the long run. Good things take time. You see, it took me 10 years. Maybe it takes you five or maybe even less, but it’s probably not an overnight success. It’s really a long-term investment. " }, { "title": "My Blog Just Got Faster: Cloudflare Workers and AVIF Support", "url": "https://endler.dev/2020/perf/", "body": " Did I mention that this website is fast? Oh yeah, I did, multiple times. A few reasons (from ordinary to the first signs of creeping insanity): 📄 Static site ☁️ Cached on Cloudflare CDN 🔗 HTTP/2 and HTTP/3 support 🚫 No web fonts (sadly) ✅ Edge-worker powered analytics (no Google Analytics) 🌸 Avoiding JavaScript whenever possible; CSS covers 90% of my use-cases. 🖼️ Image width and height specified in HTML to avoid page reflows. 👍🏻 Inlined, optimized SVG graphics and hand-rolled CSS 🚅 Static WASM search (lazy loaded) 🏎️ The entire homepage is <10K (brotli-compressed), including graphics, so it should fit into the first HTTP round-trip. 💟 Heck, even the favicon is optimized for size. Update: I’m using an SVG icon now thanks to this article. Then again, it’s 2020: everyone is optimizing their favicons, right? …right!? Well, it turns out most other sites don’t think about their users’ data plans as much as I do. Actually, that’s an understatement: they don’t care at all. But to me, lean is beautiful! Wait, What About Images? I prefer SVG for diagrams and illustrations. Only if it’s a photo will I use JPEG or WebP. To be honest with you, I never really liked WebP. The gist is that it might not even be smaller than JPEGs compressed with MozJPEG. There is a lengthy debate on the Mozilla bug tracker if you want to read more. To this day, Safari doesn’t support WebP. 
Hello AVIF 👋 Meet AVIF, the new next-gen image compression format. Check this out: Source: ReachLightSpeed.com It’s already supported by Chrome 85 and Firefox 80. Then it hit me like a hurricane 🌪️: 😲 Holy smokes, AVIF is supported by major browsers now!? I want this for my blog! Yes and no. I’m using Zola for my blog, and AVIF support for Zola is not there yet, but I want it now! So I whipped up an ugly Rust script (as you do) that creates AVIF images from my old JPEG and PNG images. I keep the original raw files around just in case. Under the hood, it calls cavif by Kornel Lesiński. Data Savings The results of AVIF on the blog were nothing short of impressive: Total image size for endler.dev/2020/sponsors Check Your Browser But hold on for a sec… is your browser even capable of showing AVIF? If that reads “yup,” you’re all set. If that reads “nope,” then you have a few options: On Firefox: Open about:config from the address bar and search for avif. On Chrome: Make sure to update to the latest version. On Safari: I’m not sure what you’re doing with your life. Try a real browser instead. 😏 Workaround I: Fallback For Older Browsers HTML is great in that your browser ignores unknown new syntax. So I can use the <picture> element to serve the right format to you. (Look ma, no JavaScript!)
<picture>
  <source srcset="fancy_browser.avif" />
  <source srcset="decent_browser.webp" />
  <img src="meh_browser.jpg" />
</picture>
The real thing is a bit more convoluted, but you get the idea. Workaround II: Wrong Content-Type On Github Pages There was one ugly problem with Github and AVIF, though: Their server returned a Content-Type: application/octet-stream header. This meant that the images did not load on Firefox. There is no way to fix that on my side, as Github is hosting my page. Until now! I wanted to try Cloudflare’s Workers Sites for a long time, and this bug finally made me switch. Basically, I run the full website as an edge worker right on the CDN; no web server of my own is needed. What’s great about it is that the site is fast everywhere now — even in remote locations — no more roundtrips to a server. By running an edge worker, I also gained full control over the request and response objects. I added this gem of a snippet to intercept the worker response:
if (/\.avif$/.test(url)) {
  response.headers.set("Content-Type", "image/avif");
  response.headers.set("Content-Disposition", "inline");
}
And bam, Bob’s your uncle. Firefox is happy. You can read more about modifying response objects here. Another side-effect of Workers Sites is that a production deployment takes one minute now. Performance Results After Moving To Cloudflare Website response time before Source: KeyCDN Website response time after Source: KeyCDN Page size and rating before Source: Pingdom.com Page size and rating after Source: Pingdom.com I don’t have to hide from a comparison with well-known sites either: Comparison with some other blogs I read Source: Speedcurve Further reading How to Use AVIF: The New Next-Gen Image Compression Format — Nice introduction that highlights some common pitfalls when integrating AVIF. It inspired me to add AVIF support. AVIF has landed by Jake Archibald — Compares image sizes and qualities of different formats: SVG, JPEG, PNG, WebP, and AVIF. avif.io — Fast, configurable, client-side image compression that works on desktop and mobile. 
Squoosh — another image compression service built with WebAssembly that supports AVIF. Tons of great examples on how to configure Cloudflare Workers. Cloudflare Workers Sites. " }, { "title": "Launching a Side Project Backed by Github Sponsors", "url": "https://endler.dev/2020/sponsors/", "body": "Yesterday we launched analysis-tools.dev, and boy had I underestimated the response. It’s a side project about comparing static code analysis tools. Static analysis helps improve code quality by detecting bugs in source code without even running it. What’s best about the project is that it’s completely open-source. We wanted to build a product that wouldn’t depend on showing ads or tracking users. Instead, we were asking for sponsors on Github — that’s it. We learned a lot in the process, and if you’d like to do the same, keep reading! First, Some Stats Everyone likes business metrics. Here are some of ours: The project started as an awesome list on Github in December 2015. We’re currently listing 470 static analysis tools. Traffic grew continuously. Counting 7.5k stars and over 190 contributors at the moment. 500-1000 unique users per week. I had had the idea to build a website for years, but my coworker Jakub joined in May 2020 to finally make it a reality. Github stars over time. That graph screams BUSINESS OPPORTUNITY. Source: star-history.t9t.io “Why did it take five years to build a website!?”, I hear you ask. Because I thought the idea was so obvious that others must have tried before and failed. I put it off, even though nobody stepped in to fill this niche. I put it off, even though I kept the list up-to-date for five years, just to learn about the tools out there. You get the gist: don’t put things off for too long. When ideas sound obvious, it’s probably because they are. Revenue Model It took a while to figure out how to support the project financially. We knew what we didn’t want: an SEO landfill backed by AdWords. Neither did we want to “sell user data” to trackers. We owe it to the contributors on Github to keep all data free for everyone. How could we still build a service around it? Initially, we thought about swallowing the infrastructure costs ourselves, but then we’d have no incentive to maintain the site or extend it with new features. Github Sponsors was still quite new at that time. Yet, as soon as we realized that it was an option, it suddenly clicked: Companies that are not afraid of a comparison with the competition have an incentive to support an open platform that facilitates that. Furthermore, we could avoid bias and build a product that makes comparing objective and accessible. Sponsoring could be the antidote to soulless growth and instead allow us to build a lean, sustainable side business. We don’t expect analysis-tools.dev ever to be a full-time job. The market might be too small for that — and that’s fine. Tech Once we had a revenue model, we could focus on the tech. We’re both engineers, which helps with iterating quickly. Initially, I wanted to build something fancy with Yew. It’s a Rust/Webassembly framework and your boy likes Rust/Webassembly… I’m glad Jakub suggested something else: Gatsby. Now, let me be honest with you: I couldn’t care less about Gatsby. And that’s what I said to Jakub: “I couldn’t care less about Gatsby.” But that’s precisely the point: not being emotionally attached to something makes us focus on the job and not the tool. We get more stuff done! 
From there on, it was pretty much easy going: we used a starter template, Jakub showed me how the GraphQL integration worked, and we even got to use some Rust! The site runs on Cloudflare as an edge worker built on top of Rust. (Yeah, I cheated a bit.) Count to three, MVP! Finding Sponsors So we had our prototype but zero sponsors so far. What started now was (and still is) by far the hardest part: convincing people to support us. We were smart enough not to send cold e-mails because most companies ignore them. Instead, we turned to our network and realized that developers had reached out before to add their company’s projects to the old static analysis list on Github. These were the people we contacted first. We tried to keep the messages short and personal. What worked best was a medium-sized e-mail with some context and a reminder that they had contributed to the project before. We included a link to our sponsors page. Businesses want reliable partners and a reasonable value proposition, so a prerequisite is that the sponsors page has to be meticulously polished. Our Github Sponsors page Just like Star Wars Episode IX, we received mixed reviews: many people never replied, others passed the message on to their managers, who in turn never replied, while others again had no interest in sponsoring open-source projects in general. That’s all fair: people are busy, and sponsorware is quite a new concept. A little rant: I’m of the opinion that tech businesses don’t sponsor nearly enough compared to all the value they get from Open Source. Would your company exist if there hadn’t been a free operating system like Linux or a web server like Nginx or Apache when it was founded? There was, however, a rare breed of respondents who expressed interest but needed some guidance. For many, it is the first step towards sponsoring any developer through Github Sponsors / OpenCollective. It helped that we use OpenCollective as our fiscal host, which handles invoicing and donation transfers. Their docs helped us a lot when getting started. The task of finding sponsors is never done, but it was very reassuring to hear from DeepCode, an AI-based semantic analysis service, that they were willing to take a chance on us. Thanks to them, we could push the product over the finish line. Because of them, we can keep the site free for everybody. It also means the website is kept free from ads and trackers. In turn, DeepCode gets exposed to many great developers who care about code quality and might become loyal customers. Also, they get recognized as an open-source-friendly tech company, which is more important than ever if you’re trying to sell dev tools. Win-win! Marketing Jakub and I had both started businesses before, but this was the first truly open product we would build. Phase 1: Ship early 🚀 We decided on a soft launch: deploy the site as early as possible and let the crawlers index it. The fact that the page is statically rendered and follows some basic SEO guidelines sure helped with improving our search engine rankings over time. Phase 2: Ask for feedback from your target audience 💬 After we got some organic traffic and our first votes, we reached out to our developer friends to test the page and vote on tools they know and love. This served as an early validation, and we got some honest feedback, which helped us catch the most blatant flaws. Phase 3: Prepare announcement post 📝 We wrote a blog post which, even if clickbaity, got the job done: Static Analysis is Broken — Let’s Fix It! 
It pretty much captures our frustration with the space and why building an open platform is important. We could have done a better job explaining the technical differences between the various analysis tools, but that’s for another day. Phase 4: Announce on social media 🔥 Shortly before the official announcement, we noticed that the search functionality was broken (of course). Turns out, we hit the free quota limit on Algolia a biiit earlier than expected. 😅 No biggie: a quick exchange with Algolia’s customer support, and they moved us over to the open-source plan (which we didn’t know existed). We were back on track! Side note: Algolia customer support is top-notch. Responsive, tech-savvy, and helpful. Using Algolia turned out to be a great fit for our product. Response times are consistently in the low milliseconds, and the integration with Gatsby was quick and easy. We got quite a bit of buzz from that tweet: 63 retweets, 86 likes and counting Clearly, everyone knew that we were asking for support here, but we are thankful for every single person who liked and retweeted. It’s one of these situations where having a network of like-minded people can help. As soon as we were confident that the site wasn’t completely broken, we set off to announce it on Lobste.rs (2 downvotes), /r/SideProject (3 upvotes) and Hacker News (173 upvotes, 57 comments). Social media is kind of unpredictable. It helps to tailor the message to each audience and stay humble, though. The response from all of that marketing effort was nuts: Traffic on launch day Perhaps unsurprisingly, the Cloudflare edge workers didn’t break a sweat. Edge worker CPU time on Cloudflare My boss Xoan Vilas even did a quick performance analysis and he approved. (Thanks boss!) High fives all around! Now what? Of course, we’ll add new features; of course, we have more plans for the future, yada yada yada. Instead, let’s reflect on that milestone: a healthy little business with no ads or trackers, solely carried by sponsors. 🎉 Finally, I want you to look deep inside yourself and find your own little product to work on. It’s probably right in front of your nose, and like me, you’ve been putting it off for too long. Well, not anymore! The next success story is yours. So go out and build things. Oh wait! …before you leave, would you mind checking out analysis-tools.dev and smashing that upvote button for a few tools you like? Hey, and if you feel super generous today (or you have a fabulous employer that cares about open-source), why not check out our sponsorship page? Jakub and me in Vienna, Austria. I’m not actually that small. " }, { "title": "What Happened To Programming In The 2010s?", "url": "https://endler.dev/2020/review/", "body": "A while ago, I read an article titled “What Happened In The 2010s” by Fred Wilson. The post highlights key changes in technology and business during the last ten years. This inspired me to think about a much narrower topic: What Happened To Programming In The 2010s? 🚓 I probably forgot like 90% of what actually happened. Please don’t sue me. My goal is to reflect on the past so that you can better predict the future. Where To Start? From a mile-high perspective, programming is still the same as a decade ago: Punch program into editor Feed it to a compiler (or interpreter) Bleep Boop 🤖 Receive output But if we take a closer look, a lot has changed around us. Many things we take for granted today didn’t exist a decade ago. What Happened Before? 
Back in 2009, we wrote jQuery plugins, ran websites on shared hosting services, and uploaded content via FTP. Sometimes code was copy-pasted from dubious forums, tutorials on blogs, or even hand-transcribed from books. Stack Overflow (which launched on 15th of September 2008) was still in its infancy. Version control was done with CVS or SVN — or not at all. I signed up for Github on 3rd of January 2010. Nobody had even heard of a Raspberry Pi (which only got released in 2012). Source: xkcd #2324 An Explosion Of New Programming Languages The last decade saw the creation of a vast number of new and exciting programming languages. Crystal, Dart, Elixir, Elm, Go, Julia, Kotlin, Nim, Rust, Swift, and TypeScript all released their first stable version! Even more exciting: all of the above languages are developed in the open now, and the source code is freely available on Github. That means everyone can contribute to their development — a big testament to Open Source. Each of those languages introduced new ideas that were not widespread before: Strong Type Systems: Kotlin and Swift made optional/nullable types mainstream, TypeScript brought types to JavaScript, and algebraic datatypes are common in Kotlin, Swift, TypeScript, and Rust. Interoperability: Dart compiles to JavaScript, Elixir interfaces with Erlang, Kotlin with Java, and Swift with Objective-C. Better Performance: Go promoted Goroutines and channels for easier concurrency and impressed with a sub-millisecond Garbage Collector, while Rust avoids Garbage Collector overhead altogether thanks to ownership and borrowing. This is just a short list, but innovation in the programming language field has greatly accelerated. More Innovation in Older Languages Established languages didn’t stand still either. A few examples: C++ woke up from its long winter sleep and released C++11 after its last major release in 1998. It introduced numerous new features like lambdas, smart pointers, and range-based loops. At the beginning of the last decade, the latest PHP version was 5.3. We’re at 7.4 now. (We skipped 6.0, but I’m not ready to talk about it yet.) Along the way, it got over twice as fast. PHP is a truly modern programming language now with a thriving ecosystem. Heck, even Visual Basic has tuples now. (Sorry, I couldn’t resist.) Faster Release Cycles Most languages adopted a quicker release cycle. Here’s a list for some popular languages:
Language                   Current release cycle
C                          irregular
C#                         ~12 months
C++                        ~3 years
Go                         6 months
Java                       6 months
JavaScript (ECMAScript)    12 months
PHP                        12 months
Python                     12 months
Ruby                       12 months
Rust                       6 weeks (!)
Swift                      6 months
Visual Basic .NET          ~24 months
The Slow Death Of Null Close to the end of the last decade, in a talk from 25th of August 2009, Tony Hoare described the null pointer as his Billion Dollar Mistake. A study by the Chromium project found that 70% of their serious security bugs were memory safety problems (same for Microsoft). Fortunately, the notion that our memory safety problem isn’t caused by bad coders has finally gained some traction. Many mainstream languages embraced safer alternatives to null: nullable types, Option, and Result types. Languages like Haskell had these features before, but they only gained popularity in the 2010s. Revenge of the Type System Closely related is the debate about type systems. The past decade has seen type systems make their comeback on the big stage; TypeScript, Python, and PHP (just to name a few) started to embrace type systems. 
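As a small illustration of those safer alternatives to null, here is a minimal Rust sketch (my own example): the possible absence of a value is encoded in the type, and the compiler forces the caller to handle it before use.
// A value that may be absent is an Option<T>, not a possibly-null pointer.
fn find_user(id: u32) -> Option<&'static str> {
    match id {
        1 => Some("Alice"),
        2 => Some("Bob"),
        _ => None, // no user with this id
    }
}

fn main() {
    // The compiler won't let us use the result without handling both cases.
    match find_user(42) {
        Some(name) => println!("Found {name}"),
        None => println!("No such user"),
    }
}
The same idea appears as Optional in Swift, nullable types in Kotlin, and Maybe in Haskell: forgetting the "no value" case becomes a compile error instead of a runtime crash.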
The trend goes towards type inference: add types to make your intent clearer for other humans and in the face of ambiguity — otherwise, skip them. Java, C++, Go, Kotlin, Swift, and Rust are popular examples with type inference support. I can only speak for myself, but I think writing Java has become a lot more ergonomic in the last few years. Exponential Growth Of Libraries and Frameworks As of today, npm hosts 1,330,634 packages. That’s over a million packages that somebody else is maintaining for you. Add another 160,488 Ruby gems, 243,984 Python projects, and top it off with 42,547 Rust crates. Number of packages for popular programming languages. Don’t ask me what happened to npm in 2019. Source: Module Counts Of course, there’s the occasional leftpad, but it also means that we have to write less library code ourselves and can focus on business value instead. On the other hand, there are more potential points of failure, and auditing is difficult. There is also a large number of outdated packages. For a more in-depth discussion, I recommend the Census II report by the Linux Foundation & Harvard [PDF]. We also went a bit crazy on frontend frameworks: Angular in 2010, React in 2013, Vue in 2014, Svelte in 2016, …and soon Yew? No Free Lunch A review like this wouldn’t be complete without taking a peek at Moore’s Law. It has held up surprisingly well in the last decade: Source: Wikipedia There’s a catch, though. Looking at single-core performance, the curve is flattening: Source: Stanford University: The Future of Computing (video) The new transistors prophesied by Moore don’t make our CPUs faster but instead add other kinds of processing capabilities like more parallelism or hardware encryption. There is no free lunch anymore. Engineers have to find new ways of making their applications faster, e.g. by embracing concurrent execution. Callbacks, coroutines, and eventually async/await are becoming industry standards. GPUs (Graphics Processing Units) became very powerful, allowing for massively parallel computations, which caused a renaissance of Machine Learning for practical use-cases: Deep learning becomes feasible, which leads to machine learning becoming integral to many widely used software services and applications. — Timeline of Machine Learning on Wikipedia Compute is ubiquitous, so in most cases, energy efficiency plays a more prominent role now than raw performance (at least for consumer devices). Unlikely Twists Of Fate Microsoft is a cool kid now. It acquired Github, announced the Windows Subsystem for Linux (which should really be called Linux Subsystem for Windows), open sourced MS-DOS and .NET. Even the Microsoft Calculator is now open source. IBM acquired Red Hat. Linus Torvalds apologized for his behavior and took time off. Open source became the default for software development (?). Learnings If you’re now thinking: Matthias, you totally forgot X, then I brought that point home. This is not even close to everything that happened. You’d roughly need a decade to talk about all of it. Personally, I’m excited about the next ten years. Software is eating the world — at an ever-faster pace. " }, { "title": "Tips for Faster Rust Compile Times", "url": "https://endler.dev/2020/rust-compile-times/", "body": "When it comes to runtime performance, Rust is one of the fastest guns in the west. 🔫 It is on par with the likes of C and C++ and sometimes even surpasses those. Compile times, however? That’s another story. 
Below is a list of tips and tricks on how to make your Rust project compile faster today. They are roughly ordered by practicality, so start at the top and work your way down until you’re happy and your compiler goes brrrrrrr. Table of Contents
Update The Rust Compiler And Toolchain
Use Cargo Check Instead Of Cargo Build
Use Rust Analyzer Instead Of Rust Language Server
Remove Unused Dependencies
Update Remaining Dependencies
Replace Heavy Dependencies
Use Cargo Workspaces
Use Cargo Nextest For Faster Test Execution
Combine All Integration Tests In A Single Binary
Disable Unused Features Of Crate Dependencies
Use A Ramdisk For Compilation
Cache Dependencies With Sccache
Cranelift – The Alternative Rust Compiler
Switch To A Faster Linker
Faster Incremental Debug Builds On macOS
Tweak More Codegen Options / Compiler Flags
Profile Compile Times
Avoid Procedural Macro Crates
Compile On A Beefy Machine
Compile in the Cloud
Download All The Crates
Bonus: Speed Up Rust Docker Builds 🐳
Drastic Measures: Overclock Your CPU 🔥
Speeding Up Your CI Builds
Upstream Work
Help Others: Upload Leaner Crates For Faster Build Times
More Resources
What’s Next
Why Is Rust Compilation Slow? Wait a sec, slow in comparison to what? For instance, if you compare Rust with Go, the Go compiler is doing a lot less work in general. For example, it lacks support for generics and macros. On top of that, the Go compiler was built from scratch as a monolithic toolchain consisting of both the frontend and the backend (rather than relying on, say, LLVM to take over the backend part, which is the case for Rust or Swift). This has advantages (more flexibility when tweaking the entire compilation process, yay) and disadvantages (higher overall maintenance cost and fewer supported architectures). In general, comparing across different programming languages makes little sense, and overall, the Rust compiler is legitimately doing a great job. That said, above a certain project size, the compile times are… let’s just say they could be better. If you’d like to know what’s slowing down your builds, run cargo build --timings This will generate a report on how much time was spent on each step involved in compiling your program. Here’s the output: A run of cargo-timings Source: Mara Bos via Twitter Why Bother? According to the Rust 2019 survey, improving compile times is #4 on the Rust wishlist: Rust Survey results 2019. (Obligatory xkcd.) Compile-Time vs Runtime Performance As is often cautioned in debates among their designers, programming language design is full of tradeoffs. One of those fundamental tradeoffs is runtime performance vs. compile-time performance, and the Rust team nearly always (if not always) chose runtime over compile-time. — Brian Anderson Overall, there are a few features and design decisions that limit Rust compilation speed:
Macros: Code generation with macros can be quite expensive.
Type checking.
Monomorphization: this is the process of generating specialized versions of generic functions. E.g., a function that takes an Into<String> gets converted into one that takes a String and one that takes a &str (see the short example after this list).
LLVM: that’s the default compiler backend for Rust, where a lot of the heavy lifting (like code optimizations) takes place. LLVM is notorious for being slow.
Linking: Strictly speaking, this is not part of compiling but happens right after. It “connects” your Rust binary with the system libraries. cargo does not explicitly mark the linking step, so many people add it to the overall compilation time. 
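To see what the monomorphization item above means in practice, here is a tiny sketch: the compiler creates one specialized copy of a generic function per concrete type it is called with, and every copy must be type-checked, optimized, and code-generated.
// A generic function over anything convertible into a String.
fn greet(name: impl Into<String>) -> String {
    format!("Hello, {}!", name.into())
}

fn main() {
    // Two call sites with different concrete types mean the compiler
    // emits (and optimizes) two separate copies of `greet`:
    let a = greet(String::from("Ferris")); // instantiates greet::<String>
    let b = greet("Ferris");               // instantiates greet::<&str>
    println!("{a} {b}");
}
You can observe this duplication with the cargo-llvm-lines tool mentioned later in this post, which counts how many copies of each function end up in the LLVM IR.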
If you’re interested in all the gory details, check out this blog post by Brian Anderson. Update The Rust Compiler And Toolchain Making the Rust compiler faster is an ongoing process, and many fearless people are working on it. Thanks to their hard work, compiler speed has improved 30-40% across the board year-to-date, with some projects seeing up to 45%+ improvements. So make sure you use the latest Rust version: rustup update On top of that, Rust tracks compile-time regressions on a website dedicated to performance. Work is also put into optimizing the LLVM backend. Rumor has it that there’s still a lot of low-hanging fruit. 🍇 Use cargo check Instead Of cargo build Most of the time, you don’t even have to compile your project at all; you just want to know if you messed up somewhere. Whenever you can, skip compilation altogether. What you need instead is laser-fast code linting, type- and borrow-checking. For that, cargo has a special treat for you: ✨ cargo check ✨. Consider the differences in the number of instructions between cargo check on the left and cargo build (debug) in the middle. (Pay attention to the different scales.) Speedup factors: check 1, debug 5, opt 20 A sweet trick I use is to run it in the background with cargo watch. This way, it will run cargo check whenever you change a file. ⭐ Pro-tip: Use cargo watch -c to clear the screen before every run. Use Rust Analyzer Instead Of Rust Language Server (RLS) Another quick way to check if you set the codebase on fire is to use a “language server”. That’s basically a “linter as a service” that runs next to your editor. For a long time, the default choice here was RLS, but lately, folks have moved over to rust-analyzer, because it’s more feature-complete and way more snappy. It supports all major IDEs. Switching to that alone might save your day. Remove Unused Dependencies So let’s say you tried all of the above and find that compilation is still slow. What now? Dependencies sometimes become obsolete thanks to refactoring. From time to time, it helps to check if all of them are still needed to save compile time. If this is your own project (or a project you like to contribute to), do a quick check if you can toss anything with cargo-udeps: cargo install cargo-udeps && cargo +nightly udeps There is also a newer tool called cargo-machete, which does the same thing but does not require a nightly compiler. It also works better with workspaces. Update Remaining Dependencies Next, update your dependencies, because they themselves could have tidied up their dependency tree lately. Take a deep dive with cargo-outdated or cargo tree (built right into cargo itself) to find any outdated dependencies. On top of that, use cargo audit to get notified about any vulnerabilities which need to be addressed, or deprecated crates which need a replacement. Here’s a nice workflow that I learned from /u/oherrala on Reddit (summarized as a command sequence below): Run cargo update to update to the latest semver compatible version. Run cargo outdated -wR to find newer, possibly incompatible dependencies. Update those and fix code as needed. Find duplicate versions of a dependency and figure out where they come from: cargo tree --duplicate shows dependencies which come in multiple versions. (Thanks to /u/dbdr for pointing this out.) ⭐ Pro-tip: Step 3 is a great way to contribute back to the community! Clone the repository and execute steps 1 and 2. Finally, send a pull request to the maintainers. 
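For convenience, here is that dependency-hygiene workflow condensed into a shell session (assuming cargo-outdated and cargo-audit are installed):
cargo update            # bump everything to the latest semver-compatible version
cargo outdated -wR      # list newer, possibly incompatible root dependencies
cargo tree --duplicate  # show crates that are pulled in with multiple versions
cargo audit             # flag known vulnerabilities and deprecated crates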
Replace Heavy Dependencies From time to time, it helps to shop around for more lightweight alternatives to popular crates. Again, cargo tree is your friend here to help you understand which of your dependencies are quite heavy: they require many other crates, cause excessive network I/O, and slow down your build. Then search for lighter alternatives. Also, cargo-bloat has a --time flag that shows you the per-crate build time. Very handy! Here are a few examples: Using serde? Check out miniserde and maybe even nanoserde. reqwest is quite heavy. Maybe try attohttpc or ureq, which are more lightweight. tokio dragging you down? How about smol? (Edit: This won’t help much with build times. More info in this discussion on Reddit) Swap out clap with pico-args if you only need basic argument parsing. Here’s an example where switching crates reduced compile times from 2:22min to 26 seconds. Use Cargo Workspaces Cargo has that neat feature called workspaces, which allows you to split one big crate into multiple smaller ones. This code-splitting is great for avoiding repetitive compilation because only crates with changes have to be recompiled. Bigger projects like servo and vector are using workspaces heavily to slim down compile times. Learn more about workspaces here. Use Cargo Nextest For Faster Test Execution It’s nice that cargo comes with its own little test runner, but especially if you have to build multiple test binaries, cargo nextest can be up to 60% faster than cargo test thanks to its parallel execution model. Here are some quick benchmarks:
Project         cargo test (s)   nextest (s)   Difference
meilisearch     41.04            20.62         -49.8%
rust-analyzer   6.76             5.23          -22.6%
tokio           27.16            11.72         -56.8%
You can try it with:
cargo install cargo-nextest
cargo nextest run
Combine All Integration Tests In A Single Binary Have any integration tests? (These are the ones in your tests folder.) Did you know that the Rust compiler will create a binary for every single one of them? And every binary will have to be linked individually. This can take most of your build time because linking is slooow. 🐢 The reason is that many system linkers (like ld) are single-threaded. 👨‍🍳️💡‍️ A linker is a tool that combines the output of a compiler and mashes that into one executable you can run. To make the linker’s job a little easier, you can put all your tests in one crate. (Basically, create a main.rs in your test folder and add your test files as mod in there.) Then the linker will go ahead and build a single binary only. Sounds nice, but careful: it’s still a trade-off, as you’ll need to expose your internal types and functions (i.e. make them pub). Might be worth a try, though, because a recent benchmark revealed a 1.9x speedup for one project. This tip was brought to you by Luca Palmieri, Lucio Franco, and Azriel Hoh. Thanks! Disable Unused Features Of Crate Dependencies Check the feature flags of your dependencies. A lot of library maintainers take the effort to split their crate into separate features that can be toggled off on demand. Maybe you don’t need all the default functionality from every crate? For example, tokio has a ton of features that you can disable if not needed (see the Cargo.toml sketch below). Another example is bindgen, which enables clap support by default for its binary usage. This isn’t needed for library usage, which is the common use-case. Disabling that feature improved compile time of rust-rocksdb by ~13s and ~9s for debug and release builds respectively. Thanks to reader Lilian Anatolie Moraru for mentioning this. 
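As a minimal sketch of how turning features off looks in Cargo.toml (the concrete feature names here are just examples; check the crate's documentation for what it actually offers):
[dependencies]
# Opt out of tokio's default features, then re-enable only what you need.
# "rt" and "macros" are examples; pick the features your code actually uses.
tokio = { version = "1", default-features = false, features = ["rt", "macros"] }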
⚠️ Fair warning: it seems that switching off features doesn’t always improve compile time. (See tikv’s experiences here.) It may still be a good idea for improving security by reducing the code’s attack surface. A quick way to list all features of a crate is cargo-feature-set. As of recently, you also get a list of a crate’s features when adding it with cargo add. If you want to look up the feature flags of a crate, they are listed on docs.rs. E.g. check out tokio’s feature flags. Use A Ramdisk For Compilation 💾 Skip this tip if you’re using an SSD. When I started to compile heavy projects, I noticed that I was throttled on I/O. The reason was that I kept my projects on a measly HDD. A more performant alternative would be an SSD, but if that’s not an option, don’t throw in the towel just yet. Ramdisks to the rescue! These are like “virtual hard disks” that live in system memory. User moschroe_de shared the following snippet over on Reddit, which creates a ramdisk for your current Rust project (on Linux): mkdir -p target && sudo mount -t tmpfs none ./target && cat /proc/mounts | grep "$(pwd)" | sudo tee -a /etc/fstab On macOS, you could probably do something similar with this script. I haven’t tried that myself, though. Cache Dependencies With sccache Another neat project is sccache by Mozilla, which caches compiled crates to avoid repeated compilation. I had this running on my laptop for a while, but the benefit was rather negligible, to be honest. It works best if you work on a lot of independent projects that share dependencies (in the same version). A common use-case is shared build servers. Cranelift – The Alternative Rust Compiler Lately, I was excited to hear that the Rust project is using an alternative compiler that runs in parallel with rustc for every CI build: Cranelift, also called CG_CLIF. Here is a comparison between rustc and Cranelift for some popular crates (blue means better): LLVM compile time comparison between rustc and cranelift in favor of cranelift Somewhat incredulous, I tried to compile vector with both compilers. The results were astonishing:
Rustc: 5m 45s
Cranelift: 3m 13s
I could really notice the difference! What’s cool about this is that it creates fully working executable binaries. They won’t be optimized as much, but they are great for local development. A more detailed write-up is on Jason Williams’ page, and the project code is on Github. Switch To A Faster Linker
🐧 Linux users: Try mold
🍎 Apple users: Try zld
🪟 Windows users: 🤷
The thing that nobody seems to target is linking time. For me, when using something with a big dependency tree like Amethyst, for example linking time on my fairly recent Ryzen 7 1700 is ~10s each time, even if I change only some minute detail only in my code. — /u/Almindor on Reddit You can check how long your linker takes by running the following commands:
cargo clean
cargo +nightly rustc --bin <your_binary_name> -- -Z time-passes
It will output the timings of each step, including link time:
...
time: 0.000  llvm_dump_timing_file
time: 0.001  serialize_work_products
time: 0.002  incr_comp_finalize_session_directory
time: 0.004  link_binary_check_files_are_writeable
time: 0.614  run_linker
time: 0.000  link_binary_remove_temps
time: 0.620  link_binary
time: 0.622  link_crate
time: 0.757  link
time: 3.836  total
Finished dev [unoptimized + debuginfo] target(s) in 42.75s
If the link steps account for a big percentage of the build time, consider switching over to a different linker. There are quite a few options. 
According to the official documentation, “LLD is a linker from the LLVM project that is a drop-in replacement for system linkers and runs much faster than them. [..] When you link a large program on a multicore machine, you can expect that LLD runs more than twice as fast as the GNU gold linker. Your mileage may vary, though.” If you’re on Linux, you can switch to lld like so:
[target.x86_64-unknown-linux-gnu]
rustflags = ["-C", "link-arg=-fuse-ld=lld"]
A word of caution: lld might not be working on all platforms yet. At least on macOS, Rust support seems to be broken at the moment, and the work on fixing it has stalled (see rust-lang/rust#39915). Update: I recently learned about another linker called mold, which claims a massive 12x performance bump over lld. Compared to GNU gold, it’s said to be more than 50x. Would be great if anyone could verify and send me a message. Update II: Aaand another one called zld, which is a drop-in replacement for Apple’s ld linker and is targeting debug builds. [Source] Update III: zld is deprecated. The author recommends using lld instead. You can read up on the backstory here. The zld benchmarks are quite impressive. Which one you want to choose depends on your requirements. Which platforms do you need to support? Is it just for local testing or for production usage? mold is optimized for Linux, zld only works on macOS. For production use, lld might be the most mature option. Faster Incremental Debug Builds On macOS Rust 1.51 added an interesting flag for faster incremental debug builds on macOS. It can make debug builds multiple seconds faster (depending on your use-case). Just add this to your Cargo.toml:
[profile.dev]
split-debuginfo = "unpacked"
Some engineers report that this flag alone reduces compilation times on macOS by 70%. The flag might become the standard for macOS soon. It is already the default on nightly. Tweak More Codegen Options / Compiler Flags Rust comes with a huge set of settings for code generation. It can help to look through the list and tweak the parameters for your project. There are many gems in the full list of codegen options. For inspiration, here’s bevy’s config for faster compilation. Profile Compile Times If you’d like to dig deeper than cargo build --timings, Rust compilation can be profiled with cargo rustc -- -Zself-profile. 
The resulting trace file can be visualized with a flamegraph or the Chromium profiler: Image of Chrome profiler with all crates Source: Rust Lang Blog Another golden one is cargo-llvm-lines, which shows the number of lines generated and objects copied in the LLVM backend:

$ cargo llvm-lines | head -20
  Lines        Copies       Function name
  -----        ------       -------------
  30737 (100%)  1107 (100%) (TOTAL)
   1395 (4.5%)    83 (7.5%) core::ptr::drop_in_place
    760 (2.5%)     2 (0.2%) alloc::slice::merge_sort
    734 (2.4%)     2 (0.2%) alloc::raw_vec::RawVec<T,A>::reserve_internal
    666 (2.2%)     1 (0.1%) cargo_llvm_lines::count_lines
    490 (1.6%)     1 (0.1%) <std::process::Command as cargo_llvm_lines::PipeTo>::pipe_to
    476 (1.5%)     6 (0.5%) core::result::Result<T,E>::map
    440 (1.4%)     1 (0.1%) cargo_llvm_lines::read_llvm_ir
    422 (1.4%)     2 (0.2%) alloc::slice::merge
    399 (1.3%)     4 (0.4%) alloc::vec::Vec<T>::extend_desugared
    388 (1.3%)     2 (0.2%) alloc::slice::insert_head
    366 (1.2%)     5 (0.5%) core::option::Option<T>::map
    304 (1.0%)     6 (0.5%) alloc::alloc::box_free
    296 (1.0%)     4 (0.4%) core::result::Result<T,E>::map_err
    295 (1.0%)     1 (0.1%) cargo_llvm_lines::wrap_args
    291 (0.9%)     1 (0.1%) core::char::methods::<impl char>::encode_utf8
    286 (0.9%)     1 (0.1%) cargo_llvm_lines::run_cargo_rustc
    284 (0.9%)     4 (0.4%) core::option::Option<T>::ok_or_else

Avoid Procedural Macro Crates Procedural macros are the hot sauce of Rust development: they burn through CPU cycles, so use them with care (keyword: monomorphization). Update: Over on Twitter Manish pointed out that “the reason proc macros are slow is that the (excellent) proc macro infrastructure – syn and friends – are slow to compile. Using proc macros themselves does not have a huge impact on compile times.” (This might change in the future.) Manish goes on to say: This basically means that if you use one proc macro, the marginal compile time cost of adding additional proc macros is insignificant. A lot of people end up needing serde in their deptree anyway, so if you are forced to use serde, you should not care about proc macros. If you are not forced to use serde, one thing a lot of folks do is have serde be an optional dependency so that their types are still serializable if necessary (a sketch of this pattern follows at the end of this section). If you heavily use procedural macros in your project (e.g., if you use serde), it might be worth it to play around with opt-levels in your Cargo.toml.

[profile.dev.build-override]
opt-level = 3

As reader jfmontanaro mentioned on Github: I think the reason it helps with build times is because it only applies to build scripts and proc-macros. Build scripts and proc-macros are unique because during a normal build, they are not only compiled but also executed (and in the case of proc-macros, they can be executed repeatedly). When your project uses a lot of proc-macros, optimizing the macros themselves can in theory save a lot of time. Another approach is to try and sidestep the macro impact on compile times with watt, a tool that offloads macro compilation to WebAssembly. From the docs: By compiling macros ahead-of-time to Wasm, we save all downstream users of the macro from having to compile the macro logic or its dependencies themselves. Instead, what they compile is a small self-contained Wasm runtime (~3 seconds, shared by all macros) and a tiny proc macro shim for each macro crate to hand off Wasm bytecode into the Watt runtime (~0.3 seconds per proc-macro crate you depend on). This is much less than the 20+ seconds it can take to compile complex procedural macros and their dependencies. Note that this crate is still experimental.
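To illustrate the optional-serde pattern Manish describes, here is a minimal sketch; the crate and type names are hypothetical, not taken from his thread:

[dependencies]
serde = { version = "1", features = ["derive"], optional = true }
# An optional dependency implicitly creates a feature of the same name,
# so downstream users opt in with:
#   your_crate = { version = "0.1", features = ["serde"] }

In the code, the derives are then gated behind that feature:

// Only derive the serde traits when the `serde` feature is enabled.
#[cfg_attr(feature = "serde", derive(serde::Serialize, serde::Deserialize))]
pub struct Article {
    pub title: String,
    pub url: String,
}

This way, users who never serialize your types don’t pay the proc-macro compile cost.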
(Oh, and did I mention that both watt and cargo-llvm-lines were built by David Tolnay, who is a frickin’ steamroller of an engineer?) Get Dedicated Hardware If you reached this point, the easiest way to improve compile times even more is probably to spend money on top-of-the-line hardware. Perhaps a bit surprisingly, the fastest machines for Rust compiles seem to be Apple machines with an M1 chip: Rik Arends on Twitter The benchmarks for the new Macbook Pro with M1 Max are absolutely ridiculous — even in comparison to the already fast M1:

Project      M1 Max  M1 Air
Deno         6m11s   11m15s
MeiliSearch  1m28s   3m36s
bat          43s     1m23s
hyperfine    23s     42s
ripgrep      16s     37s

That’s a solid 2x performance improvement. But if you’d rather stick to Linux, people have also had great success with a multicore CPU like an AMD Ryzen Threadripper and 32 GB of RAM. On portable devices, compiling can drain your battery and be slow. To avoid that, I’m using my machine at home, a 6-core AMD FX 6300 with 12GB RAM, as a build machine. I can use it in combination with Visual Studio Code Remote Development. Compile in the Cloud If you don’t have a dedicated machine yourself, you can offload the compilation process to the cloud instead. Gitpod.io is superb for testing a cloud build as they provide you with a beefy machine (currently 16 core Intel Xeon 2.80GHz, 60GB RAM) for free for a limited period. Simply add https://gitpod.io/# in front of any Github URL. Here is an example for one of my Hello Rust episodes. Gitpod has a neat feature called prebuilds. From their docs: Whenever your code changes (e.g. when new commits are pushed to your repository), Gitpod can prebuild workspaces. Then, when you do create a new workspace on a branch, or Pull/Merge Request, for which a prebuild exists, this workspace will load much faster, because all dependencies will have been already downloaded ahead of time, and your code will be already compiled. Especially when reviewing pull requests, this could give you a nice speedup. Prebuilds are quite customizable; take a look at the .gitpod.yml config of nushell to get an idea. Download ALL The Crates If you have a slow internet connection, a big part of the initial build process is fetching all those shiny crates from crates.io. To mitigate that, you can download all crates in advance to have them cached locally. criner does just that:

git clone https://github.com/the-lean-crate/criner
cd criner
cargo run --release -- mine

The archive size is surprisingly reasonable, with roughly 50GB of required disk space (as of today). Bonus: Speed Up Rust Docker Builds 🐳 Building Docker images from your Rust code? These can be notoriously slow, because cargo doesn’t support building only a project’s dependencies yet, invalidating the Docker cache with every build if you don’t pay attention. cargo-chef to the rescue! ⚡ cargo-chef can be used to fully leverage Docker layer caching, therefore massively speeding up Docker builds for Rust projects. On our commercial codebase (~14k lines of code, ~500 dependencies) we measured a 5x speed-up: we cut Docker build times from ~10 minutes to ~2 minutes. Here is an example Dockerfile if you’re interested:

# Step 1: Compute a recipe file
FROM rust as planner
WORKDIR app
RUN cargo install cargo-chef
COPY . .
RUN cargo chef prepare --recipe-path recipe.json

# Step 2: Cache project dependencies
FROM rust as cacher
WORKDIR app
RUN cargo install cargo-chef
COPY --from=planner /app/recipe.json recipe.json
RUN cargo chef cook --release --recipe-path recipe.json

# Step 3: Build the binary
FROM rust as builder
WORKDIR app
COPY . .
# Copy over the cached dependencies from above
COPY --from=cacher /app/target target
COPY --from=cacher /usr/local/cargo /usr/local/cargo
RUN cargo build --release --bin app

# Step 4: Create a tiny output image.
# It only contains our final binary.
FROM rust as runtime
WORKDIR app
COPY --from=builder /app/target/release/app /usr/local/bin
ENTRYPOINT ["/usr/local/bin/app"]

cargo-chef can help speed up your continuous integration with Github Actions or your deployment process to Google Cloud. Drastic Measures: Overclock Your CPU? 🔥 ⚠️ Warning: You can damage your hardware if you don’t know what you are doing. Proceed at your own risk. Here’s an idea for the desperate. Now, I don’t recommend this to everyone, but if you have a standalone desktop computer with a decent CPU, this might be a way to squeeze out the last bits of performance. Even though the Rust compiler executes a lot of steps in parallel, single-threaded performance is still quite relevant. As a somewhat drastic measure, you can try to overclock your CPU. (I owe you some benchmarks from my machine.) Speeding Up Your CI Builds If you collaborate with others on a Rust project, chances are you use some sort of continuous integration like Github Actions. Optimizing CI build processes is a whole subject of its own. Thankfully, Aleksey Kladov (matklad) collected a few tips on his blog. He touches on bors, caching, splitting build steps, disabling compiler features like incremental compilation or debug output, and more. It’s a great read and you can find it here. Upstream Work Making the Rust compiler faster is an ongoing process, and many fearless people are working on it. Thanks to their hard work, compiler speed has improved 30-40% across the board year-to-date, with some projects seeing up to 45%+ improvements. On top of that, Rust tracks compile regressions on a website dedicated to performance. Work is also put into optimizing the LLVM backend. Rumor has it that there’s still a lot of low-hanging fruit. 🍇 The Rust team is also continuously working to make the compiler faster. Here’s an extract of the 2020 survey: One continuing topic of importance to the Rust community and the Rust team is improving compile times. Progress has already been made with 50.5% of respondents saying they felt compile times have improved. This improvement was particularly pronounced with respondents with large codebases (10,000 lines of code or more) where 62.6% citing improvement and only 2.9% saying they have gotten worse. Improving compile times is likely to be the source of significant effort in 2021, so stay tuned! Help Others: Upload Leaner Crates For Faster Build Times cargo-diet helps you build lean crates that significantly reduce download size (sometimes by 98%). It might not directly affect your own build time, but your users will surely be thankful. 😊 More Resources The Rust Perf Book has a section on compile times. Tons of articles on performance on Read Rust. 8 Solutions for Troubleshooting Your Rust Build Times is a great article by Dotan Nahum that I fully agree with. Improving the build times of a bigger Rust project (lemmy) by 30%.
arewefastyet (offline) measures how long the Rust compiler takes to compile common Rust programs. What’s Next? My company, corrode, can help you with performance problems and reducing your build times. Reach out here. Phew! That was a long list. 😅 If you have any additional tips, please let me know. If compiler performance is something you’re interested in, why not collaborate on a tool to see what user code is causing rustc to use lots of time? Also, once you’re done optimizing your build times, how about optimizing runtimes next? My friend Pascal Hertleif has a nice article on that. " }, { "title": "Gravity", "url": "https://endler.dev/2020/gravity/", "body": "Here’s a test to show your age: Do you still remember that funny JavaScript gravity effect, which Google used on their homepage ten years ago? This one? I wanted to have some fun and integrated it into a website I was building. Unfortunately, it didn’t work out-of-the-box. It choked on some DOM elements that were not strictly classes (like SVG elements). So, in good hacker fashion, I quickly patched up the script (it’s just a three-line change), and now it’s back to its former glory. Test it here! (Caution: you’ll have to reload the page after that. 😏) Apply Gravity

var myLink = document.getElementById('gravity');
myLink.onclick = function () {
    var script = document.createElement('script');
    script.type = 'text/javascript';
    script.src = 'gravity.js';
    document.getElementsByTagName('head')[0].appendChild(script);
    return false;
};

Anyway, feel free to add it to your own sites and have some fun. It’s also great to prank your friends. Simply add that single line to any website and weeee! <script type="text/javascript" src="https://endler.dev/2020/gravity/gravity.js" ></script> Sometimes I miss those simple times of the early web… " }, { "title": "Hackers' Folklore", "url": "https://endler.dev/2020/folklore/", "body": "Some computer terms have a surprising legacy. Many of them are derived from long-obsolete technologies. This post tries to dust off the exciting history of some of these terms that we use every day but aren’t quite sure about their origins. Let’s jump right in! Bike-Shedding Today’s meaning: A pointless discussion about trivial issues. The term bike-shed effect or bike-shedding was coined as a metaphor to illuminate the law of triviality; it was popularised in the Berkeley Software Distribution community by the Danish computer developer Poul-Henning Kamp in 1999 on the FreeBSD mailing list and has spread from there to the whole software industry. The concept was first presented as a corollary of C. Northcote Parkinson’s broader “Parkinson’s law” spoof of management. Parkinson dramatizes this “law of triviality” with the example of a committee’s deliberations on an atomic reactor, contrasting it to deliberations on a bicycle shed. As he put it: “The time spent on any item of the agenda will be in inverse proportion to the sum of money involved.” A reactor is so vastly expensive and complicated that an average person cannot understand it, so one assumes that those who work on it understand it. On the other hand, everyone can visualize a cheap, simple bicycle shed, so planning one can result in endless discussions because everyone involved wants to add a touch and show personal contribution. Reference - Wikipedia: Law of Triviality Boilerplate An old machine that bent steel plates for water boilers.
Source: Wikimedia Commons Today’s meaning: A chunk of code that is copied over and over again with little or no changes made to it in the process. Boiler plate originally referred to the rolled steel used to make water boilers but is used in the media to refer to hackneyed or unoriginal writing. The term refers to the metal printing plates of pre-prepared text such as advertisements or syndicated columns that were distributed to small, local newspapers. These printing plates came to be known as ‘boilerplates’ by analogy. One large supplier to newspapers of this kind of boilerplate was the Western Newspaper Union, which supplied “ready-to-print stories [which] contained national or international news” to papers with smaller geographic footprints, which could include advertisements pre-printed next to the conventional content. References: Wikipedia Stack Overflow StackExchange - English Language & Usage The man in the foreground is holding a rounded printing plate. Plates like this were provided by companies such as Western Newspaper Union to many smaller newspapers. Source: Wikimedia Commons Boot / Reboot / Bootstrapping Lithograph of Baron Münchhausen pulling himself out of a swamp by his pigtail Source: Wikimedia In the context of computers, the term boot refers to the process of starting a computer. In compiler development, the term bootstrapping refers to the process of rewriting a compiler in a new language: The first compiler is written in an existing language. Then it gets rewritten in the new language and compiled by itself. The saying “to pull oneself up by one’s bootstraps” dates back to the 19th century. Tall boots may have a tab, loop or handle at the top to help pull them on. The metaphor spawned additional metaphors for self-sustaining processes that proceed without external help. According to Wikipedia, The idiom dates at least to 1834, when it appeared in the Workingman’s Advocate: “It is conjectured that Mr. Murphee will now be enabled to hand himself over the Cumberland river or a barn yard fence by the straps of his boots.” There’s also a nice summary in Merriam-Webster. Bug Today’s meaning: A defect in a piece of code or hardware. The origins are unknown! Contrary to popular belief, it predates the bug found by Grace Hopper in the Mark II computer. The term was used by engineers way before that; at least since the 1870s. It predates electronic computers and computer software. Thomas Edison used the term “bug” in his notes. Reference Bit The term’s invention is credited to John W. Tukey, who in a memo written for Bell Labs on January 9, 1947, had shortened “binary information digit” to “bit”. Reference Byte The term “byte” was first introduced by Werner Buchholz in June 1956. This was during the initial design stage for the IBM Stretch computer. The computer had a design that enabled addressing down to the individual bit and allowed variable field length instructions, with the size of the byte encoded into the instruction itself. The choice of spelling as “byte” instead of “bite” was intentional, to prevent accidental mutation to “bit”. Carriage Return and Line Feed Today’s meaning: Set the cursor to the beginning of the next line. These two terms were adopted from typewriters. The carriage holds the paper and moves from left to right to advance the typing position as the keys are pressed. It “carries” the paper with it.
The carriage return is the operation that moves the carriage back to its original position at the very left edge of the paper. Simply returning the carriage to the left is not enough to start with a new line, however. The carriage would still be on the same line as before — just at the beginning of the line. To go to a new line, a line feed was needed. It would move the paper inside the typewriter up by one line. These two operations — carriage return (CR) and line feed (LF) — were commonly done at once by pushing the carriage return lever. A mechanical typewriter. The lever for the carriage return is on the outer left side. Source: piqsels On Unix systems (like Linux or macOS), a \n still stands for a line feed (ASCII symbol: LF) or newline. On CP/M, DOS, and Windows, \r\n is used, where \r stands for carriage return and \n stands for line feed (CR+LF). Reference Here is an old video that shows the basic mechanics of carriage return and line-feed. Command key symbol (⌘) Today’s meaning: A meta-key available on Apple computers to provide additional keyboard combinations. Directly quoting Wikipedia (emphasis mine): The ⌘ symbol came into the Macintosh project at a late stage. The development team originally went for their old Apple key, but Steve Jobs found it frustrating when “apples” filled up the Mac’s menus next to the key commands, because he felt that this was an over-use of the company logo. He then opted for a different key symbol. With only a few days left before deadline, the team’s bitmap artist Susan Kare started researching for the Apple logo’s successor. She was browsing through a symbol dictionary when she came across the cloverleaf-like symbol, commonly used in Nordic countries as an indicator of cultural locations and places of interest (it is the official road sign for tourist attraction in Denmark, Finland, Iceland, Norway, and Sweden and the computer key has often been called Fornminne — ancient monument — by Swedish Mac users and Seværdighedstegn by Danish users). When she showed it to the rest of the team, everyone liked it, and so it became the symbol of the 1984 Macintosh command key. Susan Kare states that it has since been told to her that the symbol had been picked for its Scandinavian usage due to its resembling the shape of a square castle with round corner towers as seen from above looking down, notably Borgholm Castle. Norwegian Severdighet road sign Source: Wikimedia Commons Aerial view of Borgholm Castle, which could have been the model for the symbol Source: Wikimedia Commons References: Wikipedia: Command Key Cult of Mac: What Are The Mac’s Command ⌘ And Option ⌥ Symbols Supposed To Represent? Cookie Today’s meaning: A small piece of data sent from a website and stored in the user’s web browser. The term cookie was coined by 23-year-old web browser programmer Lou Montulli in the fall of 1994. It was inspired by the term magic cookie, which is a packet of data a program receives and sends back unchanged, used by Unix programmers. This term in turn derives from the fortune cookie, which is a cookie with an embedded message. Montulli used the term cookie to describe the small packets of data that the web browser receives and sends back unchanged to the web server. “So, yeah, the cookie,” Montulli says with a laugh.
“It’s one week of my life that turned into the most important thing that I ever did.” (Reference) Core Dump Today’s meaning: Retrieving a snapshot of a (crashed) program’s state by storing all of its memory for offline analysis. The name comes from magnetic core memory, which is an early storage mechanism based on a grid of toroid magnets. It has since become obsolete, but the term is still used today for getting a snapshot of a computer process. Reference A 32 x 32 core memory plane storing 1024 bits (or 128 bytes) of data. The first core dumps were printed on paper, which sounds reasonable given these small amounts of bytes. Source: Wikimedia Commons Cursor Today’s meaning: a visual cue (such as a flashing vertical line) on a video display that indicates position (as for data entry). Merriam-Webster Cursor is Latin for runner. A cursor is the name given to the transparent slide engraved with a hairline that is used for marking a point on a slide rule. The term was then transferred to computers through analogy. Reference Source: A December 1951 advertisement for the IBM 604 Electronic Calculating Punch that was first produced in 1948. The advertisement claims the IBM 604 can do the work of 150 engineers with slide rules. The cursor (or runner) is the transparent part in the middle of the slide. Daemon In computing, a daemon is a background process that handles requests for services such as print spooling and file transfers, and is dormant when not required. The term was coined by the programmers of MIT’s Project MAC (Mathematics and Computation) in 1963. They took the name from Maxwell’s demon, a hypothetical creature from a thought experiment that constantly works in the background, sorting molecules. The MIT programmers thought demon would be an appropriate name for a background process that worked tirelessly to perform system chores. But instead of using the term demon, they used daemon, which is an older form of the word. (Reference) Dashboard Today’s meaning: A user interface that provides a quick overview of a system’s status. Originally a plank of wood at the front of a horse-drawn carriage to protect the driver from mud ‘dashed’ backward by a horse’s hooves. When automobiles were manufactured, the board in front of the driver was given the same name. That was the logical place to put the necessary gauges so the driver could see them easily. In time, the term became more associated with the readouts than the protection it offered. Reference A dashboard of a horse carriage. Source: Wikimedia Commons Firewall Today’s meaning: A network security system that establishes a barrier between a trusted internal network and an untrusted external network, such as the Internet. Fire walls are used mainly in terraced houses, but also in individual residential buildings. They prevent fire and smoke from spreading to another part of the building in the event of a fire. Large fires can thus be prevented. The term has been used in computing since the 1980s. Reference A firewall in residential construction, separating the building into two residential units and fire areas. Source: Wikimedia Commons Firmware Today’s meaning: A class of computer software that provides the low-level control for the device’s specific hardware and is closely tied to the hardware it runs on. Ascher Opler coined the term firmware in a 1967 Datamation article. As originally used, firmware contrasted with hardware (the CPU itself) and software (normal instructions executing on a CPU).
It existed on the boundary between hardware and software; thus the name “firmware”. The original article is available on the Internet Archive. Reference Foo and Bar Today’s meaning: Common placeholder variable names. The term might originally come from the military acronym FUBAR. There are a few variations, but a common meaning is “f***ed up beyond all recognition”. The use of foo in a programming context is generally credited to the Tech Model Railroad Club (TMRC) of MIT from circa 1960. In the complex model system, there were scram switches located at numerous places around the room that could be thrown if something undesirable was about to occur, such as a train going full-bore at an obstruction. The way I understood it was that they literally had emergency buttons labeled foo for lack of a better name. Maybe related to the original military meaning of FUBAR to indicate that something is going very, very wrong. A scram switch (button) that could be pressed to prevent inadvertent operation. Maybe the TMRC had buttons labeled foo instead. Source: Wikimedia Commons References: Wikipedia Stack Overflow. Freelancer Today’s meaning: A self-employed person who is not committed to a particular employer long-term. The term first appears in the novel Ivanhoe by Sir Walter Scott. (The novel also had a lasting influence on the Robin Hood legend.) Cover of a Classic Comics book Source: Wikimedia Commons In it, a Lord offers his paid army of ‘free lances’ to King Richard: I offered Richard the service of my Free Lances, and he refused them — I will lead them to Hull, seize on shipping, and embark for Flanders; thanks to the bustling times, a man of action will always find employment. Therefore, a “free lancer” is someone who fights for whoever pays the most. Free does not mean “without pay”, but refers to the additional freedom to work for any employer. Reference Hash Today’s meaning: A hash function is any function that can be used to map data of arbitrary size to fixed-size values. According to Wikipedia, the use of the word “hash” in hash function “comes by way of analogy with its non-technical meaning, to ‘chop and mix’. Indeed, typical hash functions, like the mod operation, ‘chop’ the input domain into many sub-domains that get ‘mixed’ into the output range to improve the uniformity of the key distribution.” References: Software Engineering StackExchange Wikipedia Log / Logfile Today’s meaning: A file that records events of a computer program or system. Sailors used so-called log lines to measure the speed of their ship. A flat piece of wood (the log) was attached to a long rope. The log had regularly spaced knots in it. As the log would drift away, the sailors would count the number of knots that went out in a fixed time interval, and this would be the ship’s speed — in knots. The ship’s speed was important for navigation, so the sailors noted it down in a book, aptly called the log book, together with other information to establish the position of the ship more accurately, like landmark sightings and weather events. Later, additional information, more generally concerning the ship, was added — or logged — such as harbor fees and abnormal provision depletion. Reference. Sailors measuring ship speed with a log line Source: The Pilgrims & Plymouth Colony:1620 by Duane A. Cline The parts of a log-line Source: The Pilgrims & Plymouth Colony:1620 by Duane A. Cline Page from the log book of the British Winchelsea.
The second column denotes the number of knots measured with the log-line, which indicates the ship’s speed Source: Navigation and Logbooks in the Age of Sail by Peter Reaveley Patch Today’s meaning: A piece of code that can be applied to fix or improve a computer program. In the early days of computing history, if you made a programming mistake, you’d have to fix a paper tape or a punched card by putting a patch on top of a hole. A program tape with physical patches used to correct punched holes by covering them. Source: Smithsonian Archives Center Ping Today’s meaning: A way to check the availability and response time of a computer over the network. Ping is a terminal program originally written by Mike Muuss in 1983 that is included in every version of UNIX, Windows, and macOS. He named it “after the sound that a sonar makes, inspired by the whole principle of echo-location. […] ping uses timed IP/ICMP ECHO_REQUEST and ECHO_REPLY packets to probe the ‘distance’ to the target machine.” The reference is well worth a read. Pixel Today’s meaning: The smallest controllable element of a picture represented on the screen. The word pixel is a combination of pix (from “pictures”, shortened to “pics”) and el (for “element”). Similarly, voxel is a volume element and texel is a texture element. Reference Shell Today’s meaning: An interactive, commonly text-based runtime to interact with a computer system. The inventor of the term, Louis Pouzin, does not give an explanation for the name in his essay The Origins of the Shell. It can however be traced back to Unix’s predecessor Multics. It is described in the Multics glossary like so: [The shell] is passed a command line for execution by the listener. The New Hacker’s Dictionary (also known as the Jargon File) by Eric S. Raymond contains the following: Historical note: Apparently, the original Multics shell (sense 1) was so called because it was a shell (sense 3); where sense 3 refers to “a skeleton program, created by hand or by another program (like, say, a parser generator), which provides the necessary incantations to set up some task and the control flow to drive it (the term driver is sometimes used synonymously). The user is meant to fill in whatever code is needed to get real work done. This usage is common in the AI and Microsoft Windows worlds, and confuses Unix hackers.” Unfortunately, the book does not provide any evidence to back up this claim. I like the (possibly historically incorrect) analogy to a nut with the shell being on the outside, protecting the kernel. Reference Slab allocator Today’s meaning: An efficient memory allocation technique that reuses previous allocations. Slab allocation was invented by Jeff Bonwick (Note: PDF file) in 1994 and has since been used by services like Memcached and the Linux kernel. With slab allocation, a cache for a certain type or size of data object has a number of pre-allocated “slabs” of memory; within each slab there are memory chunks of fixed size suitable for the objects. (Wikipedia) The name slab comes from a teenage friend of Bonwick. He tells the story on the Oracle blog: While watching TV together, a commercial by Kellogg’s came on with the tag line, “Can you pinch an inch?” The implication was that you were overweight if you could pinch more than an inch of fat on your waist — and that hoovering a bowl of corn flakes would help.
Without missing a beat, Tommy, who weighed about 250 pounds, reached for his midsection and offered his response: “Hell, I can grab a slab!” A decade later, Bonwick remembered that term when he was looking for a word to describe the allocation of a larger chunk of memory. Here is the original Kellogg’s advertisement. Spam Today’s meaning: Unsolicited electronic communications, for example by sending mass-emails or posting in forums and chats. The term goes back to a sketch by the British comedy group Monty Python from 1970. In the sketch, a cafe is including Spam (a brand of canned cooked pork) in almost every dish. Spam is a portmanteau of spiced and ham. The excessive amount of Spam mentioned is a reference to the ubiquity of it and other imported canned meat products in the UK after World War II (a period of rationing in the UK) as the country struggled to rebuild its agricultural base. Reference Vintage Ad: Look What You Can Do With One Can of Spam Source: By user Jamie (jbcurio) on flickr.com Monty Python’s Flying Circus (1974) - SPAM from Testing Tester on Vimeo. Radio Button Today’s meaning: A UI element that allows choosing from a predefined set of mutually exclusive options. “Radio buttons” are named after their mechanical counterparts: the preset buttons used in radios. The UI concept was later used in tape recorders, cassette recorders and wearable audio players (the famous “Walkman” and similar). And later in VCRs and video cameras. Reference An old car radio (left) and CSS radio buttons (right). Only a single option can be selected at any point in time. As a kid, I would push two buttons at once so they would interlock. Good times. Source: Images by Matt Coady Uppercase and lowercase Today’s meaning: Distinction between capital letters and small letters on a keyboard. Back when typesetting was a manual process where single letters made of lead were “type set” to form words and sentences, upper- and lowercase letters were kept in separate containers — or cases — to make this rather tedious process a little faster. A set of printers’ cases Source: From the book ‘Printing types, their history, forms, and use; a study in survivals’ by Updike, Daniel Berkeley, 1860-1941. Freely available on archive.org. Honorable mentions 404 Today’s meaning: HTTP Status Code for “File not found”. There is a story that the number comes from the server room where the World Wide Web’s central database was located. In there, administrators would manually locate the requested files and transfer them, over the network, to the person who made that request. If a file didn’t exist, they’d return an error message: “Room 404: file not found”. This, however, seems to be a myth and the status code was chosen rather arbitrarily based on the then well-established FTP status codes. Reference Programming languages and Abbreviations The etymology of programming language names and common abbreviations would probably warrant its own article, but I’ve decided to note down some of my favorites for the time being. C++ C++ is a programming language based on C by Bjarne Stroustrup. The name is a programmer pun by Rick Mascitti, a coworker of Stroustrup. The ++ refers to the post-increment operator, which is common in many C-like languages. It increases the value of a variable by 1. In that sense, C++ can be seen as the spiritual “successor” of C.
Reference C Sharp Similarly to C++, C# is a C-like programming language. The name again refers to “incremental” improvements on top of C++. The # in the name looks like four plus signs. Hence C# == (C++)++. But on top of that, the name was also inspired by the musical notation where a sharp indicates that the written note should be made a semitone higher in pitch. Reference A C-Sharp note. Source: Wikimedia Commons PNG Officially, PNG stands for Portable Network Graphics. It was born out of frustration over a CompuServe announcement in 1994 that programs supporting GIF would have to pay licensing fees from then on. A working group led by hacker Thomas Boutell created the .png file format, a patent-free replacement for GIF. Therefore I prefer the format’s unofficial name: PNG’s Not GIF. Here’s a great article on PNG’s history. Reference Credits Most of the content comes from sources like Wikipedia (with reference where appropriate), but the explanations are difficult to hunt down if you don’t know what you’re looking for. This is a living document, and I’m planning to update it in case of reader submissions. Conclusion You have to know the past to understand the present. — Dr. Carl Sagan (1980) I hope you enjoyed this trip down memory lane. Now it’s your turn! 👉 Do you know any other stories? Send me a message, and I’ll add them here. Related Projects Awesome Computer History: A curated list of computer history videos, documentaries and related folklore maintained by Thomas Watson. Wikipedia: List of computer term etymologies: List of the origins of computer-related terms or terms used in the computing world. Talk: The Etymology of Programming by Brittany Storoz - JSConf EU 2018: A talk that explains the background behind a few programming terms. Careful here: the explanation for “bug” is probably wrong as mentioned above. Typewriter terminology that has survived into the personal computer era: A list of computer terms that have their origins from typewriters. Folklore - The Original Macintosh: Anecdotes about the development of Apple’s original Macintosh, and the people who made it. " }, { "title": "A Timelapse of Timelapse", "url": "https://endler.dev/2020/timelapse/", "body": "Timelapse is a little open-source screen recorder for macOS. It takes a screenshot every second and creates a movie in the end. To celebrate its unlikely 1.0 release today, I present here a “timelapse” of this project’s journey. It just took ten years to get here. 2011 - How it all began To be honest, I don’t remember why I initially wrote the tool. I must have had a personal need for a screen recorder, I guess… In May 2011, when I started the project, I was doing my Master’s degree in Computer Science. I might have needed the tool for university; most likely, however, I was just trying to find an excuse for not working on an assignment. During that time, I wrote a lot of tools like that. Mainly to scratch a personal itch, learn a new programming language, or just have fun. Among them are tools like a random sandwich generator for Subway (the American fast-food chain), DrawRoom, a keyboard-driven drawing app inspired by WriteRoom, and the obligatory CMS software that I sold to clients. Surprisingly, none of them were a great success. DrawRoom, a tool that I wrote around the same time, is a real piece of art. To this day it has five commits and a single Github star (by myself, don’t judge…).
What I do know for sure is that I was unhappy with all existing screen recorders. They could roughly be categorized into these three groups: Proprietary solutions that cost money or could call home. Tools that didn’t work on macOS. Small, fragile, one-off scripts that people passed around in forums or as Github gists. They rarely worked as advertised. Among the remaining tools, none provided any timelapse functionality, so I set out to write my own. This all sounds very epic, but in reality, I worked on it for a day. After five heroic commits on May 11, 2011, it sat there, idle, for seven years… 2018 A lot of time elapsed before anything exciting happened. In January ’18, seemingly out of nowhere, the first user filed a bug report. It was titled “hung when creating the avi” 😱. Turns out that a game developer from Canada, juul1a, was trying to use the tool to track her progress on an indie game — how cool is that? To help her out, I decided to do some general cleanup, finally write down some instructions on how to even use the program, add a requirements.txt, and port the tool from mencoder to ffmpeg. After that, timelapse was ready for prime-time. 🎬 Here is some live action from her videos featuring timelapses. At that point, the tool was still very wobbly and could only be used from the command line, but I began to see some potential for building a proper app from it; I just never found the time. In October ’18, I decided to ask for support during Hacktoberfest. I created a few tickets and labeled them with hacktoberfest to try and find contributors. And then, I waited. First, Shreya V Prabhu fixed an issue where a new recording was overwriting the previous one by adding a timestamp to the video name. Then Abner Campanha and Shane Creedon (no longer on Github) created a basic test structure. Gbenro Selere added a CI pipeline for Travis CI. It really worked, and the project was in much better shape after that! 2019 One year passes by, and Kyle Jones adds some contribution guidelines, while I move the CI pipeline to the newly released Github Actions. Chaitanya fixed a bug where the program would hang when the recording stopped by moving the video creation from threads to a separate process. He continued to make the codebase more robust and became a core contributor, reviewing pull requests and handling releases. Thanks to orcutt989, the app now made use of type hints in Python 3.6. gkpln3 added support for multi-monitor configurations. The screen captured will always be the one with the mouse on it. 2020 Fast forward to today, and after almost ten years, we finally created a true macOS app using the awesome py2app bundler. This should make the tool usable by non-developers. Back to the Future We reached the end of our little journey. A long time has passed until 1.0. This project is a testament to the wonders of open source collaboration, and I am proud to work on it with contributors from around the world. It doesn’t have to be a life-changing project to bring people together who have fun building things. If this were the end of the story, I’d be okay with that. I doubt it, though. Here’s to the next ten years! 🎬 Download timelapse on Github. Bonus The video at the beginning is a timelapse of how I finished this article. How meta.
" }, { "title": "A Tiny, Static, Full-Text Search Engine using Rust and WebAssembly", "url": "https://endler.dev/2019/tinysearch/", "body": " I wrote a basic search module that you can add to a static website. It’s very lightweight (50kB-100kB gzipped) and works with Hugo, Zola, and Jekyll. Only searching for entire words is supported. Try the search box on the left for a demo. The code is on Github. Static site generators are magical. They combine the best of both worlds: dynamic content without sacrificing performance. Over the years, this blog has been running on Jekyll, Cobalt, and, lately, Zola. One thing I always disliked, however, was the fact that static websites don’t come with “static” search engines, too. Instead, people resort to custom Google searches, external search engines like Algolia, or pure JavaScript-based solutions like lunr.js or elasticlunr. All of these work fine for most sites, but it never felt like the final answer. I didn’t want to add yet another dependency on Google; neither did I want to use a stand-alone web-backend like Algolia, which adds latency and is proprietary. On the other side, I’m not a huge fan of JavaScript-heavy websites. For example, just the search indices that lunr creates can be multiple megabytes in size. That feels lavish - even by today’s bandwidth standards. On top of that, parsing JavaScript is still time-consuming. I wanted some simple, lean, and self-contained search, that could be deployed next to my other static content. As a consequence, I refrained from adding search functionality to my blog at all. That’s unfortunate because, with a growing number of articles, it gets harder and harder to find relevant content. The Idea Many years ago, in 2013, I read “Writing a full-text search engine using Bloom filters” — and it was a revelation. The idea was simple: Let’s run all my blog articles through a generator that creates a tiny, self-contained search index using this magical data structure called a ✨Bloom Filter ✨. Wait, what’s a Bloom Filter? A Bloom filter is a space-efficient way to check if an element is in a set. The trick is that it doesn’t store the elements themselves; it just knows with some confidence that they were stored before. In our case, it can say with a certain error rate that a word is in an article. A Bloom filter stores a ‘fingerprint’ (a number of hash values) of all input values instead of the raw input. The result is a low-memory-footprint data structure. This is an example of ‘hello’ as an input. Here’s the Python code from the original article that generates the Bloom filters for each post (courtesy of Stavros Korokithakis): filters = for name, words in split_posts.items(): filters[name] = BloomFilter(capacity=len(words), error_rate=0.1) for word in words: filters[name].add(word) The memory footprint is extremely small, thanks to error_rate, which allows for a negligible number of false positives. I immediately knew that I wanted something like this for my homepage. My idea was to directly ship the Bloom filters and the search engine to the browser. I could finally have a small, static search without the need for a backend! Headaches Disillusionment came quickly. I had no idea how to bundle and minimize the generated Bloom filters, let alone run them on clients. The original article briefly touches on this: You need to implement a Bloom filter algorithm on the client-side. This will probably not be much longer than the inverted index search algorithm, but it’s still probably a bit more complicated. 
I didn’t feel confident enough in my JavaScript skills to pull this off. Back in 2013, NPM was a mere three years old, and webpack had just turned one, so I also didn’t know where to look for existing solutions. Unsure what to do next, I put the idea on ice, and it remained a pipe dream. A New Hope Five years later, in 2018, the web had become a different place. Bundlers were ubiquitous, and the Node ecosystem was flourishing. One thing, in particular, revived my dreams about the tiny static search engine: WebAssembly. WebAssembly (abbreviated Wasm) is a binary instruction format for a stack-based virtual machine. Wasm is designed as a portable target for compilation of high-level languages like C/C++/Rust, enabling deployment on the web for client and server applications. [source] This meant that I could use a language that I was familiar with to write the client-side code — Rust! 🎉 My journey started with a prototype back in January 2018. It was just a direct port of the Python version from above:

let mut filters = HashMap::new();
for (name, words) in articles {
    let mut filter = BloomFilter::with_rate(0.1, words.len() as u32);
    for word in words {
        filter.insert(&word);
    }
    filters.insert(name, filter);
}

While I managed to create the Bloom filters for every article, I still had no clue how to package it for the web… until wasm-pack came along in February 2018. Whoops! I Shipped Some Rust Code To Your Browser. Now I had all the pieces of the puzzle: Rust — A language I was comfortable with wasm-pack — A bundler for WebAssembly modules A working prototype that served as a proof-of-concept The search box you see on the left side of this page is the outcome. It fully runs on Rust using WebAssembly (a.k.a. the RAW stack). Try it now if you like. There were quite a few obstacles along the way. Bloom Filter Crates I looked into a few Rust libraries (crates) that implement Bloom filters. First, I tried jedisct1’s rust-bloom-filter, but the types didn’t implement Serialize/Deserialize. This meant that I could not store my generated Bloom filters inside the binary and load them on the client-side. After trying a few others, I found the cuckoofilter crate, which supported serialization. The behavior is similar to Bloom filters, but if you’re interested in the differences, you can look at this summary. Here’s how to use it:

let mut cf = cuckoofilter::CuckooFilter::new();

// Add data to the filter
let value: &str = "hello world";
let success = cf.add(value)?;

// Lookup if data was added before
let success = cf.contains(value); // success ==> true

Let’s check the output size when bundling the filters for ten articles on my blog using cuckoo filters:

~/C/p/tinysearch ❯❯❯ l storage
Permissions Size User    Date Modified Name
.rw-r--r--   44k mendler 24 Mar 15:42  storage

44kB doesn’t sound too shabby, but these are just the cuckoo filters for ten articles, serialized as a Rust binary. On top of that, we have to add the search functionality and the helper code. In total, the client-side code weighed in at 216kB using vanilla wasm-pack. Too much. Trimming Binary Size After the sobering first result of 216kB for our initial prototype, we have a few options to bring the binary size down. The first is following johnthagen’s advice on minimizing Rust binary size. By setting a few options in our Cargo.toml, we can shave off quite a few bytes:

"opt-level = 'z'" => 249665 bytes
"lto = true"      => 202516 bytes
"opt-level = 's'" => 195950 bytes

Setting opt-level to s means we trade speed for size, but we’re primarily interested in minimal size anyway.
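For reference, here is roughly what those settings look like together in a Cargo.toml. This is my own sketch based on the numbers above, not a snippet from johnthagen’s guide:

[profile.release]
opt-level = 's'  # optimize for size ('z' is even more aggressive)
lto = true       # enable link-time optimization across crates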
After all, a small download size also improves performance. Next, we can try wee_alloc, an alternative Rust allocator producing a small .wasm code size. It is geared towards code that makes a handful of initial dynamically sized allocations, and then performs its heavy lifting without any further allocations. This scenario requires some allocator to exist, but we are more than happy to trade allocation performance for small code size. Exactly what we want. Let’s try!

"wee_alloc and nightly" => 187560 bytes

We shaved off another 4% from our binary. Out of curiosity, I tried to set codegen-units to 1, meaning we only use a single thread for code generation. Surprisingly, this resulted in a slightly smaller binary size.

"codegen-units = 1" => 183294 bytes

Then I got word of a Wasm optimizer called binaryen. On macOS, it’s available through homebrew: brew install binaryen It ships a binary called wasm-opt, and that shaved off another 15%:

"wasm-opt -Oz" => 154413 bytes

Then I removed web-sys as we don’t have to bind to the DOM: 152858 bytes. There’s a tool called twiggy to profile the code size of Wasm binaries. It printed the following output:

twiggy top -n 20 pkg/tinysearch_bg.wasm
 Shallow Bytes │ Shallow % │ Item
───────────────┼───────────┼────────────────────────────────
         79256 ┊    44.37% ┊ data[0]
         13886 ┊     7.77% ┊ "function names" subsection
          7289 ┊     4.08% ┊ data[1]
          6888 ┊     3.86% ┊ core::fmt::float::float_to_decimal_common_shortest::hdd201d50dffd0509
          6080 ┊     3.40% ┊ core::fmt::float::float_to_decimal_common_exact::hcb5f56a54ebe7361
          5972 ┊     3.34% ┊ std::sync::once::Once::call_once::{{closure}}::ha520deb2caa7e231
          5869 ┊     3.29% ┊ search

From what I can tell, the biggest chunk of our binary is occupied by the raw data section for our articles. Next up, we have the function headers and some float-to-decimal helper functions, which most likely come from deserialization. Finally, I tried wasm-snip, which replaces a WebAssembly function’s body with an unreachable instruction, like so, but it didn’t reduce code size:

wasm-snip --snip-rust-fmt-code --snip-rust-panicking-code -o pkg/tinysearch_bg_snip.wasm pkg/tinysearch_bg_opt.wasm

After tweaking the parameters of the cuckoo filters a bit and removing stop words from the articles, I arrived at 121kB (51kB gzipped) — not bad considering the average image size on the web is around 900kB. On top of that, the search functionality only gets loaded when a user clicks into the search field. Update Recently I moved the project from cuckoo filters to XOR filters. I used the awesome xorf project, which comes with built-in serde serialization, which allowed me to remove a lot of custom code. With that, I could reduce the payload size by another 20-25%. I’m down to 99kB (49kB gzipped) on my blog now. 🎉 The new version is released on crates.io already, if you want to give it a try. Frontend and Glue Code wasm-pack will auto-generate the JavaScript code to talk to Wasm. For the search UI, I customized a few JavaScript and CSS bits from w3schools. It even has keyboard support! Now when a user enters a search query, we go through the cuckoo filter of each article and try to match the words. The results are scored by the number of hits. (A rough sketch of this lookup follows at the end of this section.) Thanks to my dear colleague Jorge Luis Betancourt for adding that part. Video of the search functionality (Fun fact: this animation is about the same size as the uncompressed Wasm search itself.) Caveats Only whole words are matched. I would love to add prefix-search, but the binary became too big when I tried.
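To make the scoring concrete, here is the promised sketch of that lookup. The function and types are illustrative only; tinysearch’s real code differs:

use cuckoofilter::CuckooFilter;
use std::collections::hash_map::DefaultHasher;

// Illustrative only: score each article by how many query words its
// filter (probably) contains, and return the best matches first.
fn search<'a>(
    query: &str,
    filters: &'a [(String, CuckooFilter<DefaultHasher>)],
    max_results: usize,
) -> Vec<&'a str> {
    let mut scored: Vec<(&str, usize)> = filters
        .iter()
        .map(|(title, filter)| {
            let hits = query
                .split_whitespace()
                .filter(|word| filter.contains(&word.to_lowercase()))
                .count();
            (title.as_str(), hits)
        })
        .filter(|&(_, hits)| hits > 0)
        .collect();
    // More hits rank higher; occasional false positives are the price
    // we pay for the tiny index.
    scored.sort_by(|a, b| b.1.cmp(&a.1));
    scored.into_iter().take(max_results).map(|(title, _)| title).collect()
}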
Usage The standalone binary to create the Wasm file is called tinysearch. It expects a single path to a JSON file as an input: tinysearch path/to/corpus.json This corpus.json contains the text you would like to index. The format is pretty straightforward:

[
  {
    "title": "Article 1",
    "url": "https://example.com/article1",
    "body": "This is the body of article 1."
  },
  {
    "title": "Article 2",
    "url": "https://example.com/article2",
    "body": "This is the body of article 2."
  }
]

You can generate this JSON file with any static site generator. Here’s my version for Zola:

{% set section = get_section(path="_index.md") %}
[
{%- for post in section.pages -%}
  {% if not post.draft %}
  {
    "title": {{ post.title | striptags | json_encode | safe }},
    "url": {{ post.permalink | json_encode | safe }},
    "body": {{ post.content | striptags | json_encode | safe }}
  }
  {% if not loop.last %},{% endif %}
  {% endif %}
{%- endfor -%}
]

I’m pretty sure that the Jekyll version looks quite similar. Here’s a starting point. If you get something working for your static site generator, please let me know. Observations This is still the wild west: unstable features, nightly Rust, documentation gets outdated almost every day. Bring your thinking cap! Creating a product out of a good idea is a lot of work. One has to pay attention to many factors: ease-of-use, generality, maintainability, documentation, and so on. Rust is very good at removing dead code, so you usually don’t pay for what you don’t use. I would still advise you to be very conservative about the dependencies you add to a Wasm binary because it’s tempting to add features that you don’t need and which will add to the binary size. For example, I used StructOpt during testing, and I had a main() function that was parsing these command-line arguments. This was not necessary for Wasm, so I removed it later. I understand that not everyone wants to write Rust code. It’s complicated to get started with, but the cool thing is that you can use almost any other language, too. For example, you can write Go code and transpile to Wasm, or maybe you prefer PHP or Haskell. There is support for many languages already. A lot of people dismiss WebAssembly as a toy technology. They couldn’t be further from the truth. In my opinion, WebAssembly will revolutionize the way we build products for the web and beyond. What was very hard just two years ago is now easy: shipping code in any language to every browser. I’m super excited about its future. If you’re looking for a standalone, self-hosted search index for your company website, check out sonic. Also check out stork as an alternative. ✨WOW! This tool is getting quite a bit of traction lately.✨ I don’t run ads on this website, but if you like these kinds of experiments, please consider sponsoring me on Github. This allows me to write more tools like this in the future. Also, if you’re interested in hands-on Rust consulting, pick a date from my calendar and we can talk about how I can help. Try it! The code for tinysearch is on Github. Please be aware of these limitations: Only searches for entire words. There are no search suggestions. The reason is that prefix search blows up binary size like Mentos and Diet Coke. Since we bundle all search indices for all articles into one static binary, I only recommend using it for low- to medium-sized websites. Expect around 4kB (non-compressed) per article.
The compile times are abysmal at the moment (around 1.5 minutes after a fresh install on my machine), mainly because we’re compiling the Rust crate from scratch every time we rebuild the index. Update: This is mostly fixed thanks to the awesome work of CephalonRho in PR #13. Thanks again! The final Wasm code is laser-fast because we save the roundtrips to a search server. The instant feedback loop feels more like filtering a list than searching through posts. It can even work fully offline, which might be nice if you’d like to bundle it with an app. " }, { "title": "Maybe You Don't Need Kubernetes", "url": "https://endler.dev/2019/maybe-you-dont-need-kubernetes/", "body": " A woman riding a scooter Source: Illustration created by freepik, Nomad logo by HashiCorp. Kubernetes is the 800-pound gorilla of container orchestration. It powers some of the biggest deployments worldwide, but it comes with a price tag. Especially for smaller teams, it can be time-consuming to maintain and has a steep learning curve. For what our team of four wanted to achieve at trivago, it added too much overhead. So we looked into alternatives — and fell in love with Nomad. The Wishlist Our team runs a number of typical services for monitoring and performance analysis: API endpoints for metrics written in Go, Prometheus exporters, log parsers like Logstash or Gollum, and databases like InfluxDB or Elasticsearch. Each of these services runs in its own container. We needed a simple system to keep those jobs running. We started with a list of requirements for container orchestration: Run a fleet of services across many machines. Provide an overview of running services. Allow for communication between services. Restart them automatically when they die. Be manageable by a small team. On top of that, the following things were nice to have but not strictly required:
When To Use Kubernetes

At trivago specifically, many teams use Kubernetes and are quite happy with it. These instances are managed by Google or Amazon, however, which have the capacity to do so. Kubernetes comes with amazing features that make container orchestration at scale more manageable:

- Fine-grained rights management.
- Custom controllers allow getting logic into the cluster. These are just programs that talk to the Kubernetes API.
- Autoscaling! Kubernetes can scale your services up and down on demand. It uses service metrics to do this without manual intervention.

The question is if you really need all those features. You can't rely on these abstractions to just work; you'll have to learn what's going on under the hood. Especially in our team, which runs most services on-premise (because of its close connection to trivago's core infrastructure), we didn't want to pay the price of running our own Kubernetes cluster; we wanted to ship services instead.

Batteries Not Included

Nomad is the 20% of service orchestration that gets you 80% of the way. All it does is manage deployments. It takes care of your rollouts and restarts your containers in case of errors, and that's about it.

The entire point of Nomad is that it does less: it doesn't include fine-grained rights management or advanced network policies, and that's by design. Those components are provided as enterprise services, by a third party, or not at all. I think Nomad hit a sweet spot between ease of use and expressiveness. It's good for small, mostly independent services. If you need more control, you'll have to build it yourself or use a different approach. Nomad is just an orchestrator.

The best part about Nomad is that it's easy to replace. There is little to no vendor lock-in, because the functionality it provides can easily be integrated into any other system that manages services. It just runs as a plain old single binary on every machine in your cluster; that's it!

The Nomad Ecosystem Of Loosely Coupled Components

The real power of Nomad lies within its ecosystem. It integrates very well with other - completely optional - products like Consul (a key-value store) or Vault (for secrets handling). Inside your Nomad file, you can have sections for fetching data from those services:

template {
  data = <<EOH
LOG_LEVEL="{{ key "service/geo-api/log-verbosity" }}"
API_KEY="{{ with secret "secret/geo-api-key" }}{{ .Data.value }}{{ end }}"
EOH

  destination = "secrets/file.env"
  env         = true
}

This will read the service/geo-api/log-verbosity key from Consul and expose it as a LOG_LEVEL environment variable inside your job. It's also exposing secret/geo-api-key from Vault as API_KEY. Simple, but powerful!

Because it's so simple, Nomad can also be easily extended with other services through its API. For example, jobs can be tagged for service discovery. At trivago, we tag all services that expose metrics with trv-metrics. This way, Prometheus finds the services via Consul and periodically scrapes the /metrics endpoint for new data. The same can be done for logs by integrating Loki, for example.

There are many other examples of extensibility:

- Trigger a Jenkins job using a webhook and Consul watches to redeploy your Nomad job on service config changes.
- Use Ceph to add a distributed file system to Nomad.
- Use fabio for load balancing.

All of this allowed us to grow our infrastructure organically without too much up-front commitment.

Fair Warning

No system is perfect. I advise you not to use any fancy new features in production right now.
There are bugs and missing features of course - but that's also the case for Kubernetes.

Compared to Kubernetes, there is far less momentum behind Nomad. Kubernetes has seen around 75,000 commits and 2,000 contributors so far, while Nomad sports about 14,000 commits and 300 contributors. It will be hard for Nomad to keep up with the velocity of Kubernetes, but maybe it doesn't have to! Its scope is much narrower, and the smaller community could also mean that it'll be easier to get your pull request accepted, in comparison to Kubernetes.

Summary

The takeaway is: don't use Kubernetes just because everyone else does. Carefully evaluate your requirements and check which tool fits the bill.

If you're planning to deploy a fleet of homogeneous services on large-scale infrastructure, Kubernetes might be the way to go. Just be aware of the additional complexity and operational costs. Some of these costs can be avoided by using a managed Kubernetes environment like Google Kubernetes Engine or Amazon EKS. If you're just looking for a reliable orchestrator that is easy to maintain and extend, why not give Nomad a try? You might be surprised by how far it'll get you.

If Kubernetes were a car, Nomad would be a scooter. Sometimes you prefer one and sometimes the other. Both have their right to exist.

" }, { "title": "What Is Rust Doing Behind the Curtains?", "url": "https://endler.dev/2018/cargo-inspect/", "body": "

Rust allows for a lot of syntactic sugar that makes it a pleasure to write. It is sometimes hard, however, to look behind the curtain and see what the compiler is really doing with our code.

At Rust Belt Rust 2018, I saw a talk by Tshepang Lekhonkhobe titled Syntax conveniences afforded by the compiler (Recording here). To quote the abstract:

The Rust compiler provides a number of conveniences that make life easier for its users. It is good to know what these are, to avoid being mystified by what's going on under the hood… the less magical thinking we have of the world, the better.

He goes on to give a few examples of these conveniences:

- lifetime elisions
- type inference
- syntactic sugar
- implicit dereferencing
- type coercions
- hidden code (e.g. the prelude)

It was very educational and fun to see him compare code with and without these conveniences during the talk. Coming home, I wanted to learn more. I wondered if there was a tool which revealed what Rust was doing behind the curtains. Over on Reddit, I found a discussion about compiler flags to produce desugared output. (Note that I'm using rustup here to trigger the nightly compiler with the +nightly flag.)

rustc +nightly -Zunpretty=hir example.rs

HIR stands for high-level intermediate representation. This is basically an abstract syntax tree (AST) more suited for use by the compiler. It replaces syntactic sugar with basic building blocks that are easier to handle by the following compile steps. To find out more, read this detailed write-up by Nico Matsakis.

Anyway, the output looked surprisingly readable (see below). With some syntax highlighting and formatting, this could be quite a handy tool. I tried to use rustfmt on it, and it worked unreasonably well. Motivated by this quick win, I wrapped it up in a cargo subcommand and called it cargo-inspect.

Let's try cargo-inspect on some real code!

Example - Desugaring a range expression

The following examples can also be found in the project's examples folder.

Input:

for n in 1..3 {
    println!("{}", n);
}

Output of cargo-inspect: That's the neatly formatted terminal output.
It sports line numbers and colors, thanks to prettyprint, which is a library on top of bat. Maybe you can't read that, so here's the gist of it:

match ::std::iter::IntoIterator::into_iter(
    ::std::ops::Range { start: 1, end: 3 }) {
    mut iter => loop {
        // ...
    },
};

We can see that 1..3 gets converted into std::ops::Range { start: 1, end: 3 }. To the compiler backend, these are absolutely the same. So this holds:

assert_eq!((1..3), std::ops::Range { start: 1, end: 3 });

Example - File handling

Input:

use std::fs::File;
use std::io::Error;

fn main() -> Result<(), Error> {
    let file = File::open("file.txt")?;
    Ok(())
}

Output:

#[prelude_import]
use std::prelude::v1::*;
#[macro_use]
extern crate std;
use std::fs::File;
use std::io::Error;

fn main() -> Result<(), Error> {
    let file = match ::std::ops::Try::into_result(<File>::open("file.txt")) {
        ::std::result::Result::Err(err) => {
            #[allow(unreachable_code)]
            return ::std::ops::Try::from_error(::std::convert::From::from(err))
        }
        ::std::result::Result::Ok(val) => {
            #[allow(unreachable_code)]
            val
        }
    };
    Ok(())
}

We can see that the ? operator gets desugared into a match on the Result of File::open. In case of an error, we apply std::convert::From::from to convert between error types. Otherwise, we simply return the Ok value.

Talk

Over at FOSDEM in Belgium, I was able to speak about the project in detail. Here is the recording.

Future work

I'm not planning to rewrite the compiler here. rustc is doing a far greater job than I could. All this functionality already existed before; I'm merely trying to make the compiler more approachable for learners like me. Right now, the tool is quite fragile. It throws ugly error messages when things go wrong. It mostly shines when you run it on small, isolated example snippets.

Get involved!

Over on Github, I opened up a few issues for others to get involved. Namely, I wish there were options to:

- Make it work with cargo projects.
- Show the original code above the desugared code.
- Show only part of the full output.

…and much more. Also, if you find a particularly exciting code example, don't be shy to contribute it to the examples folder.

" }, { "title": "The Unreasonable Effectiveness of Excel Macros", "url": "https://endler.dev/2018/excel/", "body": "

I never was a big fan of internships, partially because all the exciting companies were far away from my little village in Bavaria and partially because I was too shy to apply. Only once did I apply for an internship in Ireland as part of a school program. Our teacher assigned the jobs, and so my friend got one at Apple and I ended up at a medium-sized IT distributor — let's call them PcGo.

Judging by the website, the company looked quite impressive, but in reality, it was just a secluded, grey warehouse in the rainy industrial area of Cork. Upon arrival, I was introduced to my colleague Evgeny, who was the main (and only) employee responsible for assembling desktop computers. From what I could tell, he ran the shop. He only spoke broken English, so he handed me an electric screwdriver and a box of screws, and I got to work. Together we assembled a lot of computers in my first week, and we had a lot of fun.

One day he drove me home from work because I missed my bus. It was a rainy day, and while he was driving through the narrow streets of Cork we talked and laughed, but all of a sudden I heard a loud bang.
I looked through the rear mirror only to find that there was no rear mirror anymore. Turns out he bumped into another car, and the thing went off. Evgeny didn’t mind. In a thick Eastern-European accent he remarked “Lost three mirrors before already,” and kept driving. In my second week, I had a visit from my boss. Apparently, I was done with the workload that they planned for my three-week internship. I was used to assembling and installing computers, which explains why. To keep me busy, they put together another task. On an old Windows 98 computer in the back, he pointed the browser to silverpages.ie, searched for “computer” and after a while we looked at an endless list of addresses of Irish companies having “something to do with computers.” Each entry consisted of the expected fields: the company name, the address, the phone number, the website (if any) and a list of keywords. My boss said that they needed an overview of all competing vendors. He carefully selected a field from an entry, copied it and pasted it into an Excel sheet. He did the same for the remaining fields. “That’s it!”, he said with a fake smile. We both knew that this would mean two boring weeks for me. They wanted to keep me busy by letting me manually scrape the entirety of a web database. I could have taken that as an insult, but instead, I looked at it as a challenge. I noticed that the page number on silverpages.ie could be controlled by a GET parameter. “Can I write a program that does the scraping?” My boss was noticeably puzzled. “Uhm… you can do whatever you want, but you’re not allowed to install any additional software!”. With that, he was off. Judging from the installed programs, I wasn’t left with many choices: Excel or Minesweeper. I knew that Excel’s Visual Basic macros were quite powerful, but I wasn’t sure if I could scrape a full website with it. After a while, I detected a feature to download a website into an Excel sheet (what a glorious functionality). This worked perfectly, so all I had to do was record a macro to create a temporary sheet for each page, copy all important fields into a “master slide” and then get rid of the temporary sheet. I recorded the macro and looked at the code. The rest of the day was spent figuring out how to modify the URL in a loop and cleaning up the macro. I pressed the “run macro” button and then I sat there waiting. The computer was running at full speed. My biggest fear was that the program would crash or that the computer would run out of memory. I refrained from playing minesweeper on it, so I mostly played pool or chatted with Evgeny. When I came to the office the next morning, my program was done. To my surprise, it scraped the entirety of SilverPages, and there were many thousands of entries in the list. I sent the document to my boss via E-Mail and then got back to playing minesweeper. An hour later, three guys with suits were standing behind me. I had to show them the list again. They couldn’t believe I did that on my own, so I showed them the tool to scrape the data. For them, I had some sort of superpower. They left without giving me another task; I was free to do whatever I wanted for the remaining two weeks. I went on to write an inventory tool for them, which they could also manage via Excel. It was just a glorious Excel form for a spreadsheet that they maintained manually. I spent two weeks of my summer vacation to finish that tool because they said they would pay me for that, which, of course, they didn’t :). 
Lessons learned

Never underestimate the power of Excel macros. If you have a boring task at hand, make it more challenging by adding constraints.

" }, { "title": "Switching from a German to a US Keyboard Layout - Is It Worth It?", "url": "https://endler.dev/2018/keyboard/", "body": "

For the first three decades of my life, I exclusively used a German keyboard layout for programming. In 2018, I finally switched to a US layout. This post summarizes my thoughts on the topic. I was looking for a similar article before jumping the gun, but I couldn't find one — so I wrote it.

My current keyboard (as of April 2021), the low-profile, tenkeyless Keychron K1, is close to my favorite input device. Yes, I got the RGB version. — Amazon referral link.

Why Switch To the US Layout?

I was reasonably efficient when writing prose, but felt like a lemur on a piano when programming: lots of finger-stretching while trying to reach the special keys like {, ;, or /.

German Keyboard Layout. Source: Image by Wikipedia.

Here's Wikipedia's polite explanation of why the German keyboard sucks for programming:

Like many other non-American keyboards, German keyboards change the right Alt key into an Alt Gr key to access a third level of key assignments. This is necessary because the umlauts and some other special characters leave no room to have all the special symbols of ASCII, needed by programmers among others, available on the first or second (shifted) levels without unduly increasing the size of the keyboard.

But Why Switch Now?

After many years of using a rubber-dome Logitech Cordless Desktop Wave, I had to get a mechanical keyboard again. Those rubber domes just feel too mushy to me now. In addition to that, I enjoy the clicky sound of a mechanical keyboard and the noticeable tactile bump. (I'm using Cherry MX Brown keys with O-ring dampeners to contain the anger of my coworkers.) Most mechanical keyboards come with an ANSI US layout only, so I figured I'd finally make the switch.

My first mechanical keyboard — Durgod Taurus K320 (referral link). They also have a fancy white-pink ISO version now.

How Long Did It Take To Get Accustomed To The New Layout?

Working as a software engineer, my biggest fear was that the switch would slow down my daily work. This turned out not to be true. I was reasonably productive from day one, and nobody even noticed any difference. (That's a good thing, right?)

At first, I didn't like the bar-shaped US Return key. I preferred the European layout with a vertical Enter key, and I was afraid that I would hit the new key by accident. After a while, though, I found the US Return key to be even more convenient. I never hit it by accident, and it's easy to reach with my pinky from the home position. Within two weeks, I was back to 100% typing speed.

Did My Programming Speed Improve Noticeably?

Yup. I'd say I can type programs about 30% faster now. Especially when using special characters (/, ;, {, and so on), I'm much faster now; partly because the key locations feel more intuitive, but mainly because my fingers stay at their dedicated positions now. Somehow the position of special characters feels just right. I can now understand why Vim uses / for search, or why the pipe symbol is |: both are easy to reach! It all makes sense now! (For a fun time, try that on a German keyboard!)

I now understand why Microsoft chose \ as a directory separator: it's easily accessible from a US keyboard. On the German layout, it's… just… awful (Alt Gr+ß on Windows, Shift + Option + 7 on Mac).
The opening curly brace on a German layout Mac is produced with Alt+8, which always made me leave the home row and break my typing flow. Now there are dedicated keys for parentheses. Such a relief!

Update: It also helps greatly when looking up hotkeys for IDEs, text editors, photo editors, etc., because some programs remap shortcuts for the German market, which means that all the English documentation is totally worthless. Now I can just use the shortcuts mentioned and move on with my life.

Am I Slower When Writing German Texts Now?

In the beginning, I was. Somehow my brain associated the German layout with German texts. First, I used the macOS layout switcher. This turned out to be cumbersome and slow. Then I found the "US with Umlauts via Option Key" layout. It works perfectly fine for me. It allows me to use a single keyboard layout but insert German umlauts at will (e.g. ö is Option+o). There is probably a similar layout for other language combinations. Stefan Imhoff notified me that there's also a Karabiner rule which does the same. Might come in handy in case you already use this tool.

Is Switching Between Keyboards Painful?

US keyboard layout. Source: Wikipedia.

My built-in MacBook Pro keyboard layout is still German. I was afraid that switching between the internal German and the external English keyboard would confuse me. This turned out not to be a problem. I rarely look at the print anyway. (Update: can't remember when I last looked at the print.)

How Often Do You Switch Back To A German Layout Now?

Never. My girlfriend has a German keyboard, and every time I have to use it, I switch to the US layout. It makes her very happy when I do this and forget to switch back to German when I'm done.

Summary

If you consider switching, just do it! I don't look back at all, and apart from the initial transition period, I still couldn't find any downsides. Since posting this article, many of my friends have made the switch as well and had similar experiences:

- Wolfgang Gassler wrote a post about his layout switch on Linux
- Discussion of the article on Twitter

" }, { "title": "fastcat - A Faster `cat` Implementation Using Splice", "url": "https://endler.dev/2018/fastcat/", "body": "

Lots of people asked me to write another piece about the internals of well-known Unix commands. Well, actually, nobody asked, but it makes for a good intro. I'm sure you've read the previous parts about yes and ls — they are epic.

Anyway, today we talk about cat, which is used to concatenate files - or, more commonly, abused to print a file's contents to the screen.

# Concatenate files, the intended purpose
cat input1.txt input2.txt input3.txt > output.txt

# Print file to screen, the most common use-case
cat myfile

Implementing cat

Here's a naive cat in Ruby:

#!/usr/bin/env ruby

def cat(args)
  args.each do |arg|
    IO.foreach(arg) do |line|
      puts line
    end
  end
end

cat(ARGV)

This program goes through each file and prints its contents line by line. Easy peasy! But wait, how fast is this tool?

I quickly created a random 2 GB file for the benchmark. Let's compare the speed of our naive implementation with the system one, using the awesome pv (Pipe Viewer) tool. All tests are averaged over five runs on a warm cache (file in memory).

# Ruby 2.5.1
> ./rubycat myfile | pv -r > /dev/null
[196MiB/s]

Not bad, I guess? How does it compare with my system's cat?

cat myfile | pv -r > /dev/null
[1.90GiB/s]

Uh oh, GNU cat is ten times faster than our little Ruby cat.
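(A quick aside that is my addition and not part of the original post: the equivalent naive approach in Rust would lean on std::io::copy from the standard library. This is only a baseline sketch before we reach for lower-level tricks.)

use std::env;
use std::fs::File;
use std::io;

fn main() -> io::Result<()> {
    let stdout = io::stdout();
    let mut out = stdout.lock();
    for path in env::args().skip(1) {
        // io::copy moves bytes through a userspace buffer,
        // so the data still crosses the kernel boundary twice.
        io::copy(&mut File::open(path)?, &mut out)?;
    }
    Ok(())
}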
💎🐈🐌

Making our Ruby cat a little faster

Our naive Ruby code can be tweaked a bit. Turns out line buffering hurts performance in the end¹:

#!/usr/bin/env ruby

def cat(args)
  args.each do |arg|
    IO.copy_stream(arg, STDOUT)
  end
end

cat(ARGV)

rubycat myfile | pv -r > /dev/null
[1.81GiB/s]

Wow… we didn't really try hard, and we're already approaching the speed of a tool that has been optimized since 1971. 🎉

But before we celebrate too much, let's see if we can go even faster.

Splice

What initially motivated me to write about cat was this comment by user wahern on Hacker News:

I'm surprised that neither GNU yes nor GNU cat uses splice(2).

Could this splice thing make printing files even faster? — I was intrigued.

Splice was first introduced to the Linux kernel in 2006, and there is a nice summary from Linus Torvalds himself, but I prefer the description from the manpage:

splice() moves data between two file descriptors without copying between kernel address space and user address space. It transfers up to len bytes of data from the file descriptor fd_in to the file descriptor fd_out, where one of the file descriptors must refer to a pipe.

If you really want to dig deeper, here's the corresponding source code from the Linux kernel, but we don't need to know all the nitty-gritty details for now. Instead, we can just inspect the header from the C implementation:

#include <fcntl.h>

ssize_t splice(int fd_in, loff_t *off_in, int fd_out,
               loff_t *off_out, size_t len, unsigned int flags);

To break it down even more, here's how we would copy the entire src file to dst:

const ssize_t r = splice(src, NULL, dst, NULL, size, 0);

The cool thing about this is that all of it happens inside the Linux kernel, which means we won't copy a single byte to userspace (where our program runs). Ideally, splice works by remapping pages and does not actually copy any data, which may improve I/O performance (reference).

Source: File icon by Aleksandr Vector from the Noun Project. Terminal icon by useiconic.com from the Noun Project.

Using splice from Rust

I have to say I'm not a C programmer, and I prefer Rust because it offers a safer interface. Here's the same thing in Rust:

#[cfg(any(target_os = "linux", target_os = "android"))]
pub fn splice(
    fd_in: RawFd,
    off_in: Option<&mut libc::loff_t>,
    fd_out: RawFd,
    off_out: Option<&mut libc::loff_t>,
    len: usize,
    flags: SpliceFFlags,
) -> Result<usize>

Now, I didn't implement the Linux bindings myself. Instead, I just used a library called nix, which provides Rust-friendly bindings to *nix APIs.

There is one caveat, though: we cannot really copy the file directly to standard out, because splice requires one file descriptor to be a pipe. The way around that is to create a pipe, which consists of a reader and a writer (rd and wr). We pipe the file into the writer, and then we read from the pipe and push the data to stdout.

You can see that I use a relatively big buffer of 16384 bytes (2¹⁴) to improve performance.
extern crate nix;

use std::env;
use std::fs::File;
use std::io;
use std::os::unix::io::AsRawFd;

use nix::fcntl::{splice, SpliceFFlags};
use nix::unistd::pipe;

const BUF_SIZE: usize = 16384;

fn main() {
    for path in env::args().skip(1) {
        let input = File::open(&path)
            .expect(&format!("fcat: {}: No such file or directory", path));
        let (rd, wr) = pipe().unwrap();
        let stdout = io::stdout();
        let _handle = stdout.lock();

        loop {
            let res = splice(
                input.as_raw_fd(),
                None,
                wr,
                None,
                BUF_SIZE,
                SpliceFFlags::empty(),
            ).unwrap();

            if res == 0 {
                // We read 0 bytes from the input,
                // which means we're done copying.
                break;
            }

            let _res = splice(
                rd,
                None,
                stdout.as_raw_fd(),
                None,
                BUF_SIZE,
                SpliceFFlags::empty(),
            ).unwrap();
        }
    }
}

So, how fast is this?

fcat myfile | pv -r > /dev/null
[5.90GiB/s]

Holy guacamole. That's over three times as fast as system cat.

Operating System support

Linux and Android are fully supported. OpenBSD also has some sort of splice implementation called sosplice. I didn't test that, though. On macOS, the closest thing to splice is its bigger brother, sendfile, which can send a file to a socket within the kernel. Unfortunately, it does not support sending from file to file.² There's also copyfile, which has a similar interface, but unfortunately, it is not zero-copy. (I thought so in the beginning, but I was wrong.) Windows doesn't provide zero-copy file-to-file transfer (only file-to-socket transfer using the TransmitFile API). Nevertheless, in a production-grade implementation, the splice support could be activated on systems that support it, while using a generic implementation as a fallback.

Nice, but why on earth would I want that?

I have no idea. Probably you don't, because your bottleneck is somewhere else. That said, many people use cat for piping data into another process, like

# Count all lines in C files
cat *.c | wc -l

or

cat kittens.txt | grep "dog"

In this case, if you notice that cat is the bottleneck, try fcat (but first, try to avoid cat altogether).

With some more work, fcat could also be used to directly route packets from one network card to another, similar to netcat.

Lessons learned

- The closer we get to bare metal, the more our hard-won abstractions fall apart, and we are back to low-level systems programming.
- Apart from a fast cat, there's also a use-case for a slow cat: old computers. For that purpose, there's — you guessed it — slowcat.

That said, I still have no idea why GNU cat does not use splice on Linux. 🤔

The source code for fcat is on Github. Contributions welcome!

Footnotes

1. Thanks to reader Freeky for making this code more idiomatic.↩
2. Thanks to reader masklinn for the hint.↩

" }, { "title": "That Octocat on the Wall", "url": "https://endler.dev/2018/github/", "body": "

Photo of my office with Github's octocat on the wall over my couch.

So I'm in a bit of a sentimental mood lately. Github got acquired by Microsoft. While I think the acquisition was well-deserved, I still wish it hadn't happened. Let me explain.

My early days

I joined Github on the 3rd of January 2010. Since I was a bit late to the game, my usual handle (mre) was already taken. So I naively sent a mail to Github, asking if I could bag the name, as it seemed to be abandoned. To my surprise, I got an answer. The response came from a guy named Chris Wanstrath. All he wrote was "it's yours."

That was the moment I fell in love with Github. I felt encouraged to collaborate on projects and felt that everybody could contribute something valuable.
Only later I found out that Chris was one of the founders and the CEO of the company. Living on Github Before Github, there was SourceForge, and I only went there to download binaries. Github showed me, that there was an entire community of like-minded people out there, who ❤️ to work on code in their free-time. To me, Github is much more than a git interface; it’s a social network. While other people browse Facebook or Instagram, I browse Github. I can still vividly remember getting my first star and my first issue on one of my projects coming from a real (!) person other than myself. After so many years, a pull-request still feels like the most personal gift anyone could give to me. Github - the culture After a while, I started to admire some Github employees deeply: Zach Holman (who is about my age) is a great writer, speaker, and one of the most creative developers I can think of. Scott Chacon, who taught me a lot about git and whose presentation tool, showoff, I’ve used at University. Tom Preston-Werner, who I admire for refusing an offer from Microsoft to pursue his dream and build Github, for establishing a super-nerdy company culture, and for Jekyll. All three developers have since left the company. I can’t help but notice that Github has changed. The harassment accusations and letting Zach Holman go are only part of the story. It has become a company like any other, maintaining a mature product. It doesn’t excite me anymore. An alternative reality There’s still a bitter taste in my mouth when I think that Github has fallen prey to one of the tech giants. I loved Github while it was a small, friendly community of passionate developers. Could this have been sustainable? Maybe through paid features for project maintainers. You see, if you do Open Source every day, it can be a lot of work. People start depending on your projects, and you feel responsible for keeping the lights on. To ease the burden, I’d love to have deeper insights into my project usage: visitor statistics for longer than two weeks, a front page where you could filter and search for events, a better way to handle discussions (which can get out of hand quickly), better CI integration à la Gitlab. These features would be targeted at the top 10% of Github users, a group of 3 million people. Would this be enough to pay the bills? Probably. Would it be enough to grow? Probably not. So what? I don’t think the acquisition will kill the culture. Microsoft is a strong partner and Nat Friedman is one of us. On the other side, I’m not as enthusiastic as I used to be. There’s room for competitors now and I’m beginning to wonder what will be the next Github. That said, I will keep the Octocat on my office wall, in the hope that the excitement comes back. " }, { "title": "Ten Years of Vim", "url": "https://endler.dev/2018/ten-years-of-vim/", "body": " When I opened Vim by accident for the first time, I thought it was broken. My keystrokes changed the screen in unpredictable ways, and I wanted to undo things and quit. Needless to say, it was an unpleasant experience. There was something about it though, that kept me coming back and it became my main editor. Fast forward ten years (!) and I still use Vim. After all the Textmates and Atoms and PhpStorms I tried, I still find myself at home in Vim. People keep asking me: Why is that? Why Vim? Before Vim, I had used many other editors like notepad or nano. 
They all behaved more or less as expected: you insert text, you move your cursor with the arrow keys or your mouse, and you save with Control + S or by using the menu bar. VI (and Vim, its spiritual successor) is different. EVERYTHING in Vim is different, and that’s why it’s so highly effective. Let me explain. The Zen of Vim The philosophy behind Vim takes a while to sink in: While other editors focus on writing as the central part of working with text, Vim thinks it’s editing. You see, most of the time I don’t spend writing new text; instead, I edit existing text. I mold text, form it, turn it upside down. Writing text is craftsmanship and hard work. You have to shape your thoughts with your cold, bare hands until they somewhat form a coherent whole. This painful process is what Vim tries to make at least bearable. It helps you keep control. It does that, by providing you sharp, effective tools to modify text. The core of Vim is a language for editing text. Vim, The Language The Vim commands are not cryptic, you already know them. To undo, type u. To find the next t, type ft. To delete a word, type daw. To change a sentence, type cas. More often than not, you can guess the correct command by thinking of an operation you want to execute and an object to execute it on. Then just take the first character of every word. Try it! If anything goes wrong, you can always hit ESC and type u for undo. Operations: delete, find, change, back, insert, append,… Objects: word, sentence, parentheses, (html) tag,… (see :help text-objects) Inserting text is just another editing operation, which can be triggered with i. That’s why, by default, you are in normal mode — also called command mode — where all those operations work. Once you know this, Vim makes a lot more sense, and that’s when you start to be productive. How My Workflow Changed Over The Years When I was a beginner, I was very interested in how people with more Vim experience would use the editor. Now that I’m a long-time user, here’s my answer: there’s no secret sauce. I certainly feel less exhausted after editing text for a day, but 90% of the commands I use fit on a post-it note. That said, throughout the years, my Vim habits changed. I went through several phases: Year 1: I’m happy if I can insert text and quit again. Year 2: That’s cool, let’s learn more shortcuts. Year 3-5: Let’s add all the features!!! Year 6-10: My .vimrc is five lines long. Year three is when I started to learn the Vim ecosystem for real. I tried all sorts of flavors like MacVim and distributions like janus. For a while, I even maintained my own Vim configuration , which was almost 400 lines long. All of that certainly helped me learn what’s out there, but I’m not sure if I would recommend that to a Vim beginner. After all, you don’t really need all of that. Start with a vanilla Vim editor which works just fine! My current Vim setup is pretty minimalistic. I don’t use plugins anymore, mostly out of laziness and because built-in Vim commands or macros can replace them. Here are three concrete examples of how my workflow changed over the years: In the beginning, I used a lot of “number powered movements”. That is, if you have a command like b, which goes back one word in the text, you can also say 5b to go back five words. Nowadays I mostly use / to move to a matching word because it’s quicker. I don’t use arrow keys to move around in text anymore but forced myself to use h, j, k, l. Many people say that this is faster. 
After trying this for a few years, I don’t think that is true (at least for me). I now just stick to it out of habit. On my main working machine I use Vim for quick text editing and Visual Studio Code plus the awesome Vim plugin for projects. This way, I get the best of both worlds. Workflow Issues I Still Struggle With After all these years I’m still not a Vim master — far from it. As every other Vim user will tell you, we’re all still learning. Here are a few things I wish I could do better: Jumping around in longer texts: I know the basics, like searching (/), jumping to a matching bracket (%) or jumping to specific lines (for line 10, type 10G), but I still could use symbols more often for navigation. Using visual mode for moving text around: Sometimes it can be quite complicated to type the right combination of letters to cut (delete) the text I want to move around. That’s where visual mode (v) shines. It highlights the selected text. I should use it more often. Multiple registers for copy and paste: Right now I only use one register (like a pastebin) for copying text, but Vim supports multiple registers. That’s cool if you want to move around more than one thing at the same time. Let’s use more of those! Tabs: I know how tabs work, but all the typing feels clunky. That’s why I never extensively used them. Instead, I mostly use multiple terminal tabs or an IDE with Vim bindings for bigger projects. Would I learn Vim again? That’s a tough question to answer. On one side, I would say no. There’s a steep learning curve in Vim and seeing all those modern IDEs become better at understanding the user’s intent, editing text became way easier and faster in general. On the other side, Vim is the fastest way for me to write down my thoughts and code. As a bonus, it runs on every machine and might well be around for decades to come. In contrast, I don’t know if the IntelliJ shortcuts will be relevant in ten years (note: if you read this in the future and ask yourself “What is IntelliJ?”, the answer might be no). Takeaways If I can give you one tip, don’t learn Vim by memorizing commands. Instead, look at your current workflow and try to make it better, then see how Vim can make that easier. It helps to look at other people using Vim to get inspired (Youtube link with sound). You will spend a lot of time writing text, so it’s well worth the time investment to learn one editor really well — especially if you are a programmer. After ten years, Vim is somehow ingrained in my mind. I think Vim when I’m editing text. It has become yet another natural language to me. I’m looking forward to the next ten years. " }, { "title": "Refactoring Go Code to Avoid File I/O in Unit Tests", "url": "https://endler.dev/2018/go-io-testing/", "body": "At work today, I refactored some simple Go code to make it more testable. The idea was to avoid file handling in unit tests without mocking or using temporary files by separating data input/output and data manipulation. A gopher reading a long computer printout Source: Illustration by Marcus Olsson CC BY-NC-SA 4.0 I was surprised that I couldn’t find a simple explanation on sites like StackOverflow, which is why I wrote down some notes myself so that others can refer to it in the future. 
Our example code

The initial version looked like this:

package main

import (
    "bufio"
    "os"
)

func main() {
    analyze("test.txt")
}

func analyze(file string) error {
    handle, err := os.Open(file)
    if err != nil {
        return err
    }
    defer handle.Close()

    scanner := bufio.NewScanner(handle)
    for scanner.Scan() {
        // Do something with line
        _ = scanner.Text()
    }
    return nil
}

As you can see, we take a filename as input, and we open that file inside the analyze function to do something with its contents.

Writing our first test for the code

A typical test harness for that code might look like this:

package main

import "testing"

func Test_analyze(t *testing.T) {
    t.Run("Test something", func(t *testing.T) {
        if err := analyze("test.txt"); (err != nil) != false {
            t.Errorf("analyze() error = %v", err)
        }
    })
}

All fine and good?

Problems

This will work, but file I/O while running tests is not always the best idea. For one, you could be running in a constrained environment where you don't have access to the file. We could use temporary files to avoid this, but there might be problems with disk I/O, which makes for flaky tests and frustration. Another process could also modify the file during the test. All these issues have nothing to do with your code. Furthermore, it's not enough to just look at the test and see exactly what's going on. You also have to read the text file first.

A lot of people suggest mocking instead. There are quite a few powerful libraries like spf13/afero for this purpose. These packages will create temporary files in the background and clean up afterward. In my opinion, mocking should be the last resort when it comes to testing. Before you mock, check that you use the right abstractions in your code. Maybe implementing against an interface or using dependency injection helps decouple components? More often than not, a clear separation of concerns is all you need.

Refactoring to make testing easier

In my case above, we can easily avoid using mocks and temporary files by decoupling file I/O from the analysis. We do so by refactoring our analyze function to call doSomething, which takes an io.Reader. (You could also use an array of strings for now.)

Our main.go now looks like this:

package main

import (
    "bufio"
    "io"
    "os"
)

func main() {
    analyze("test.txt")
}

func analyze(file string) error {
    handle, err := os.Open(file)
    if err != nil {
        return err
    }
    defer handle.Close()
    return doSomething(handle)
}

func doSomething(handle io.Reader) error {
    scanner := bufio.NewScanner(handle)
    for scanner.Scan() {
        // Do something with line
        _ = scanner.Text()
    }
    return nil
}

Now we can test the actual analysis in isolation:

package main

import (
    "strings"
    "testing"
)

func Test_analyze(t *testing.T) {
    t.Run("Test something", func(t *testing.T) {
        if err := doSomething(strings.NewReader("This is a test string")); (err != nil) != false {
            t.Errorf("analyze() error = %v", err)
        }
    })
}

We changed analyze("test.txt") to doSomething(strings.NewReader("This is a test string")). (Of course, we should also write a separate test for analyze(), but the focus here is on decoupling the datasource-agnostic part.)

Result

By slightly refactoring our code, we gained the following advantages:

- Simple testability: No mocks or temporary files.
- Separation of concerns: Each function does exactly one thing.
- Easier code re-use: The doSomething() function will work with any io.Reader and can be called from other places. We can even move it to its own library if we want.

On Reddit, user soapysops made an important remark:

In general, I prefer to not accept a file name in an API. A file name doesn't give users enough control. It doesn't let you use an unusual encoding, special file permissions, or a bytes.Buffer instead of an actual file, for example. Accepting a file name adds a huge dependency to the code: the file system, along with all of its associated OS specific stuff. So I probably would have eliminated the file name based API and only exposed one based on io.Reader. That way, you have complete code coverage, fast tests, and far fewer edge cases to worry about.

I totally agree with that sentiment.
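For what it's worth, the same principle translates almost verbatim to Rust. Here is a minimal sketch of my own (the function names merely mirror the Go example and are not from the original discussion):

use std::fs::File;
use std::io::{self, BufRead, BufReader, Read};

// Accept any reader instead of a file name;
// the caller decides where the bytes come from.
fn do_something<R: Read>(input: R) -> io::Result<()> {
    for line in BufReader::new(input).lines() {
        let _line = line?; // Do something with line
    }
    Ok(())
}

fn main() -> io::Result<()> {
    do_something(File::open("test.txt")?)
}

A test can then call do_something("This is a test string".as_bytes()) without ever touching the file system.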
A file name doesn’t give users enough control. It doesn’t let you use an unusual encoding, special file permissions, or a bytes.Buffer instead of an actual file, for example. Accepting a file name adds a huge dependency to the code: the file system, along with all of its associated OS specific stuff. So I probably would have eliminated the file name based API and only exposed one based on io.Reader. That way, you have complete code coverage, fast tests, and far fewer edge cases to worry about. I totally agree with that sentiment. But often times you can’t simply change the user-facing API easily, because the API might be public and might already have users. The refactoring above is just the first step towards better architecture. There is definitely a lot more you can do to start writing robust, well-tested systems in Go. More Resources If that got you interested, also check out justforfunc #29: dependency injection in a code review, which covers the same topic: document.addEventListener( DOMContentLoaded , function() lightEmbedInit(); ); A great resource that I can recommend is Learn Go with Tests. It teaches you test-driven development with Go and helps you get a grounding with TDD. Another one is The Go Programming Language book, co-authored by Brian W. Kernighan (of Unix fame), which shows how to write clear and idiomatic Go to solve real-world problems. It contains a dedicated chapter on interfaces and testing. It also covers io.Reader in more detail. The Go Programming Language book co-authored by Brian W. Kernighan (affiliate link) " }, { "title": "A Tiny `ls` Clone Written in Rust", "url": "https://endler.dev/2018/ls/", "body": "In my series of useless Unix tools rewritten in Rust, today I’m going to be covering one of my all-time favorites: ls. First off, let me say that you probably don’t want to use this code as a replacement for ls on your local machine (although you could!). As we will find out, ls is actually quite a powerful tool under the hood. I’m not going to come up with a full rewrite, but instead only cover the very basic output that you would expect from calling ls -l on your command line. What is this output? I’m glad you asked. Expected output > ls -l drwxr-xr-x 2 mendler staff 13468 Feb 4 11:19 Top Secret -rwxr--r-- 1 mendler staff 6323935 Mar 8 21:56 Never Gonna Give You Up - Rick Astley.mp3 -rw-r--r-- 1 mendler staff 0 Feb 18 23:55 Thoughts on Chess Boxing.doc -rw-r--r-- 1 mendler staff 380434 Dec 24 16:00 nobel-prize-speech.txt Your output may vary, but generally, there are a couple of notable things going on. From left to right, we’ve got the following fields: The drwx things in the beginning are the file permissions (also called the file mode). If d is set, it’s a directory. r means read, w means write and x execute. This rwx pattern gets repeated three times for the current user, the group, and other computer users respectively. Next we got the hardlink count when referring to a file, or the number of contained directory entries when referring to a directory. (Reference) Owner name Group name Number of bytes in the file Date when the file was last modified Finally, the path name For more in-depth information, I can recommend reading the manpage of ls from the GNU coreutils used in most Linux distributions and the one from Darwin (which powers MacOS). Whew, that’s a lot of information for such a tiny tool. But then again, it can’t be so hard to port that to Rust, right? Let’s get started! 
A very basic ls in Rust

Here is the most bare-bones version of ls, which just prints all files in the current directory:

use std::fs;
use std::path::Path;
use std::error::Error;
use std::process;

fn main() {
    if let Err(ref e) = run(Path::new(".")) {
        println!("{}", e);
        process::exit(1);
    }
}

fn run(dir: &Path) -> Result<(), Box<Error>> {
    if dir.is_dir() {
        for entry in fs::read_dir(dir)? {
            let entry = entry?;
            let file_name = entry
                .file_name()
                .into_string()
                .or_else(|f| Err(format!("Invalid entry: {:?}", f)))?;
            println!("{}", file_name);
        }
    }
    Ok(())
}

We can copy that straight out of the documentation. When we run it, we get the expected output:

> cargo run
Cargo.lock
Cargo.toml
src
target

It prints the files and exits. Simple enough. We should stop for a moment and celebrate our success, knowing that we just wrote our first little Unix utility from scratch.

Pro Tip: You can install the binary with cargo install and call it like any other binary from now on.

But we have higher goals, so let's continue.

Adding a parameter to specify the directory

Usually, if we type ls mydir, we expect to get the file listing of no other directory than mydir. We should add the same functionality to our version. To do this, we need to accept command line parameters. One Rust crate that I love to use in this case is structopt. It makes argument parsing very easy. Add it to your Cargo.toml. (You need cargo-edit for the following command.)

cargo add structopt

Now we can import it and use it in our project:

#[macro_use]
extern crate structopt;

// use std::...
use structopt::StructOpt;

#[derive(StructOpt, Debug)]
struct Opt {
    /// Output file
    #[structopt(default_value = ".", parse(from_os_str))]
    path: PathBuf,
}

fn main() {
    let opt = Opt::from_args();
    if let Err(ref e) = run(&opt.path) {
        println!("{}", e);
        process::exit(1);
    }
}

fn run(dir: &PathBuf) -> Result<(), Box<Error>> {
    // Same as before
}

By adding the Opt struct, we can define the command line flags, input parameters, and the help output super easily. There are tons of configuration options, so it's worth checking out the project homepage. Also note that we changed the type of the path variable from Path to PathBuf. The difference is that PathBuf owns the inner path string, while Path simply provides a reference to it. The relationship is similar to String and &str.

Reading the modification time

Now let's deal with the metadata. First, we try to retrieve the modification time from the file. A quick look at the documentation shows us how to do it:

use std::fs;

let metadata = fs::metadata("foo.txt")?;

if let Ok(time) = metadata.modified() {
    println!("{:?}", time);
}

The output might not be what you expect: we receive a SystemTime object, which represents the measurement of the system clock. E.g., this code

println!("{:?}", SystemTime::now());
// Prints: SystemTime { tv_sec: 1520554933, tv_nsec: 610406401 }

But the format that we would like to have is something like this:

Mar  9 01:24

Thankfully, there is a library called chrono, which can read this format and convert it into any human-readable output we like:

let current: DateTime<Local> = DateTime::from(SystemTime::now());
println!("{}", current.format("%_d %b %H:%M").to_string());

This prints 9 Mar 01:29. (Yeah, I know it's getting late.)

Armed with that knowledge, we can now read our file modification time.

cargo add chrono

use chrono::{DateTime, Local};

fn run(dir: &PathBuf) -> Result<(), Box<Error>> {
    if dir.is_dir() {
        for entry in fs::read_dir(dir)? {
            let entry = entry?;
            let file_name = ...
            let metadata = entry.metadata()?;
            let size = metadata.len();
            let modified: DateTime<Local> = DateTime::from(metadata.modified()?);
            println!(
                "{:>5} {} {}",
                size,
                modified.format("%_d %b %H:%M").to_string(),
                file_name
            );
        }
    }
    Ok(())
}

This {:>5} might look weird. It's a formatting directive provided by std::fmt. It means "right-align this field with a space padding of 5" - just like our bigger brother ls -l is doing it. Similarly, we retrieved the size in bytes with metadata.len().

Unix file permissions are a zoo

Reading the file permissions is a bit more tricky. While the rwx notation is very common in Unix derivatives such as *BSD or GNU/Linux, many other operating systems ship their own permission management. There are even differences between the Unix derivatives. Wikipedia lists a few extensions to the file permissions that you might encounter:

- + (plus) suffix indicates an access control list that can control additional permissions.
- . (dot) suffix indicates an SELinux context is present. Details may be listed with the command ls -Z.
- @ suffix indicates extended file attributes are present.

That just goes to show that there are a lot of important details to be considered when implementing this in real life.

Implementing very basic file mode

For now, we just stick to the basics and assume we are on a platform that supports the rwx file mode. Behind the r, the w, and the x are in reality octal numbers. That's easier for computers to work with, and many hardcore users even prefer to type the numbers over the symbols. The ruleset behind those octals is as follows. I took that from the chmod manpage:

Modes may be absolute or symbolic. An absolute mode is an octal number constructed from the sum of one or more of the following values:

0400 Allow read by owner.
0200 Allow write by owner.
0100 For files, allow execution by owner.
0040 Allow read by group members.
0020 Allow write by group members.
0010 For files, allow execution by group members.
0004 Allow read by others.
0002 Allow write by others.
0001 For files, allow execution by others.

For example, to set the permissions for a file so that the owner can read, write, and execute it and nobody else can do anything would be 700 (400 + 200 + 100).

Granted, those numbers have been the same since the 70s and are not going to change soon, but it's still a bad idea to compare our file permissions directly with the values; if not for compatibility reasons, then for readability and to avoid magic numbers in our code. Therefore, we use the libc crate, which provides constants for those magic numbers. As mentioned above, these file permissions are Unix-specific, so we need to import a Unix-only library named std::os::unix::fs::PermissionsExt for that.

extern crate libc;

// Examples:
// * `S_IRGRP` stands for "read permission for group",
// * `S_IXUSR` stands for "execution permission for user"
use libc::{S_IRGRP, S_IROTH, S_IRUSR, S_IWGRP, S_IWOTH, S_IWUSR, S_IXGRP, S_IXOTH, S_IXUSR};
use std::os::unix::fs::PermissionsExt;

We can now get the file permissions like so:

let metadata = entry.metadata()?;
let mode = metadata.permissions().mode();
parse_permissions(mode as u16);

parse_permissions() is a little helper function defined as follows:

fn parse_permissions(mode: u16) -> String {
    let user = triplet(mode, S_IRUSR, S_IWUSR, S_IXUSR);
    let group = triplet(mode, S_IRGRP, S_IWGRP, S_IXGRP);
    let other = triplet(mode, S_IROTH, S_IWOTH, S_IXOTH);
    [user, group, other].join("")
}

It takes the file mode as a u16 (simply because the libc constants are u16) and calls triplet on it. For each flag (read, write, and execute), triplet runs a binary & operation on mode. The output is matched exhaustively against all possible permission patterns:

fn triplet(mode: u16, read: u16, write: u16, execute: u16) -> String {
    match (mode & read, mode & write, mode & execute) {
        (0, 0, 0) => "---",
        (_, 0, 0) => "r--",
        (0, _, 0) => "-w-",
        (0, 0, _) => "--x",
        (_, 0, _) => "r-x",
        (_, _, 0) => "rw-",
        (0, _, _) => "-wx",
        (_, _, _) => "rwx",
    }.to_string()
}
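To convince ourselves that parse_permissions and triplet behave as expected, a quick unit test helps. (This test is my addition, not part of the original post; run it with cargo test.)

#[cfg(test)]
mod tests {
    use super::*;

    #[test]
    fn owner_only_permissions() {
        // 0o700: the owner may read, write, and execute;
        // group and others get nothing.
        assert_eq!(parse_permissions(0o700), "rwx------");
    }
}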
Wrapping up

The final output looks like this. Close enough.

> cargo run
rw-r--r--     7  6 Mar 23:10 .gitignore
rw-r--r-- 15618  8 Mar 00:41 Cargo.lock
rw-r--r--   185  8 Mar 00:41 Cargo.toml
rwxr-xr-x   102  5 Mar 21:31 src
rwxr-xr-x   136  6 Mar 23:07 target

That's it! You can find the final version of our toy ls on Github. We are still far away from a full-fledged ls replacement, but at least we learned a thing or two about its internals.

If you're looking for a proper ls replacement written in Rust, go check out lsd. If, instead, you want to read another blog post from the same series, check out A Little Story About the yes Unix Command.

" }, { "title": "Rust in 2018", "url": "https://endler.dev/2018/rust-2018/", "body": "

I wrote about the future of Rust before, and it seems like nobody stops me from doing it again! Quite the contrary: this time, the Rust core team even asked for it. I'm a bit late to the party, but here are my 2 cents about the priorities for Rust in 2018.

Who is this guy?

There's a depressingly high chance that we've never met before — which is a real shame. For some context: I come from dynamically typed languages like Python and PHP. Rust was the first language that allowed me to write real low-level code without feeling like arguing with a bouncer. To me, Rust is not a fireflower, it's my own personal Megazord¹. I want Rust to win, but for that, we need to tick a few points off the list.

Compiler documentation for easier contribution

When I was in Columbus, Ohio for Rust Belt Rust, I met Niko Matsakis, Ariel Ben-Yehuda, and Santiago Pastorino. Those fine gentlemen eagerly worked on non-lexical lifetimes during the impl period. Watching them hack away on the compiler was deeply inspirational to me, and I started wondering if I could contribute, too. Needless to say, the barrier to entry for hacking on the compiler can be quite high. I haven't contributed anything yet. One thing I'd love to do is to spend short 30-60 minute chunks of time to fix a small thing in the compiler here and there. Could be as simple as renaming a variable, writing a test, or adding some documentation. Hence my first wish is that contributing to the language will become easier. That could be achieved by providing extensive mentorship, more entry-level tickets, and better compiler documentation. All of that was already suggested by Niko.

More resources for intermediate programmers

On a related note, I'd like to see more talks/guidelines/books targeting intermediate Rust programmers. This includes discussions on how to structure big projects in Rust and Rust-specific design patterns. I want to read more about professional Rust usage and see case studies from various industries. For example, there was a startup called snips.ai, which built an on-device voice assistant using Rust. They integrated with C and C++ libraries, and I want to hear more about their journey.

Improve the RFC process

I try to follow the RFC process very closely, but my time is limited.
My wish is that I can open any RFC and immediately get its status:

- A summary of the discussion with major pros and cons.
- A simple usage example, right at the beginning.
- The next steps towards stabilization.

For example, if I look at this (not so) random issue, I don't even know where to start. What are the biggest blockers right now? Who is actively pushing this forward? How can I help out? Github is great for code, but conversations about new features regularly get out of hand. This is not a problem that is limited to Rust, either. Just look at other big projects like Docker, Kubernetes, or Node. Maybe we need a new tool for that.

The usual suspects

If I could ask for two stable features in 2018, it would be ? in main and non-lexical lifetimes. There's more I could mention of course, but I'm not gonna bore you with faster compile times, impl trait, generators, and the like. We're on a good path here, see Nick Cameron's post instead. I'm convinced that by improving documentation and mentorship, we can grow the number of contributors significantly and stabilize many highly-anticipated features this year.

1. Disclaimer: I never watched a single episode of Power Rangers.↩

" }, { "title": "Functional Programming for Mathematical Computing", "url": "https://endler.dev/2018/functional-mathematics/", "body": "

Programming languages help us describe general solutions for problems; the result just happens to be executable by machines. Every programming language comes with a different set of strengths and weaknesses, one reason being that its syntax and semantics heavily influence the range of problems which can easily be tackled with it.

tl;dr: I think that functional programming is better suited for mathematical computations than the more common imperative approach.

Using built-in abstractions for Mathematics

The ideas behind a language (the underlying programming paradigms) are distinctive for the community that builds around it. The developers create a unique ecosystem of ready-to-use libraries and frameworks around the language core. As a consequence, some languages are stronger in areas such as business applications (one could think of Cobol), while others work great for systems programming (like C or Rust).

When it comes to solving mathematical and numerical problems with computers, Fortran might come to mind. Although Fortran is a general-purpose language, it is mostly known for scientific computing. Of course, the language was created with that purpose in mind – hence the name, Formula Translation. One reason for its popularity in this area is that it offers some built-in domain-specific keywords to express mathematical concepts, while keeping an eye on performance. For instance, it has a dedicated datatype for complex numbers – COMPLEX – and a keyword named DIMENSION which is quite similar to the mathematical term and can be used to create arrays and vectors.

Imperative vs functional style

Built-in keywords can help expand the expressiveness of a language into a specific problem space, but this approach is severely limited. It's not feasible to extend the language core ad infinitum; it would just be harder to maintain and take longer to learn. Therefore, most languages provide other ways of abstraction – like functions, subroutines, classes and objects – to split a routine into smaller, more manageable parts.
These mechanisms might help to control the complexity of a program, but especially when dealing with mathematical problems, one has to be careful not to obfuscate the solution with boilerplate code. Specimen I - Factorial As an example, the stated problem might be to translate the following formula, which calculates the factorial of a positive number n, into program code:

n! = 1 · 2 · … · n

An implementation of the above formula using imperative-style Java might look like this:

public static long fact(final int n) {
    if (n < 0) {
        // Negative numbers not allowed
        System.err.println("Negative numbers not allowed");
        return 0;
    }
    long prod = 1;
    for (int i = 1; i <= n; ++i) {
        prod *= i;
    }
    return prod;
}

This is quite a long solution for such a short problem definition. (Note that writing a version with an explicit loop from 1 to n was on purpose; a recursive function would be shorter, but it uses a concept which was not introduced by the mathematical formula.) Also, the program contains many language-specific keywords, such as public, static, and System.err.println(). On top of that, the programmer must explicitly provide all data types for the variables in use – a tiresome obligation. All of this obfuscates the mathematical definition. Compare this with the following version written in a functional language like Haskell:

fact n = product [1..n]

This is an almost direct translation from the problem definition into code. It needs no explicit types, no temporary variables, and no access modifiers (such as public). Specimen II - Dot product One could argue that the above Haskell program owes its brevity to the fact that the language provides just the right abstractions (namely the product keyword and the [1..n] range syntax) for that specific task. Therefore, let's examine a simple function which is available in neither Haskell nor Java: the dot product of two vectors. The mathematical definition is as follows:

a · b = Σᵢ aᵢbᵢ

For vectors with three dimensions, it can be written as

a · b = a₁b₁ + a₂b₂ + a₃b₃

First, a Haskell implementation:

type Scalar a = a

data Vector a = Vector a a a deriving (Show)

dot :: (Num a) => Vector a -> Vector a -> Scalar a
(Vector a1 a2 a3) `dot` (Vector b1 b2 b3) = a1*b1 + a2*b2 + a3*b3

Note that the mathematical types can be defined in one line each. Further note that we define the dot function in infix notation; that is, we place the first argument of dot in front of the function name and the second argument behind it. This way, the code looks more like its mathematical equivalent. An example call of the above function would be

(Vector 1 2 3) `dot` (Vector 3 2 1)

which is short, precise, and readable. Now, a similar implementation in Java:

public static class Vector<T extends Number> {
    private T x, y, z;

    public Vector(T x, T y, T z) {
        this.x = x;
        this.y = y;
        this.z = z;
    }

    public double dot(Vector<?> v) {
        return (x.doubleValue() * v.x.doubleValue()
              + y.doubleValue() * v.y.doubleValue()
              + z.doubleValue() * v.z.doubleValue());
    }

    public static void main(String[] args) {
        Vector<Integer> a = new Vector<Integer>(3, 2, 1);
        Vector<Integer> b = new Vector<Integer>(1, 2, 3);
        System.out.println(a.dot(b));
    }
}

For a proper textual representation of Vectors, the toString() method would also need to be overridden. In Haskell, one can simply derive from the Show typeclass, as shown in the code. Creating new abstractions If functions and types are not sufficient to write straightforward programs, Haskell also offers simple constructs to create new operators and keywords which extend the language core itself.
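As an aside for readers who follow this blog for the Rust content: you can approximate some of this in Rust through operator overloading with the std::ops traits. Here is a small sketch of my own (not part of the original comparison) that reuses the * operator for the dot product:

use std::ops::Mul;

#[derive(Debug)]
struct Vector(f64, f64, f64);

// Overload `*` so that `a * b` reads like the mathematical dot product.
impl Mul for Vector {
    type Output = f64;

    fn mul(self, other: Vector) -> f64 {
        self.0 * other.0 + self.1 * other.1 + self.2 * other.2
    }
}

fn main() {
    let (a, b) = (Vector(1.0, 2.0, 3.0), Vector(3.0, 2.0, 1.0));
    println!("{}", a * b); // prints "10"
}

Rust only lets you overload a fixed set of existing operators, though; Haskell lets you invent entirely new ones.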
This makes domain-specific-languages feasible and enables the developer to work more directly on the actual problem instead of working around peculiarities of the programming language itself (such as memory management or array iteration). Haskell embraces this concept; Java has no such functionality. Conclusion I’m not trying to bash Java or worship Haskell here. Both languages have their place. I merely picked Java, because lots of programmers can read it. The comparison is more between a functional and an imperative approach for numerical and symbolical programming; and for that, I prefer a functional approach every day. It removes clutter and yields elegant solutions. It provides convenient methods to work on a high level of abstraction and speak in mathematical terms and still, these strengths are disregarded by many programmers. Abraham H. Maslow’s observation in his 1966 book The Psychology of Science seems fitting: “I suppose it is tempting, if the only tool you have is a hammer, to treat everything as if it were a nail.” " }, { "title": "Rust for Rubyists", "url": "https://endler.dev/2017/rust-for-rubyists/", "body": "Recently I came across a delightful article on idiomatic Ruby. I’m not a good Ruby developer by any means, but I realized, that a lot of the patterns are also quite common in Rust. What follows is a side-by-side comparison of idiomatic code in both languages. The Ruby code samples are from the original article. Map and Higher-Order Functions The first example is a pretty basic iteration over elements of a container using map. user_ids = users.map |user| user.id The map concept is also pretty standard in Rust. Compared to Ruby, we need to be a little more explicit here: If users is a vector of User objects, we first need to create an iterator from it: let user_ids = users.iter().map(|user| user.id); You might say that’s quite verbose, but this additional abstraction allows us to express an important concept: will the iterator take ownership of the vector, or will it not? With iter(), you get a “read-only view” into the vector. After the iteration, it will be unchanged. With into_iter(), you take ownership over the vector. After the iteration, the vector will be gone. In Rust terminology, it will have moved. Read some more about the difference between iter() and into_iter() here. The above Ruby code can be simplified like this: user_ids = users.map(&:id) In Ruby, higher-order functions (like map) take blocks or procs as an argument and the language provides a convenient shortcut for method invocation — &:id is the same as |o| o.id() . Something similar could be done in Rust: let id = |u: &User| u.id; let user_ids = users.iter().map(id); This is probably not the most idiomatic way to do it, though. What you will see more often is the use of Universal Function Call Syntax in this case:1 let user_ids = users.iter().map(User::id); In Rust, higher-order functions take functions as an argument. Therefore users.iter().map(Users::id) is more or less equivalent to users.iter().map(|u| u.id()).2 Also, map() in Rust returns another iterator and not a collection. If you want a collection, you would have to run collect() on that, as we’ll see later. Iteration with Each Speaking of iteration, one pattern that I see a lot in Ruby code is this: ["Ruby", "Rust", "Python", "Cobol"].each do |lang| puts "Hello # lang !" 
end Since Rust 1.21, this is now also possible: ["Ruby", "Rust", "Python", "Cobol"] .iter() .for_each(|lang| println!("Hello lang !", lang = lang)); Although, more commonly one would write that as a normal for-loop in Rust: for lang in ["Ruby", "Rust", "Python", "Cobol"].iter() println!("Hello lang !", lang = lang); Select and filter Let’s say you want to extract only even numbers from a collection in Ruby. even_numbers = [1, 2, 3, 4, 5].map |element| element if element.even? # [ni, 2, nil, 4, nil] even_numbers = even_numbers.compact # [2, 4] In this example, before calling compact, our even_numbers array had nil entries. Well, in Rust there is no concept of nil or Null. You don’t need a compact. Also, map doesn’t take predicates. You would use filter for that: let even_numbers = vec![1, 2, 3, 4, 5] .iter() .filter(|&element| element % 2 == 0); or, to make a vector out of the result // Result: [2, 4] let even_numbers: Vec<i64> = vec![1, 2, 3, 4, 5] .into_iter() .filter(|element| element % 2 == 0).collect(); Some hints: I’m using the type hint Vec<i64> here because, without it, Rust does not know what collection I want to build when calling collect. vec! is a macro for creating a vector. Instead of iter, I use into_iter. This way, I take ownership of the elements in the vector. With iter() I would get a Vec<&i64> instead. In Rust, there is no even method on numbers, but that doesn’t keep us from defining one! let even = |x: &i64| x % 2 == 0; let even_numbers = vec![1, 2, 3, 4, 5].into_iter().filter(even); In a real-world scenario, you would probably use a third-party package (crate) like num for numerical mathematics: extern crate num; use num::Integer; fn main() let even_numbers: Vec<i64> = vec![1, 2, 3, 4, 5] .into_iter() .filter(|x| x.is_even()).collect(); In general, it’s quite common to use crates in Rust for functionality that is not in the standard lib. Part of the reason why this is so well accepted is that cargo is such a rad package manager. (Maybe because it was built by no other than Yehuda Katz of Ruby fame. 😉) As mentioned before, Rust does not have nil. However, there is still the concept of operations that can fail. The canonical type to express that is called Result. Let’s say you want to convert a vector of strings to integers. let maybe_numbers = vec!["1", "2", "nah", "nope", "3"]; let numbers: Vec<_> = maybe_numbers .into_iter() .map(|i| i.parse::<u64>()) .collect(); That looks nice, but maybe the output is a little unexpected. numbers will also contain the parsing errors: [Ok(1), Ok(2), Err(ParseIntError kind: InvalidDigit ), Err(ParseIntError kind: InvalidDigit ), Ok(3)] Sometimes you’re just interested in the successful operations. An easy way to filter out the errors is to use filter_map: let maybe_numbers = vec!["1", "2", "nah", "nope", "3"]; let numbers: Vec<_> = maybe_numbers .into_iter() .filter_map(|i| i.parse::<u64>().ok()) .collect(); I changed two things here: Instead of map, I’m now using filter_map. parse returns a Result, but filter_map expects an Option. We can convert a Result into an Option by calling ok() on it3. The return value contains all successfully converted strings: [1, 2, 3] The filter_map is similar to the select method in Ruby: [1, 2, 3, 4, 5].select |element| element.even? Random numbers Here’s how to get a random number from an array in Ruby: [1, 2, 3].sample That’s quite nice and idiomatic! Compare that to Rust: let mut rng = thread_rng(); rng.choose(&[1, 2, 3, 4, 5]) For the code to work, you need the rand crate. 
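Adding the crate is a one-line change to Cargo.toml; the version below is my guess at what was current back then:

[dependencies]
rand = "0.3"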
Click on the snippet for a running example. There are some differences to Ruby. Namely, we need to be more explicit about what random number generator we want exactly. We decide for a lazily-initialized thread-local random number generator, seeded by the system. In this case, I’m using a slice instead of a vector. The main difference is that the slice has a fixed size while the vector does not. Within the standard library, Rust doesn’t have a sample or choose method on the slice itself. That’s a design decision: the core of the language is kept small to allow evolving the language in the future. This doesn’t mean that you cannot have a nicer implementation today. For instance, you could define a Choose trait and implement it for [T]. extern crate rand; use rand:: thread_rng, Rng ; trait Choose<T> fn choose(&self) -> Option<&T>; impl<T> Choose<T> for [T] fn choose(&self) -> Option<&T> let mut rng = thread_rng(); rng.choose(&self) This boilerplate could be put into a crate to make it reusable for others. With that, we arrive at a solution that rivals Ruby’s elegance. [1, 2, 4, 8, 16, 32].choose() Implicit returns and expressions Ruby methods automatically return the result of the last statement. def get_user_ids(users) users.map(&:id) end Same for Rust. Note the missing semicolon. fn get_user_ids(users: &[User]) -> Vec<u64> users.iter().map(|user| user.id).collect() But in Rust, this is just the beginning, because everything is an expression. The following block splits a string into characters, removes the h, and returns the result as a HashSet. This HashSet will be assigned to x. let x: HashSet<_> = // Get unique chars of a word 'h', 'e', 'l', 'o' let unique = "hello".chars(); // filter out the 'h' unique.filter(|&char| char != 'h').collect() ; Same works for conditions: let x = if 1 > 0 "absolutely!" else "no seriously" ; Since a match statement is also an expression, you can assign the result to a variable, too! enum Unit Meter, Yard, Angstroem, Lightyear, let length_in_meters = match unit Unit::Meter => 1.0, Unit::Yard => 0.91, Unit::Angstroem => 0.0000000001, Unit::Lightyear => 9.461e+15, ; Multiple Assignments In Ruby you can assign multiple values to variables in one step: def values [1, 2, 3] end one, two, three = values In Rust, you can only decompose tuples into tuples, but not a vector into a tuple for example. So this will work: let (one, two, three) = (1, 2, 3); But this won’t: let (one, two, three) = [1, 2, 3]; // ^^^^^^^^^^^^^^^^^ expected array of 3 elements, found tuple Neither will this: let (one, two, three) = [1, 2, 3].iter().collect(); // a collection of type `(_, _, _)` cannot be built from an iterator over elements of type `& integer ` But with nightly Rust, you can now do this: let [one, two, three] = [1, 2, 3]; On the other hand, there’s a lot more you can do with destructuring apart from multiple assignments. You can write beautiful, ergonomic code using pattern syntax. let x = 4; let y = false; match x 4 | 5 | 6 if y => println!("yes"), _ => println!("no"), To quote The Book: This prints no since the if condition applies to the whole pattern 4 | 5 | 6, not only to the last value 6. String interpolation Ruby has extensive string interpolation support. 
programming_language = "Ruby"
"#{programming_language} is a beautiful programming language"

This can be translated like so:

let programming_language = "Rust";
format!("{} is also a beautiful programming language", programming_language);

Named arguments are also possible, albeit much less common:

println!("{language} is also a beautiful programming language", language = "Rust");

Rust's println!() syntax is even more extensive than Ruby's. Check the docs if you're curious about what else you can do. That's it! Ruby comes with syntactic sugar for many common usage patterns, which allows for very elegant code. Low-level programming and raw performance are not primary goals of the language. If you do need that, Rust might be a good fit, because it provides fine-grained hardware control with comparable ergonomics. If in doubt, Rust favors explicitness, though; it eschews magic. Did I whet your appetite for idiomatic Rust? Have a look at this Github project. I'd be thankful for contributions. Footnotes 1. Thanks to Florian Gilcher for the hint.↩ 2. Thanks to masklin for pointing out multiple inaccuracies.↩ 3. In the first version, I said that ok() would convert a Result into a boolean, which was wrong. Thanks to isaacg for the correction.↩ " }, { "title": "Making Myself Obsolete", "url": "https://endler.dev/2017/obsolete/", "body": " The Stegosaurus had better days 150 million years ago. Source: Paleontologists once thought it had a brain in its butt. In December 2015, I was looking for static analysis tools to integrate into trivago's CI process. The idea was to detect typical programming mistakes automatically. That's quite a common thing, and there are lots of helpful tools out there which fit the bill. So I looked for a list of tools… To my surprise, the only list I found was on Wikipedia — and it was outdated. There was no such project on Github, where most modern static analysis tools were hosted. Without overthinking it, I opened up my editor and wrote down a few tools I found through my initial research. After that, I pushed the list to Github. I called the project Awesome Static Analysis. Fast forward two years, and the list has grown quite a bit. So far, it has 75 contributors, 277 forks, and over 2000 stars. (Thanks for all the support!) (Update May 2018: 91 contributors, 363 forks, over 3000 stars) Around 1000 unique visitors find the list every week. Not much by any means, but I feel obliged to keep it up-to-date, because it has become an essential source of information for many people. It now lists around 300 tools for static analysis. Everything from Ada to TypeScript is on there. What I find particularly motivating is that the tool authors themselves now create pull requests to add their tools! There was one problem, though: the list of pull requests got longer and longer as I was busy doing other things. The list of Github pull requests for awesome-static-analysis Adding contributors I always try to make team members out of regular contributors. My friend and colleague Andy Grunwald, as well as Ouroboros Chrysopoeia, are both valuable collaborators. They help me weed out new PRs whenever they find the time. But let's face it: checking the pull requests is a dull, manual task. What needs to be checked for each new tool can be summarized like this:

- Formatting rules are satisfied
- Project URL is reachable
- License annotation is correct
- Tools of each section are alphabetically ordered
- Description is not too long

I guess it's obvious what we should do with that checklist: automate it!
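To give you an idea of how small such a check can be, here is a hypothetical sketch in Rust; it is not the bot's actual code, just the alphabetical-ordering rule reduced to its core:

// Hypothetical sketch: find the first pair of neighboring entries
// that is not in alphabetical order (case-insensitive).
fn find_unsorted(names: &[&str]) -> Option<usize> {
    names
        .windows(2)
        .position(|pair| pair[0].to_lowercase() > pair[1].to_lowercase())
}

fn main() {
    let tools = ["clippy", "hadolint", "eslint", "shellcheck"];
    match find_unsorted(&tools) {
        Some(i) => println!("\"{}\" is out of order", tools[i + 1]),
        None => println!("All entries are sorted."),
    }
}

The real checks have to parse the Markdown source first, but none of them is much more complicated than this.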
A linter for linting linters So why not write an analysis tool, which checks our list of analysis tools! What sounds pretty meta, is actually pretty straightforward. With every pull request, we trigger our bot, which checks the above rules and responds with a result. The first step was to read the Github documentation about building a CI server. Just for fun, I wanted to create the bot in Rust. The two most popular Github clients for Rust were github-rs (now deprecated) and hubcaps. Both looked pretty neat, but then I found afterparty, a “Github webhook server”. The example looked fabulous: #[macro_use] extern crate log; extern crate env_logger; extern crate afterparty; extern crate hyper; use afterparty:: Delivery, Hub ; use hyper::Server; pub fn main() env_logger::init().unwrap(); let addr = format!("0.0.0.0: ", 4567); let mut hub = Hub::new(); hub.handle("pull_request", |delivery: &Delivery| match delivery.payload Event::PullRequest ref action, ref sender, .. => // TODO: My code here! println!("sender action ", sender.login, action) _ => (), ); let srvc = Server::http(&addr[..]) .unwrap() .handle(hub); println!("listening on ", addr); srvc.unwrap(); This allowed me to focus on the actual analysis code, which makes for a pretty boring read. It mechanically checks for the things mentioned above and could be written in any language. If you want to have a look (or even contribute!), check out the repo. Talking to Github After the analysis code was done, I had a bot, running locally, waiting for incoming pull requests. But how could I talk to Github? I found out, that I should use the Status API and send a POST request to /repos/mre/awesome-static-analysis/statuses/:sha (:sha is the commit ID that points to the HEAD of the pull request): "state": "success", "description": "The build succeeded!" I could have used one of the existing Rust Github clients, but I decided to write a simple function to update the pull request status code. fn set_status(status: Status, desc: String, repo: &str, sha: &str) -> Result<reqwest::Response> let token = env::var("GITHUB_TOKEN")?; let client = reqwest::Client::new(); let mut params = HashMap::new(); params.insert("state", format!(" ", status)); params.insert("description", desc); println!("Sending status: :#? ", params); let status_url = format!("https://api.github.com/repos/ /statuses/ ", repo, sha); println!("Status url: ", status_url); Ok(client .request( reqwest::Method::Post, &format!( " ?access_token= ", status_url, token, ), ) .json(&params) .send()?) You can see that I pass in a Github token from the environment and then I send the JSON payload as a post request using the reqwest library. That turned out to become a problem in the end: while afterparty was using version 0.9 of hyper, reqwest was using 0.11. Unfortunately, these two versions depend on a different build of the openssl-sys bindings. That’s a well-known problem and the only way to fix it, is to resolve the conflict. I was stuck for a while, but then I saw, that there was an open pull request to upgrade afterparty to hyper 0.10. So inside my Cargo.toml, I locked the version of afterparty to the version of the pull request: [dependencies] afterparty = git = "https://github.com/ms705/afterparty" This fixed the build, and I could finally move on. Deployment I needed a place to host the bot. Preferably for free, as it was a non-profit Open Source project. Also, the provider would have to run binaries. For quite some time, I was following a product named zeit. 
It runs any Docker container using an intuitive command line interface called now. I fell in love the first time I saw their demo on the site, so I wanted to give it a try. So I added a multi-stage Dockerfile to my project: FROM rust as builder COPY . /usr/src/app WORKDIR /usr/src/app RUN cargo build --release FROM debian:stretch RUN apt update && apt install -y libssl1.1 ca-certificates && apt clean -y && apt autoclean -y && apt autoremove -y COPY --from=builder target/release/check . EXPOSE 4567 ENTRYPOINT ["./check"] CMD ["--help"] The first part would build a static binary, the second part would run it at container startup. Well, that didn’t work, because zeit does not support multi-stage builds yet. The workaround was to split up the Dockerfile into two and connect them both with a Makefile. Makefiles are pretty powerful, you know? With that, I had all the parts for deployment together. # Build Rust binary for Linux docker run --rm -v $(CURDIR):/usr/src/ci -w /usr/src/ci rust cargo build --release # Deploy Docker images built from the local Dockerfile now deploy --force --public -e GITHUB_TOKEN=$ GITHUB_TOKEN # Set domain name of new build to `check.now.sh` # (The deployment URL was copied to the clipboard and is retrieved with pbpaste on macOS) now alias `pbpaste` check.now.sh Here’s the output of the deploy using now: > Deploying ~/Code/private/awesome-static-analysis-ci/deploy > Ready! https://deploy-sjbiykfvtx.now.sh (copied to clipboard) [2s] > Initializing… > Initializing… > Building > ▲ docker build Sending build context to Docker daemon 2.048 kBkB > Step 1 : FROM mre0/ci:latest > latest: Pulling from mre0/ci > ... > Digest: sha256:5ad07c12184755b84ca1b587e91b97c30f7d547e76628645a2c23dc1d9d3fd4b > Status: Downloaded newer image for mre0/ci:latest > ---> 8ee1b20de28b > Successfully built 8ee1b20de28b > ▲ Storing image > ▲ Deploying image > ▲ Container started > listening on 0.0.0.0:4567 > Deployment complete! The last step was to add check.now.sh as a webhook inside the awesome-static-analysis project settings. Now, whenever a new pull request is coming in, you see that little bot getting active! Outcome and future plans I am very pleased with my choice of tools: afterparty saved me from a lot of manual work, while zeit made deployment really easy. It feels like Amazon Lambda on steroids. If you look at the code and the commits for my bot, you can see all my little missteps, until I got everything just right. Turns out, parsing human-readable text is tedious. Therefore I was thinking about turning the list of analysis tools into a structured format like YAML. This would greatly simplify the parsing and have the added benefit of having a machine-readable list of tools that can be used for other projects. Update May 2018 While attending the WeAreDevelopers conference in Vienna (can recommend that), I moved the CI pipeline from zeit.co to Travis CI. The reason was, that I wanted the linting code next to the project, which greatly simplified things. First and foremost I don’t need the web request handling code anymore, because travis takes care of that. If you like, you can compare the old and the new version. " }, { "title": "Modern Day Annoyances - Digital Clocks", "url": "https://endler.dev/2017/digitial-clocks/", "body": "This morning I woke up to the beeping noise of our oven’s alarm clock. The reason was that I tried to correct the oven’s local time the day before — and I pushed the wrong buttons. 
As a result I didn’t set the correct time, instead, I set a cooking timer… and that’s what woke me up today. Let’s add a clock to the microwave! On occasions like these, I wonder why there’s a digital clock on every single household device these days. They’re integrated into microwaves, fridges, ovens, dishwashers, dryers, mixers — and that’s just the kitchen! There is an inflation of digital clocks on modern-day devices. A lot of times I was wondering why that is the case. Here’s my best guess: It’s easier to add a useless digital clock to the design than to leave it out. Say you are the engineer responsible for the control panel of a run-of-the-mill microwave. The microwave chip comes with a digital timer, which is perfect for showing the remaining time until the food is warmed up. Now the question is, what will the timer show when you don’t want to heat anything? Well, why not show the current time? It’s unobtrusive and adds value. Except that these digital clocks can be quite annoying: They run out of sync and show the wrong time. They get reset when being plugged off or there’s a power outage. (That’s the dreaded, blinking 00:00 we all learned to love.) They don’t automatically switch between summer and winter time (hey Germany!). That’s why I constantly need to look after those clocks. Let me tell you a secret: When I’m not warming stuff in the oven, I don’t want it to tell me the local time. I want the stove to be off. Why I have trouble setting the clock on our oven Our oven has three buttons related to time: plus, minus and a clock symbol. To set the time, you push the clock symbol. An arrow appears and the display changes to 00:00. You press time again and another arrow appears. Pressing it two more times shows a blinking clock symbol. Then you can use the + and - buttons to adjust the time. After that, you wait to confirm. Easy! The problem is, there is no immediate relationship between the controls and the result in the world. The underlying concept is called mapping and is prevalent in interface design. To add some functionality to a device you have two options: Add more buttons. Teach an existing button a new trick. Option 1 might dilute your beautiful design, while option 2 might mean frustration for your users. Neither option is appealing. Our oven maps multiple functions to the same button. But the most annoying thing is, that each device has a different mapping. Learning to set the time on my oven won’t help me with the dishwasher, which sports an entirely different interface! Takeaways Good industrial designs are few and far between. A clock on your product will most likely not add any additional value. In the best case it might be an annoyance, in the worst case it’s harmfully misleading. When given a choice, I prefer home appliances without clocks. Looking at today’s market, that’s harder than it sounds. Arguably, a device with a clock is cheaper than one without; just because the ones with timers get produced more often. Now I can understand why it took Steve Jobs two weeks to decide on a washing machine: We spent some time in our family talking about what’s the trade-off we want to make. We spent about two weeks talking about this. Every night at the dinner table. He chose a Miele Washing machine in the end - without a digital clock, I assume. 
" }, { "title": "Learn Some Rust During Hacktoberfest", "url": "https://endler.dev/2017/hacktoberfest/", "body": " Dirndl, Lederhose, Brezn, Beer, Rust Source: Designed by Freepik October is the perfect time to contribute to Open Source — at least according to Github and DigitalOcean. Because that’s when they organize Hacktoberfest, a global event where you get a free shirt and lots of street cred for creating pull requests. Read the official announcement here. Some people think they cannot contribute anything of value. Either because they lack the programming skills or because they don’t know where to start. This guide is trying to change that! Let me show you, how everybody can contribute code to Rust, a safe systems programming language. I was inspired to write this by a tweet from llogiq. 1. Find a great Rust project to work on We all want our work to be appreciated. Therefore I suggest to start contributing to medium-sized projects, because they gained some momentum but are still driven by a small number of maintainers, so help is always welcome. By contrast, tiny projects are mostly useful to the original author only, while large projects can be intimidating at first and have stricter guidelines. For now, let’s look at repositories with 5-100 stars, which were updated within this year. Github supports advanced search options based on Lucene syntax. language:Rust stars:5..100 pushed:>2017-01-01 Here’s a list of projects, which match this filter. 2. Install the Rust toolchain To start contributing, we need a working Rust compiler and the cargo package manager. Fortunately, the installation should be straightforward. I recommend rustup for that. Run the following command in your terminal, then follow the onscreen instructions. curl https://sh.rustup.rs -sSf | sh If you’re unsure, just accept the defaults. After the installation is done, we also need to get the nightly version of the compiler for later. rustup install nightly Questions so far? Find more detailed installation instructions here. 3. Fork the project and clone it to your computer First, click on the little fork button on the top right of the Github project page. Then clone your fork to your computer. git clone git@github.com:yourusername/project.git For more detailed instructions, go here. 4. Does it build? Before we start modifying the codebase, we should make sure that it is in a workable state. The following commands should work right away from inside the project folder. cargo build cargo test If not, you might want to consult the README for further instructions. (But feel free to choose another project.) 5. The magic sauce Here’s the trick: we use a linter called clippy to show us improvement areas in any Rust codebase. To get clippy, install it like so: cargo +nightly install clippy Afterwards, run it from the project root as often as you like. rustup run nightly cargo clippy This should give you actionable information on how to improve the codebase. 
Here’s some sample output: warning: useless use of `format!` --> src/mach/header.rs:420:49 | 420 | let error = error::Error::Malformed(format!("bytes size is smaller than an Mach-o header")); | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | = note: #[warn(useless_format)] on by default = help: for further information visit https://rust-lang-nursery.github.io/rust-clippy/v0.0.165/index.html#useless_format warning: this expression borrows a reference that is immediately dereferenced by the compiler --> src/mach/header.rs:423:36 | 423 | let magic = mach::peek(&bytes, 0)?; | ^^^^^^ help: change this to: `bytes` | = help: for further information visit https://rust-lang-nursery.github.io/rust-clippy/v0.0.165/index.html#needless_borrow Just try some of the suggestions and see if the project still compiles and the tests still pass. Check out the links to the documentation in the help section to learn more. Start small to make your changes easier to review. 6. Creating a Pull Request If you’re happy with your changes, now is the time to publish them! It’s best to create a new branch for your changes and then push it to your fork. git checkout -b codestyle git commit -am "Minor codestyle fixes" git push --set-upstream origin codestyle Afterwards, go to the homepage of your fork on Github. There should be a button titled Compare & pull request. Please add a meaningful description and then submit the pull request. Congratulations! You’ve contributed to the Rust ecosystem. Thank you! 🎉 Trophy case m4b/goblin fitzgen/cpp_demangle fdehau/tui-rs christophertrml/rs-natural Bonus! If all of the manual fixing and checking sounds too dull, you can automate step number 5 using rustfix by Pascal Hertleif (@killercup): rustfix --yolo && cargo check " }, { "title": "A Little Story About the `yes` Unix Command", "url": "https://endler.dev/2017/yes/", "body": "What’s the simplest Unix command you know? There’s echo, which prints a string to stdout and true, which always terminates with an exit code of 0. Among the series of simple Unix commands, there’s also yes. If you execute it without arguments, you get an infinite stream of y’s, separated by a newline: y y y y (...you get the idea) What seems to be pointless in the beginning turns out to be pretty helpful : yes | sh boring_installation.sh Ever installed a program, which required you to type “y” and hit enter to keep going? yes to the rescue! It will carefully fulfill its duty, so you can keep watching Pootie Tang. Writing yes Here’s a basic version in… uhm… BASIC. 10 PRINT "y" 20 GOTO 10 And here’s the same thing in Python: while True: print("y") Simple, eh? Not so quick! Turns out, that program is quite slow. python yes.py | pv -r > /dev/null [4.17MiB/s] Compare that with the built-in version on my Mac: yes | pv -r > /dev/null [34.2MiB/s] So I tried to write a quicker version in Rust. Here’s my first attempt: use std::env; fn main() let expletive = env::args().nth(1).unwrap_or("y".into()); loop println!(" ", expletive); Some explanations: The string we want to print in a loop is the first command line parameter and is named expletive. I learned this word from the yes manpage. I use unwrap_or to get the expletive from the parameters. In case the parameter is not set, we use “y” as a default. The default parameter gets converted from a string slice (&str) into an owned string on the heap (String) using into(). Let’s test it. 
cargo run --release | pv -r > /dev/null
   Compiling yes v0.1.0
    Finished release [optimized] target(s) in 1.0 secs
     Running `target/release/yes`
[2.35MiB/s]

Whoops, that doesn't look any better. It's even slower than the Python version! That caught my attention, so I looked around for the source code of a C implementation. Here's the very first version of the program, released with Version 7 Unix and famously authored by Ken Thompson on Jan 10, 1979:

main(argc, argv)
char **argv;
{
    for (;;)
        printf("%s\n", argc>1? argv[1]: "y");
}

No magic here. Compare that to the 128-line version from the GNU coreutils, which is mirrored on Github. After 25 years, it is still under active development! The last code change happened around a year ago. That's quite fast:

# brew install coreutils
gyes | pv -r > /dev/null
[854MiB/s]

The important part is at the end:

/* Repeatedly output the buffer until there is a write error; then fail. */
while (full_write (STDOUT_FILENO, buf, bufused) == bufused)
    continue;

Aha! So they simply use a buffer to make write operations faster. The buffer size is defined by a constant named BUFSIZ, which gets chosen on each system so as to make I/O efficient (see here). On my system, that was defined as 1024 bytes. I actually had better performance with 8192 bytes. I've extended my Rust program:

use std::env;
use std::io::{self, BufWriter, Write};

const BUFSIZE: usize = 8192;

fn main() {
    let expletive = env::args().nth(1).unwrap_or("y".into());
    let mut writer = BufWriter::with_capacity(BUFSIZE, io::stdout());
    loop {
        writeln!(writer, "{}", expletive).unwrap();
    }
}

The important part is that the buffer size is a multiple of four, to ensure memory alignment. Running that gave me 51.3MiB/s. Faster than the version that comes with my system, but still way slower than the results from this Reddit post that I found, where the author talks about 10.2GiB/s. Update Once again, the Rust community did not disappoint. As soon as this post hit the Rust subreddit, user nwydo pointed out a previous discussion on the same topic. Here's their optimized code, which breaks the 3GB/s mark on my machine:

use std::env;
use std::io::{self, Write};
use std::process;
use std::borrow::Cow;
use std::ffi::OsString;

pub const BUFFER_CAPACITY: usize = 64 * 1024;

pub fn to_bytes(os_str: OsString) -> Vec<u8> {
    use std::os::unix::ffi::OsStringExt;
    os_str.into_vec()
}

fn fill_up_buffer<'a>(buffer: &'a mut [u8], output: &'a [u8]) -> &'a [u8] {
    if output.len() > buffer.len() / 2 {
        return output;
    }
    let mut buffer_size = output.len();
    buffer[..buffer_size].clone_from_slice(output);
    while buffer_size < buffer.len() / 2 {
        let (left, right) = buffer.split_at_mut(buffer_size);
        right[..buffer_size].clone_from_slice(left);
        buffer_size *= 2;
    }
    &buffer[..buffer_size]
}

fn write(output: &[u8]) {
    let stdout = io::stdout();
    let mut locked = stdout.lock();
    let mut buffer = [0u8; BUFFER_CAPACITY];
    let filled = fill_up_buffer(&mut buffer, output);
    while locked.write_all(filled).is_ok() {}
}

fn main() {
    write(&env::args_os().nth(1).map(to_bytes).map_or(
        Cow::Borrowed(&b"y\n"[..]),
        |mut arg| {
            arg.push(b'\n');
            Cow::Owned(arg)
        },
    ));
    process::exit(1);
}

Now that's a whole different ballgame! We prepare a filled string buffer, which will be reused for each loop. Stdout is protected by a lock. So, instead of constantly acquiring and releasing it, we keep it locked the whole time. We use the platform-native std::ffi::OsString and std::borrow::Cow to avoid unnecessary allocations. The only thing that I could contribute was removing an unnecessary mut.
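As a footnote: if you want to play with buffer sizes yourself and don't have pv at hand, a crude harness like this will do. This is a quick sketch of my own, not code from the post or the Reddit thread, and the numbers will vary wildly between machines:

use std::fs::File;
use std::io::{BufWriter, Write};
use std::time::Instant;

fn main() {
    // Write "y\n" to /dev/null for roughly one second per buffer size
    // and report the approximate throughput.
    for &size in &[64, 1024, 8192, 64 * 1024] {
        let sink = File::create("/dev/null").expect("needs a Unix-like system");
        let mut writer = BufWriter::with_capacity(size, sink);
        let start = Instant::now();
        let mut bytes: u64 = 0;
        while start.elapsed().as_secs() < 1 {
            writer.write_all(b"y\n").unwrap();
            bytes += 2;
        }
        println!("{:>6} byte buffer: ~{} MiB/s", size, bytes / 1024 / 1024);
    }
}

Benchmarking is a deep rabbit hole, though.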
😅 Lessons learned The trivial program yes turns out not to be so trivial after all. It uses output buffering and memory alignment to improve performance. Re-implementing Unix tools is fun and makes me appreciate the nifty tricks, which make our computers fast. " }, { "title": "Lightning Fast Image Previews with Pure CSS and LQIP", "url": "https://endler.dev/2017/image-previews/", "body": " Source: Adapted from Freepik My website is reasonably fast. I hope that every page load feels snappy, no matter your device or location. That should not come as a surprise. After all, I’m just using plain HTML and CSS. JavaScript is avoided whenever possible. There was one thing left, which really annoyed me: layout reflow after images got loaded. The problem is, that the image dimensions are not known when the text is ready to be displayed. As a result, the text will be pushed down on the screen as soon as an image is loaded above. Also, while an image is loading, there is no preview, just blank space. Here’s what that looks like on a slower connection: Illustration of a flash of unstyled content I could fix that, by hardcoding the image width and height, but that would be tedious and error-prone. And there would be no preview. So I was wondering, what others were doing. 🤔 Tiny image thumbnails I vaguely remembered, that Facebook uses tiny preview thumbnails in their mobile app. They extract the quantization table from the JPEG header to render the preview. This information is stored on the client, so it doesn’t need to be downloaded every time. Unfortunately, this approach requires full control over the image encoder. It works for apps, but hardly for websites. The search continued. Until my colleague Tobias Baldauf introduced me to LQIP (Low-Quality Image Placeholders). Here’s the idea: Load the page including inlined, low-quality image thumbnails. Once the page is fully loaded (e.g. when the onload event is fired), lazy load full quality images. Unfortunately, this technique requires JavaScript. Nevertheless, I liked the idea, so I started experimenting with different image sizes and formats. My goal was to create the smallest thumbnails using any standard image format. Benchmark Here are 15 pixel wide thumbnails encoded in different file formats: Comparison of different image formats when creating thumbnails I used different tools to create the thumbnails. For JPEG and PNG encoding, I used svgexport. svgexport img.svg img.png "svg background:white; " 15: 1% For webp, I used cwebp: cwebp img.png -o img.webp The gif was converted using an online tool and optimized using gifsicle: gifsicle -O3 < img.gif > img_mini.gif Comparison WebP is the smallest, but it’s not supported by all browsers. Gif was second, but when resizing the image and applying the blur filter, I was not happy with the result. In the end, I settled for PNG, which provided an excellent tradeoff between size and quality. I optimized the images even further using oxipng, which supports zopfli compression. With that, I end up with thumbnails of around 300-400 bytes in size. I integrated the thumbnail creation process into my build toolchain for the blog. The actual code to create the images is rather boring. If you really want to have a look, it’s on Github. Avoiding JavaScript Here is the skeleton HTML for the image previews: <figure> <div class="loader"> <object data="image.svg" type="image/svg+xml"></object> <img class="frozen" src="data:image/png;base64,..." 
/> </div> </figure> The trick is to wrap both the full-size image and the preview image into a loader div, which gets a width: auto CSS attribute: .loader position: relative; overflow: hidden; width: auto; I wrap the SVG into an object tag instead of using an img element. This has the benefit, that I can show a placeholder in case the SVG can’t be loaded. I position the object at the top left of the loader div. .loader object position: absolute; .loader img, .loader object display: block; top: 0; left: 0; width: 100%; Here’s the placeholder hack including some references: /* https://stackoverflow.com/a/29111371/270334 */ /* https://stackoverflow.com/a/32928240/270334 */ object position: relative; float: left; display: block; &::after position: absolute; top: 0; left: 0; display: block; width: 1000px; height: 1000px; content: ""; background: #efefef; The last part is the handling of the thumbnails. Like most other sites, I decided to apply a blur filter. In a way, it looks like the image is frozen, so that’s what I called the CSS selector. I also applied a scaling transformation to achieve sharp borders. .frozen -webkit-filter: blur(8px); -moz-filter: blur(8px); -o-filter: blur(8px); -ms-filter: blur(8px); filter: blur(8px); transform: scale(1.04); animation: 0.2s ease-in 0.4s 1 forwards fade; width: 100%; @keyframes fade 0% opacity: 1; 100% opacity: 0; I use CSS animations instead of JavaScript. The duration of the animation is based on the 95% percentile load time of all visitors of the page. Although it’s just an approximation, this should work for most readers. Result No JavaScript needed Works on all modern browsers Supports a fallback in case the main image can’t be loaded Tiny overhead Resources Introducing LQIP – Low Quality Image Placeholders How Medium does progressive image loading SQIP, a new preview technique using pure SVG " }, { "title": "Go vs Rust? Choose Go.", "url": "https://endler.dev/2017/go-vs-rust/", "body": " I wrote this article a long time ago. In the meantime, my opinion on some aspects has changed. In order to give a more balanced perspective on the pros and cons, I suggest to read this comparison on Go vs Rust instead, which I wrote in collaboration with Shuttle 🚀 Rust vs Go: A Hands-On Comparison Source: Gopher designed with Gopherize.me. Gears designed by Freepik “Rust or Go, which one should I choose?” is a question I get quite often. Both languages seem to be competing for the same user base and they both seem to be systems programming languages, so there must be a clear winner, right? Go: practical, pragmatic, plain The Golang learning curve over time, a straight line. I don’t think Go is an elegant language. Its biggest feature is simplicity. Go is not even a systems programming language. While it’s great for writing microservices and tooling around backend infrastructure, I would not want to write a kernel or a memory allocator with it. But with Go, you get things done — fast. Go is one of the most productive languages I’ve ever worked with. The mantra is: solve real problems today. Rust’s strong guarantees come at a cost The Rust learning curve over time, a bumpy ride. Rust in comparison is hard. It took me many months to become somewhat productive. You need to invest a serious amount of time to see any benefit. Rust is already a powerful language and it gets stronger every day. It feels much more like a pragmatic Haskell to me than a safer C. Don’t get me wrong: I love Rust, and it helped me become a better programmer. 
It is certainly a nice language to learn. The big question is whether it is the right choice for your next major project. Here's the thing: if you choose Rust, usually you need the guarantees that the language provides:

- Safety against null pointers, race conditions, and all sorts of low-level threats.
- Predictable runtime behavior (zero-cost abstractions and no garbage collector).
- (Almost) total control over the hardware (memory layout, processor features).
- Seamless interoperability with other languages.

If you don't require any of these features, Rust might be a poor choice for your next project. That's because these guarantees come with a cost: ramp-up time. You'll need to unlearn bad habits and learn new concepts. Chances are, you will fight with the borrow checker a lot when you start out. Case study: Primality by trial division Let's say you want to check if a number is prime. The easiest way is to check if we can divide the number by any smaller natural number (without a remainder). If not, we found a prime number! This approach is called trial division. Here's how to do that in Golang (courtesy of Rosetta Code):

func IsPrime(n int) bool {
    if n < 0 {
        n = -n
    }
    switch {
    case n < 2:
        return false
    default:
        for i := 2; i < n; i++ {
            if n%i == 0 {
                return false
            }
        }
    }
    return true
}

And here's the same thing in Rust:

pub fn is_prime(n: u64) -> bool {
    match n {
        0...1 => false,
        _ => {
            for d in 2..n {
                if n % d == 0 {
                    return false;
                }
            }
            true
        }
    }
}

At first sight, both solutions look pretty similar. But if we look closer, we can spot some differences:

- In Go, we use a simple switch-case statement; in Rust, we use a match statement, which is much more powerful.
- In Go, we use a simple for-loop to iterate over the numbers 2 to n; in Rust, we use a range expression (2..n).
- In Go, we use two return statements; in Rust, we have one return expression. In general, most things in Rust are expressions, which can be returned and assigned to a variable. Read more about expressions here.

In many areas, Rust is more functional than Golang. You could rewrite the above code using the any method, which is implemented for Range:

fn is_prime(n: u64) -> bool {
    match n {
        0...1 => false,
        _ => !(2..n).any(|d| n % d == 0),
    }
}

It might seem a little alien at first, but it will become second nature after a while. This was just a quick example, of course. I suggest you browse some code on Rosetta Code to get a better feeling for both languages. Case study: Finding duplicate words in text files If you're more of a visual type, here is a video where I write a simple concurrent program in Go and Rust to compare both languages. Some things I prefer in Go:

- Fast compile times
- Pragmatic problem-solving approach
- Nice ecosystem for typical DevOps tasks
- Batteries-included standard library
- IDE support
- Simple error handling
- The mascot 😉

Some things I prefer in Rust:

- Safety: no null pointers, no data races, …
- Fine-grained system control
- Incredible runtime speed (comparable with C/C++)
- Zero-cost abstractions
- Awesome, open-minded community
- Simple package management with cargo
- Support for generics in the form of traits
- C interop and FFI

Conclusion 99% of the time, Go is "good enough", and that 1% where it isn't, you'll know. Then take a look at Rust, because the two languages complement each other pretty well. If you're interested in hands-on Rust consulting, pick a date from my calendar and we can talk about how I can help. After all is said and done, Rust and Go are not really competitors.
" }, { "title": "Afraid of Makefiles? Don't be!", "url": "https://endler.dev/2017/makefiles/", "body": " What do clothes have to do with Makefiles? Find out in this post! Source: Illustration by Anindyanfitri - Freepik.com In the last few years, I’ve had the pleasure to work with a lot of talented Software Engineers. One thing that struck me is that many of them did not have any working knowledge of Makefiles and why they are useful. When faced with the task to automate a build process, they often roll their own shell scripts. Common culprits are called build.sh, run.sh or doall.sh in a project folder. They implement the same basic functionality over and over again: Parsing input parameters and environment variables. Manually managing dependencies between build steps. Error handling (…maybe). Along the way, they keep making the same basic mistakes: Incorrectly handling input parameters and environment variables. Missing dependencies between build steps. Forgetting to handle errors and — even worse — carrying on with the program execution. These are issues Makefiles were invented to solve. Makefiles are scary! If you think that make is scary, you probably think of complicated build machinery for big software projects. It doesn’t need to be that way. Let’s hear from the author of make, Stuart Feldman himself: It began with an elaborate idea of a dependency analyzer, boiled down to something much simpler, and turned into Make that weekend. Use of tools that were still wet was part of the culture. Makefiles were text files, not magically encoded binaries because that was the Unix ethos: printable, debuggable, understandable stuff. — The Art of Unix Programming (2003) Make was built in one weekend to solve a reoccuring problem in a simple way. Makefiles are simple! Before I leave the house, I need to get dressed. I use the same simple routine every time: Underpants, trousers, shirt, pullover, socks, shoes, jacket. Most likely you also have a routine, even though yours might be different. Some of these steps depend on each other. Make is useful for handling dependencies. Let’s try to express my routine as a Makefile. dress: trousers shoes jacket @echo "All done. Let's go outside!" jacket: pullover @echo "Putting on jacket." pullover: shirt @echo "Putting on pullover." shirt: @echo "Putting on shirt." trousers: underpants @echo "Putting on trousers." underpants: @echo "Putting on underpants." shoes: socks @echo "Putting on shoes." socks: pullover @echo "Putting on socks." If we execute the Makefile, we get the following output: $ make dress Putting on underpants. Putting on trousers. Putting on shirt. Putting on pullover. Putting on socks. Putting on shoes. Putting on jacket. All done. Let's go outside! What just happened? Noticed how the steps are in the correct order? By plainly writing down the dependencies between the steps, make helps us to execute them correctly. Each build step has the following structure: target: [dependencies] <shell command to execute> <shell command to execute> ... The first target in a Makefile will be executed by default when we call make. The order of the targets does not matter. Shell commands must be indented with a tab. Add an @ sign to suppress output of the command that is executed. If target isn’t a file you want to build, please add .PHONY <target> at the end of the build step. Common phony targets are: clean, install, run,… Otherwise, if somebody creates an install directory, make will silently fail, because the build target already exists. 
.PHONY: install install: npm install Congratulations! You’ve learned 90% of what you need to know about make. Next steps Real Makefiles can do much more! They will only build the files that have changed instead of doing a full rebuild. And they will do as much as possible in parallel. Just try to keep them simple please. " }, { "title": "Of Boxes and Trees - Smart Pointers in Rust", "url": "https://endler.dev/2017/boxes-and-trees/", "body": "Recently, I tried to implement a binary tree data structure in Rust. Each binary tree has a root value, a left, and a right subtree. I started from this Python implementation, which is quite straightforward. class Tree: def __init__(self, val, left=None, right=None): self.val = val self.left = left self.right = right This allows us to declare a fancy tree object like this: t = Tree(15, Tree(12, None, Tree(13)), Tree(22, Tree(18), Tree(100))) And the result can be visualized beautifully. (Yes I’ve drawn that myself.) A binary search tree representing our data structure Porting that code to Rust turned out to be a little… challenging. My first attempt looked quite innocuous. struct Tree root: i64, left: Tree, right: Tree, That’s pretty much a one-to-one translation of the Python definition — but rustc says no. error[E0072]: recursive type `Tree` has infinite size --> src/main.rs:1:1 | 1 | struct Tree | ^^^^^^^^^^^ recursive type has infinite size | = help: insert indirection (e.g., a `Box`, `Rc`, or `&`) at some point to make `Tree` representable Coming from memory-managed languages (like Python, PHP, or Ruby), I was confused by this. The problem is easy to understand, though. Computers have a limited amount of memory. It’s the compiler’s job to find out how much memory to allocate for each item. In our case, it infers the following: A tree is a structure containing an i64, and two trees. Each of these trees is a structure containing an i64, and two trees. Each of these… You get the idea. Tree i64, Tree, Tree Tree i64, Tree ... , Tree ... // The next expansion won't fit on the page anymore Since we don’t know how many subtrees our tree will have, there is no way to tell how much memory we need to allocate up front. We’ll only know at runtime! Rust tells us how to fix that: by inserting an indirection like Box, Rc, or &. These are different “pointer types” in Rust. They all point to places in memory. So, instead of knowing the total size of our tree structure, we just know the point in memory where the tree is located. But that’s enough to define the tree structure. These pointer types allow us to do that safely and without manual memory management. They all offer different guarantees and you should choose the one that fits your requirements best. & is called a borrow in Rust speech. It’s the most common of the three. It’s a reference to some place in memory, but it does not own the data it points to. As such, the lifetime of the borrow depends on its owner. Therefore we would need to add lifetime parameters here. This can make it tedious to use. struct Tree<'a> root: i64, left: &'a Tree<'a>, right: &'a Tree<'a>, Box is a smart pointer with zero runtime overhead. It owns the data it points to and stores it on the heap. We call it smart because when it goes out of scope, it will first drop the data it points to and then itself. No manual memory management required, which is neat. ✨ struct Tree root: i64, left: Box<Tree>, right: Box<Tree>, Rc is another smart pointer. It’s short for “reference-counting”. 
It keeps track of the number of references to the data structure internally. As soon as the number of references is down to zero, it cleans up after itself. Choose Rc if you need to have multiple owners of the same data in one thread. For multithreading, there’s also Arc (atomic reference count). struct Tree root: i64, left: Rc<Tree>, right: Rc<Tree>, Putting the tree into a box All three options are totally valid. Which one you should choose, depends on your use-case. A rule of thumb is to keep it simple. In my case, I chose to use a Box, because I did not need any special guarantees. Making subtrees optional The next problem I faced was that I could not instantiate a tree structure. The left and right subtree have the type Box<Tree>, but at some point I would need an empty subtree. In the Python example, I used None to signal the end of my data structure. Thanks to Rust’s Option type we can do the same: struct Tree root: i64, left: Option<Box<Tree>>, right: Option<Box<Tree>>, After all of this, we can create our first tree: Tree root: 15, left: Some(Box::new(Tree root: 12, left: None, right: Some(Box::new(Tree root: 13, left: None, right: None, )), )), right: Some(Box::new(Tree root: 22, left: Some(Box::new(Tree root: 18, left: None, right: None, )), right: Some(Box::new(Tree root: 100, left: None, right: None, )), )), ; Depending on your point of view, you might say this is either verbose or explicit. Compared to the Python version, it looked a bit too cluttered for my taste. Can we do better? Chris McDonald helped me come up with the following representation: Tree::new(15) .left( Tree::new(12) .right(Tree::new(13)) ) .right( Tree::new(22) .left(Tree::new(18)) .right(Tree::new(100)) ); To me, this is much easier on the eye. Here’s the full tree implementation that makes this possible: #[derive(Default)] struct Tree root: i64, left: Option<Box<Tree>>, right: Option<Box<Tree>>, impl Tree fn new(root: i64) -> Tree Tree root: root, ..Default::default() fn left(mut self, leaf: Tree) -> Self self.left = Some(Box::new(leaf)); self fn right(mut self, leaf: Tree) -> Self self.right = Some(Box::new(leaf)); self Update: Danny Grein mentioned on Twitter, that we can support the following syntax by implementing From<i64> for Tree: root(15) .left( root(12) .right(13) ) .right( root(22) .left(18) .right(100) ); Why did it just work in Python? Now you might be wondering why our tree implementation worked so flawlessly in Python. The reason is that Python dynamically allocates memory for the tree object at runtime. Also, it wraps everything inside a PyObject, which is kind of similar to Rc from above — a reference-counted smart pointer. Rust is more explicit here. It gives us more flexibility to express our needs but we also need to know about all the possible alternatives to make good use of them. My advice is to stay away from smart pointers if a simple borrow will do. If lifetimes get in the way however or you need additional guarantees like thread-safety, smart pointers are a great addition to your toolkit. The Rust documentation is a good starting point to learn more about smart pointers. Also, read “Idiomatic tree and graph-like structures in Rust” for some clever use of allocators in case your tree needs to be mutable at runtime. " }, { "title": "Why Type Systems Matter", "url": "https://endler.dev/2017/why-type-systems-matter/", "body": "I’ve written most of my code in dynamically typed languages such as Python or PHP. 
But ever since dabbling with Rust, I’ve developed a passion for static type systems. It began to feel very natural to me, like a totally new way to express myself. Types are here to help With types, you communicate your guarantees and expectations, both to the machine and to other developers. Types express intent. As a programmer, you’ve probably gained some intuition about types.

sentence = "hello world"

You might guess that sentence is a string. It’s in quotes, after all. It gets a little more tricky if the type gets inferred from some other location.

sentence = x

Is sentence still a string? Uhm… we don’t know. It depends on the type of x. Maybe x is a number, and so sentence is also a number? Maybe x used to be a string, but after refactoring it is now a byte array? Fun times had by all. 🎉 What about this one?

filesize = "5000"  # Size in bytes

Here, we express a file size as a string. While this might work, it’s an unsettling idea. Even simple calculations might lead to unexpected results:

file1 = "5000"
file2 = "3000"
total = file1 + file2
print(total)  # prints '50003000'

How can we fix that? We can safely assume that a file size is always a number. To be more precise, it must be a positive, natural number. There can be no negative file size, and our smallest block of memory is one byte (on all but the most obscure systems). And since we’re dealing with a discrete machine here, we know it can only be a file size the computer can handle. If only we could express all of this in a precise way… This is where type systems enter the stage. In Rust, you could define a File type with a field named size.

struct File {
    name: String,
    size: usize,
}

The usize gives you the guarantee to always be big enough to hold any pointer into memory (on 64-bit computers, usize = u64). Now there is no more ambiguity about the type of size. You can’t even create an invalid file object:

// Error: `name` can't be a number, and `size` can't be a string.
let weird_file = File {
    name: 123,
    size: "hello",
};

The type system will prevent invalid state. It will simply not allow you to break your own rules. It will hold you accountable for your design choices. Dare I say it: it becomes an extension of your brain. After some time, you start to rely on the type checker. “If it compiles, it runs” is a powerful mantra. Types improve readability and provide context Consider the following Python snippet:

def filter_files(files):
    matches = []
    for file in files:
        if file.status == 0:
            matches.append(file)
    return matches

What does 0 represent? We can’t say. We lack the context! The story gets a little clearer once we define an enum type like this:

from enum import Enum

class FileStatus(Enum):
    OPEN = 0
    CLOSED = 1

Our example from above becomes

def filter_files(files):
    matches = []
    for file in files:
        if file.status == FileStatus.OPEN:
            matches.append(file)
    return matches

In a larger codebase, FileStatus.OPEN is much easier to search for than 0. Note: The native enum type was introduced very late in the history of Python. It serves as a nice example of how enhancing the type system can help improve readability. When you combine different types, magic happens. All pieces suddenly fall into place when you choose your types wisely. Out of nowhere, the compiler will start checking your design decisions and whether all your types work well together. It will point out flaws in your mental model. This gives you a great amount of confidence during refactoring.
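Since the rest of this post leans on Rust, here is how the same FileStatus idea might look there. This is a sketch of mine, not from the original article; the names mirror the Python version above:

#[derive(PartialEq)]
enum FileStatus {
    Open,
    Closed,
}

struct File {
    name: String,
    status: FileStatus,
}

// Only files with status `Open` pass through the filter.
fn filter_files(files: Vec<File>) -> Vec<File> {
    files
        .into_iter()
        .filter(|file| file.status == FileStatus::Open)
        .collect()
}

Here the compiler, not a code review, guarantees that nobody compares a status against a bare 0.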
For example, let’s think about sorting things. When I think of sorting, I first think about a list of numbers:

sorted([1,5,4,3,2])  # [1,2,3,4,5]

That’s the happy path. How about this one?

sorted(1)

Ouch. This can’t work because 1 is a single number and not a collection! If we forget to check the type before we pass it to sorted, we get an error while the program runs.

sorted([1, "fish"])

In Python 2, this would result in [1, "fish"] (because strings will be compared by length). Edit: Reddit user jcdyer3 pointed out that the real reason is that when incomparable types are compared, they sort by their type, so all ints will come before all strings. (It’s a CPython implementation detail.) 1 < fish, according to Python 2 (illustration: Freepik). Since Python 3, this throws an exception.

TypeError: '<' not supported between instances of 'str' and 'int'

Much better! One less source of error. The problematic thing, though, is that this happens at runtime. That’s because of Python’s dynamic typing. We could have avoided that with a statically typed language.

fn sorted<T>(collection: &mut [T])
where
    T: PartialOrd,
{
    // TODO: Sort the collection here.
}

Looks scary, but it really isn’t. We define a function named sorted, which takes one input parameter named collection. The type of collection consists of four parts: The & means that we “borrow” the collection; we don’t own it. After the function returns, it will still exist. It won’t be cleaned up. The mut means that the collection is mutable. We are allowed to modify it. [T] indicates that we expect a list/slice/vector as input. Everything else will be rejected at compile time (before the program even runs). PartialOrd is the magic sauce. It is a trait, which is something like an interface. It means that all elements T in the collection must be partially ordered. All of this information helps the compiler to prevent us from shooting ourselves in the foot. And we can understand the inputs and outputs of the function without looking elsewhere. Takeaways Types force developers to do their homework and think about the guarantees and limitations of their code. Don’t think of types as constraints; think of them as a safety net which will protect you from your own flawed mental models. Always choose the type which most precisely expresses your intentions. If there is no perfect type in the standard library, create your own from simpler types. Following these rules, I found that I was magically guided towards the most elegant representation of my ideas. My code became much more idiomatic. " }, { "title": "Being a Professional Programmer", "url": "https://endler.dev/2017/professional-programming/", "body": "When I was around 12, I set myself the goal to become a professional programmer. I can tell, because at that time I made the conscious decision to use my right hand to control the mouse — even though I’m left-handed. My reasoning was that if I ever had to help out a colleague with a computer problem, I sure did not want to move her mouse to the other side before getting started. That would be awkward. (Of course, I did not foresee the advent of the wireless mouse… As a matter of fact, I still use the right hand out of habit.) One thing I always wanted to know is what a typical workday of a programmer looked like. Was I wasting my time by pursuing this career? Only later I found the answer — but I had to become a professional programmer myself. This article aims to save you from a few years of uncertainty.
Before you dig into this, be sure to read the first part of this series titled “Why I love Programming”. What’s the difference between “professional” and “hobby” programming? In one word: accountability. You are expected to be responsible. Programming in your free time is like throwing a party without having to clean up: pure fun! If you get bored you’re free to move on. Not so in professional programming, where you’re expected to get the job done. Every application requires constant bug fixing, refactoring and sometimes even monkey patching. Maintaining code is no amusement park; especially if it’s not your own. Being a Junior Developer Fresh out of school you might think you’re a pretty kick-ass programmer. Let me tell you: you’re not. You wouldn’t guess what talented people can do with these blinking machines. You’ll have tons of things to learn in the first few years. Professional software development is a lengthy process. Writing readable, well-tested, well-documented code is a substantial effort. You will need patience, lots of it. Both, with yourself and with others. As a junior, you only think in black and white. You look at some code, and it’s all wrong. Who in their right mind created this horrible monstrosity?! As you become more experienced, you’ll see the shades of grey. Eventually, you’ll understand that those neckbeards were not slower than you, but more careful. You learn how to test your code, how to document it. You even begin to appreciate UML diagrams. Becoming obsolete “The world is moving too fast. What you learned today is obsolete tomorrow. Why bother?”. I’ve heard that saying countless times throughout my career. It’s both, popular and wrong. If a skill becomes obsolete, it’s not a skill. Throughout your career you don’t want to be known as “the Jenkins guy”, you want to be the expert in Software Quality. Hint: If you don’t know what Jenkins is, that’s the whole point. You should not narrow down your scope too much. The right skills never become obsolete. From time to time it happens, that due to some new company policy your beautiful creation will become obsolete. As depressing as it sounds: it’s a regular part of the software business. You need to adapt. One advice I can give you is not to take it too seriously. Drop the project, keep the wisdom. Embrace change. Writing software in a non-perfect world A professional programmer has to deal with deficiencies all the time. The game is called “balancing constraints”. Deadlines, budgets, and code quality are just a few competing constraints we have to consider. Elegant designs fade away in the face of reality. In the end you want to earn money with your software, so you have to ship it! The best developers I know, keep the balance between pragmatism and elegance. They know which parts matter and which don’t. Those who don’t will be replaced when there’s a need. For me, I was always leaning more towards elegance. That’s just a nicer way to say I was a perfectionist. I needed to learn the pragmatic part through hard work. Mentoring less experienced Programmers The better you become at programming, the less you code. Instead, you will spend more time thinking about Software Architecture, high-level designs and splitting up the work into smaller junks for other developers to consume. You will start mentoring Junior Developers. Recruiting will require a lot of your attention. You will spend your time in Meetings, discussing project goals with business people. One might say, you take the role of a mediator. 
Others might call you a manager. Once you know the ins and outs of the business, you are an essential asset for the company. You might get asked to become a manager, or at least managing projects will slowly feel like a natural extension of your responsibilities. But beware! This slow and gradual process is dangerous. Moving back to being a full-time programmer is not easy. During the time you were busy with project management, others were busy improving their coding skills. You can try to keep up-to-date in your free time but that’s hard. I’ve seen excellent developers become great managers. At some point in your career it’s a decision you need to make for yourself. However you decide, it pays off to invest some time into learning how to communicate. Empathy plays a prominent role in that. Developing software as a team is so complicated that a lot of time is spent on aligning goals and communicating problems. In fact, communication is what you get paid for. This includes documentation, tests and the code itself. Talk to others, listen to their problems. Read books about Software Project Management, even though you don’t want to be a manager yourself. It will help you understand the role of your boss. A word about money There are many good reasons to work in IT, but money is not one of them. While it can be tempting to base your career decisions on prospective salary, don’t do it. You will be very unhappy. You will spend eight hours or more each day sitting in front of a blinking cursor. That’s a lot of time, and time is much more valuable than money. Don’t get me wrong. There’s plenty of jobs that pay well. You will most likely not get rich, though. If you want to make it big, I can’t help you. Maybe look into Real Estate or so… The only way to get rich as a developer is to work on something really hard, put in lots of hours and get lucky. Startups, basically. Keep in mind: One Bill Gates takes a thousand failed attempts. Another way is to stop being a programmer and become a manager instead. I’ve already shared my opinion on that in the last section. Final words While you should learn to read (and maybe write) code, working as a professional programmer is not for everyone. You might ask: “Is it worth it?”. For me it was the right decision. Hopefully I could help you to make your own. " }, { "title": "The Future of Rust", "url": "https://endler.dev/2017/future-of-rust/", "body": "Let me first point out the obvious: yes, the title is a little sensationalist. Also you might be asking why I should be entitled to talk about the future of Rust. After all, I’m neither part of the Rust core team, nor a major contributor to the Rust ecosystem. To that I answer: why not? It’s fun to think about the future of systems programming in general and Rust in particular. Ferris is the inofficial Rust mascot Source: Illustration provided by FreePik.com You might have heard of the near-term goals that the core team has committed itself to. Faster compile times and a more gentle learning curve come to mind. This post is not about that. Instead, I want to explore some more exotic areas where Rust could shine in five to ten years from now. To make it big, we need both, roots and wings. Data Science Right now, the most popular languages for Data Science are Python, Java, R, and C++. Programming language popularity for data science (Source). 
We’ve observed that while prototypes are mostly written in dynamically typed languages like Python and R, once an algorithm reaches production level quality it is often rewritten in faster languages such as C++ for scalability. It is not unthinkable that Rust is going to be some healthy competition for C++ in the near future. The benchmarks of leaf, a machine learning library written in Rust, are already nothing short of impressive. Blockbuster games Games are another area where Rust might shine. It’s financially attractive for Game Studios to support multiple platforms without much effort. Cargo and rustup make cross-compiling easy. Modern libraries slowly fill the tooling gaps for large-scale game development. Rust’s support for the Vulkan 3D graphics API might already be the best of class. The killer feature though is the unique combination of safety and performance. If you ship a game to a million players and they throw money at you, you’ll better make sure that it doesn’t crash… right? That said, the first AAA Rust game might still be far in the future. Here’s Blizzard’s standpoint on Rust in 2017. Systems Engineering Maybe — eventually — we will also see formal verification of the Rust core. Projects like RustBelt would then open new opportunities in safety-focused industries like the Space industry. Wouldn’t it be nice to safely land a Spacecraft on Mars that is controlled by Rust? (Or by one of its spiritual successors.) I wonder if SpaceX is experimenting with Rust already… Integrating with other languages There are many other areas I haven’t even mentioned yet. For example, financial and medical software or Scientific Computing, just to name a few. In all cases, Rust might be a good fit. Right now the biggest barrier to entry is probably the huge amount of legacy code. Many industries maintain large codebases in Cobol, C or Fortran that are not easily rewritten. Fortunately, Rust has been proven to work very nicely with other languages. Partly because of strong C-compatibility and partly because there is no Runtime or Garbage Collector. A typical pattern is to optimize some core part of an application in Rust that has hard safety/performance requirements, while leaving the rest untouched. I think this symbiosis will only become stronger in the long run. There are even ambitious projects like Corrode which attempt to translate C code to Rust automatically. Summary Overall I see huge potential for Rust in areas where safety, performance or total control over the machine are essential. With languages like Rust and Crystal, a whole class of errors is a thing of the past. No null pointers, no segmentation faults, no memory leaks, no data races. I find it encouraging that future generations of programmers will take all that for granted. " }, { "title": "Launching a URL Shortener in Rust using Rocket", "url": "https://endler.dev/2017/rust-url-shortener/", "body": "One common systems design task in interviews is to sketch the software architecture of a URL shortener (a bit.ly clone, if you may). Since I was playing around with Rocket – a web framework for Rust – why not give it a try? A rocket travelling through space Requirements A URL shortener has two main responsibilities: Create a short URL for a longer one (d’oh!). Redirect to the longer link when the short link is requested. Let’s call our service rust.ly (Hint, hint: the domain is still available at the time of writing…). 
First, let’s create a new Rust project:

cargo new --bin rustly

Next, we add Rocket to our Cargo.toml:

[dependencies]
rocket = "0.2.4"
rocket_codegen = "0.2.4"

Warning: Most likely you need to get the very newest Rocket version. Otherwise, you might get some… entertaining error messages. Find the newest version on crates.io. Since Rocket requires cutting-edge Rust features, we need to use a recent nightly build. Rustup provides a simple way to switch between stable and nightly. 🤔 Nightly Rust might no longer be required. Has anyone tried without and can report back?

rustup update && rustup override set nightly

A first prototype Now we can start coding our little service. First, let’s write a simple “hello world” skeleton to get started. Put this into src/main.rs:

#![feature(plugin)]
#![plugin(rocket_codegen)]

extern crate rocket;

#[get("/<id>")]
fn lookup(id: &str) -> String {
    format!("⏩ You requested {}. Wonderful!", id)
}

#[get("/<url>")]
fn shorten(url: &str) -> String {
    format!("💾 You shortened {}. Magnificent!", url)
}

fn main() {
    rocket::ignite()
        .mount("/", routes![lookup])
        .mount("/shorten", routes![shorten])
        .launch();
}

Under the hood, Rocket is doing some magic to enable this nice syntax. More specifically, we use the rocket_codegen crate for that. In order to bring the rocket library into scope, we write extern crate rocket;. We define the two routes for our service. Both routes will respond to a GET request. This is done by adding an attribute named get to a function. The attribute can take additional arguments. In our case, we define an id variable for the lookup endpoint and a url variable for the shorten endpoint. Both variables are Unicode string slices. Since Rust has awesome Unicode support, we respond with a nice emoji just to show off. 🕶 Lastly, we need a main function, which launches Rocket and mounts our two routes. This way, they become publicly available. If you want to know even more about the in-depth details, I may refer you to the official Rocket documentation. Let’s check if we’re on the right track by running the application.

cargo run

After some compiling, you should get some lovely startup output from Rocket:

🔧 Configured for development.
    => address: localhost
    => port: 8000
    => log: normal
    => workers: 8
🛰 Mounting '/':
    => GET /<hash>
🛰 Mounting '/shorten':
    => GET /shorten/<url>
🚀 Rocket has launched from https://localhost:8000...

Sweet! Let’s call our service.

> curl localhost:8000/shorten/www.endler.dev
💾 You shortened www.endler.dev. Magnificent!
> curl localhost:8000/www.endler.dev
⏩ You requested www.endler.dev. Wonderful!

So far so good. Data storage and lookup We need to keep the shortened URLs over many requests… but how? In a production scenario, we could use some NoSQL data store like Redis for that. Since the goal is to play with Rocket and learn some Rust, we will simply use an in-memory store. Rocket has a feature called managed state. In our case, we want to manage a repository of URLs. First, let’s create a file named src/repository.rs:

use std::collections::HashMap;
use shortener::Shortener;

pub struct Repository {
    urls: HashMap<String, String>,
    shortener: Shortener,
}

impl Repository {
    pub fn new() -> Repository {
        Repository {
            urls: HashMap::new(),
            shortener: Shortener::new(),
        }
    }

    pub fn store(&mut self, url: &str) -> String {
        let id = self.shortener.next_id();
        self.urls.insert(id.to_string(), url.to_string());
        id
    }

    pub fn lookup(&self, id: &str) -> Option<&String> {
        self.urls.get(id)
    }
}
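We will walk through this module in a moment. First, a quick aside (my addition, not part of the original walkthrough): a round-trip test is a nice way to pin down the intended behavior of the repository:

#[cfg(test)]
mod tests {
    use super::*;

    #[test]
    fn stored_urls_can_be_looked_up() {
        let mut repo = Repository::new();
        let id = repo.store("https://www.endler.dev");
        // The ID handed out by store must resolve back to the original URL.
        assert_eq!(repo.lookup(&id), Some(&"https://www.endler.dev".to_string()));
        // Unknown IDs yield None.
        assert_eq!(repo.lookup("nope"), None);
    }
}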
Within this module, we first import the HashMap implementation from the standard library. We also include shortener::Shortener;, which helps us shorten the URLs in the next step. Don’t worry too much about that for now. By convention, we implement a new() method to create a Repository struct with an empty HashMap and a new Shortener. Additionally, we have two methods, store and lookup. store takes a URL and writes it to our in-memory HashMap storage. It uses our yet-to-be-defined shortener to create a unique id. It returns the shortened ID for the entry. lookup gets a given ID from the storage and returns it as an Option. If the ID is found, the return value will be Some(url); if there is no match, it will return None. Note that we convert the string slices (&str) to String using the to_string() method. This way we don’t need to deal with lifetimes. As a beginner, don’t think too hard about them. Additional remarks (can safely be skipped) A seasoned (Rust) developer™ might do a few things differently here. Did you notice the tight coupling between the repository and the shortener? In a production system, Repository and Shortener might simply be concrete implementations of traits (which are a bit like interfaces in other languages, but more powerful). For example, Repository could implement a Cache trait:

trait Cache {
    // Store an entry and return an ID
    fn store(&mut self, data: &str) -> String;
    // Look up a previously stored entry
    fn lookup(&self, id: &str) -> Option<&String>;
}

This way we get a clear separation of concerns, and we can easily switch to a different implementation (e.g. a RedisCache). Also, we could have a MockRepository to simplify testing. The same goes for Shortener. On top of that, you might want to use the Into trait to support both &str and String as parameters of store:

pub fn store<T: Into<String>>(&mut self, url: T) -> String {
    let id = self.shortener.next_id();
    self.urls.insert(id.to_owned(), url.into());
    id
}

If you’re curious about this, read this article from Herman J. Radtke III. For now, let’s keep it simple. Actually shortening URLs Let’s implement the URL shortener itself. You might be surprised how much has been written about URL shortening all over the web. One common way is to create short URLs using base 62 conversion. After looking around some more, I found this sweet little crate called harsh, which perfectly fits the bill. It creates a hash id from an input number. To use harsh, we add it to the dependency section of our Cargo.toml:

harsh = "0.1.2"

Next, we add the crate to the top of our main.rs:

extern crate harsh;

Let’s create a new file named src/shortener.rs and write the following:

use harsh::{Harsh, HarshBuilder};

pub struct Shortener {
    id: u64,
    generator: Harsh,
}

impl Shortener {
    pub fn new() -> Shortener {
        let harsh = HarshBuilder::new().init().unwrap();
        Shortener {
            id: 0,
            generator: harsh,
        }
    }

    pub fn next_id(&mut self) -> String {
        let hashed = self.generator.encode(&[self.id]).unwrap();
        self.id += 1;
        hashed
    }
}

With use harsh::{Harsh, HarshBuilder}; we bring the required structs into scope. Then we define our own Shortener struct, which wraps Harsh. It has two fields: id stores the next id for shortening. (Since there won’t be any negative ids, we use an unsigned integer for that.) The other field is the generator itself, for which we use Harsh. Using the HarshBuilder, you can do a lot of fancy stuff, like setting a custom alphabet for the ids. We’re good for now, but for more info, check out the official docs. With next_id we retrieve a new String id for our URLs. As you can see, we don’t pass the URL to next_id. That means we actually don’t shorten anything. We merely create a short, unique ID. That’s because most hashing algorithms produce fairly long output, and having short URLs is kind of the whole idea.
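To make that concrete, here is a tiny usage sketch. It is my addition; it assumes the Shortener from above is in scope, and the exact IDs depend on harsh’s default settings, so we print them instead of asserting on them:

fn main() {
    let mut shortener = Shortener::new();
    // The internal counter advances on every call, so no ID repeats.
    for _ in 0..3 {
        println!("{}", shortener.next_id());
    }
}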
Wiring it up So we are done with our shortener and the repository. We need to adjust our src/main.rs again to make use of the two. This is the point where it gets a little hairy. I have to admit that I struggled a bit here, mainly because I was not used to multi-threaded request handling. In Python or PHP, you don’t need to think about shared mutable access. Initially, I had the following code in my main.rs:

#[get("/<url>")]
fn store(repo: State<Repository>, url: &str) {
    repo.store(url);
}

fn main() {
    rocket::ignite()
        .manage(Repository::new())
        .mount("/store", routes![store])
        .launch();
}

State is the built-in way to save data across requests in Rocket. Just tell it what belongs to your application state with manage(), and Rocket will automatically inject it into the routes. But the compiler said no:

error: cannot borrow immutable borrowed content as mutable
 --> src/main.rs
  |
  |     repo.store(url);
  |     ^^^^ cannot borrow as mutable

In hindsight, it all makes sense: What would happen if two requests wanted to modify our repository at the same time? Rust prevented a race condition here! Yikes. Admittedly, the error message could have been a bit more user-friendly, though. Fortunately, Sergio Benitez (the creator of Rocket) helped me out on the Rocket IRC channel (thanks again!). The solution was to put the repository behind a lock, an RwLock in our case. Here is our src/main.rs in its full glory:

#![feature(plugin, custom_derive)]
#![plugin(rocket_codegen)]

extern crate rocket;
extern crate harsh;

use std::sync::RwLock;
use rocket::State;
use rocket::request::Form;
use rocket::response::Redirect;

mod repository;
mod shortener;

use repository::Repository;

#[derive(FromForm)]
struct Url {
    url: String,
}

#[get("/<id>")]
fn lookup(repo: State<RwLock<Repository>>, id: &str) -> Result<Redirect, &'static str> {
    match repo.read().unwrap().lookup(id) {
        Some(url) => Ok(Redirect::permanent(url)),
        _ => Err("Requested ID was not found."),
    }
}

#[post("/", data = "<url_form>")]
fn shorten(repo: State<RwLock<Repository>>, url_form: Form<Url>) -> Result<String, String> {
    let ref url = url_form.get().url;
    let mut repo = repo.write().unwrap();
    let id = repo.store(&url);
    Ok(id.to_string())
}

fn main() {
    rocket::ignite()
        .manage(RwLock::new(Repository::new()))
        .mount("/", routes![lookup, shorten])
        .launch();
}

As you can see, we’re using a std::sync::RwLock here to protect our repository from shared mutable access. This type of lock allows any number of readers or at most one writer at the same time. It makes our code a bit harder to read, because whenever we want to access our repository, we need to call the read and write methods first. In our lookup method, you can see that we are returning a Result type now. It has two cases: if we find an id in our repository, we return Ok(Redirect::permanent(url)), which will take care of the redirect. If we can’t find the id, we return an error. In our shorten method, we switched from a GET to a POST request. The advantage is that we don’t need to deal with URL encoding. We just create a struct Url and derive FromForm for it, which will handle the deserialization for us. Fancy! We’re done. Let’s fire up the service again and try it out!

cargo run

In a new window, we can now store our first URL:

curl --data "url=https://www.endler.dev" https://localhost:8000/

We get some ID back that we can use to retrieve the URL again. In my case, this was gY. Point your browser to https://localhost:8000/gY and you should be redirected to my homepage.
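One aside before the summary: the reader-writer pattern we just used is worth knowing outside of Rocket, too. Here is a minimal self-contained sketch (mine, not from the original post):

use std::sync::RwLock;

fn main() {
    let lock = RwLock::new(vec!["https://www.endler.dev".to_string()]);

    // Any number of readers may hold the lock at the same time.
    {
        let urls = lock.read().unwrap();
        println!("{} stored URL(s)", urls.len());
    }

    // A writer needs exclusive access; write() blocks until all readers are done.
    {
        let mut urls = lock.write().unwrap();
        urls.push("https://example.com".to_string());
    }
}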
Summary Rocket provides fantastic documentation and a great community. It really feels like an idiomatic Rust web framework. I hope you had some fun while playing with Rocket. You can find the full example code on Github. " }, { "title": "The Essence of Information", "url": "https://endler.dev/2017/the-essence-of-information/", "body": "People look confused when I tell them about my passion for algorithms and data structures. Most of them understand what a programmer is doing, but not what Computer Science is good for. And even if they do, they think it has no practical relevance. Let me show you with a simple example that applied Computer Science can be found everywhere. Imagine a pile of socks that needs to be sorted. Not exactly the most exciting pastime. You’ve put off this task for so long that it will inevitably take an hour to be done. Yes, there is a game about sorting socks: it’s called Sort the Socks, and you can get it for free on the App Store. Considering your options, you decide to get some help. Together with a friend, you get to work. You finish in roughly half the time. A Computer Scientist might call this pile of socks a resource. You and your friend get bluntly degraded to workers. Both of you can work on the problem at the same time — or in parallel. This is the gist of Parallel Computing. Now, some properties make sock-sorting a good fit for doing in parallel: The work can be nicely split up. It takes about the same time for every worker to find a pair of socks. Finding a different pair is a completely separate task that can happen at the same time. The more workers you assign to this task, the faster you’re done. 1 worker takes 60 minutes. 2 workers take 30 minutes. How long will 3 workers take? Right! Around 20 minutes. We could write down a simple formula for this relationship: sorting time = 60 minutes / number of workers. Well, that is not quite correct. We forgot to consider the overhead: When Mary tries to pick up a sock, Stephen might reach for the same one. They both smile, and one of them picks another sock. In computing, a worker might do the same. Well, not smiling, but picking another task. When lots of workers share resources, these situations occur quite frequently. And resolving the situation always takes a little extra time. So we are a bit away from our optimal sorting speed because of that. But it gets worse! Let’s say you have 100 workers for 100 socks. In the beginning, every worker might take one sock and try to find a match for it. Here’s the problem: As soon as they pick up one sock each, there are no socks left. All workers are in a waiting state. The sorting takes forever. That’s a deadlock, and it’s one of the most frightening scenarios of parallel computing. In this case, a simple solution is to put down the sock again and wait for some time before trying to get a new sock. Another way out of the dilemma would be to enforce some kind of “protocol” for sorting. Think of a protocol as a silent agreement between the workers on how to achieve a common goal. So, in our case, each worker might only be responsible for one color of socks. Worker one takes the green socks, worker two the gray ones, and so on. With this simple trick, we can avoid a deadlock, because we work on completely separate tasks. But there’s still a catch. What if there are only four green socks and 4000 gray socks? Worker one would get bored fairly quickly.
He would sort the two pairs of socks in no time and then watch worker two sort the rest. That’s not really team spirit, is it? Splitting up the work like this makes most sense, if we can assume that we have around the same number of socks for every color. This way we achieve roughly the same workload for everyone. The following histogram gives you an idea of what I mean: Even piles of socks. In this case, we have about equally sized piles for each color. Looks like a fair workload for every worker to me. Uneven piles of socks. In the second case, we don’t have an equal distribution. I don’t want to sort the gray socks in this example. We need to think a little harder here. What can we do? Most of the time it helps to think of other ways to split up work. For example, we could have two workers sort the big gray pile together. One sorts the large socks; the other one sorts the small ones. We run into another problem, though: Who decides what “large” and “small” means in this case? So, instead of thinking too hard about a smarter approach, we decide to be pragmatic here. Everyone just grabs an equally sized pile of socks — no matter the color or the size — and gets to work. Most likely, there will be some remaining socks in each pile, which have no match. That’s fine. We just throw them all together, mix the socks, create new piles from that, and sort them again. We do so until we’re done. We call that a task queue. It has two advantages: First, you don’t need any additional agreements between the workers and second, it scales reasonably well with the number of workers without thinking too hard about the problem domain. The tricky part about distributed systems is, that seemingly straightforward solutions can fail miserably in practice. What if our small piles look like this? A random pile of socks. The number of pairs in each pile is… sobering. What we could do is run a very quick presorting step to increase the number of matches. Or maybe you come up with an even better idea? The cool thing is, once you have found a faster approach, it works for similar tasks, too. Problems like this have their roots in Computer Science, and they can be found everywhere. Personally, I don’t like the term Computer Science too much. I prefer the German term “Informatik”, which I would roughly translate as “Information Science”. Because the real essence of what we’re doing here is to find a general way to solve a whole class of problems. We think of the nature of objects and their properties. We don’t sort socks; we try to answer the fundamental questions of information. Maybe now you can understand why I’m so passionate about this subject. Oh, and here’s a related post about why I love programming. " }, { "title": "Why I Love Programming", "url": "https://endler.dev/2017/why-i-love-programming/", "body": "Programming has many faces. It is the science of structured thinking. It is the art of eloquent expression. It teaches you to be humble when you look at other peoples’ fascinating work. Most of all, it teaches you a lot about yourself. While the syntax may change, the concepts will not. This post is split into two parts. In the first part, I will talk about the joy of programming. The second part will deal with the notion of being a professional programmer. If you’re not sure yet whether you want to learn how to program, this article is for you. Automating stuff gives you superhero strengths Being able to program is infinitely rewarding. You can help your sister sort a thousand pictures in a few seconds. 
You write a little backup script for your grandma. The possibilities are endless. Coding is fun! Coding something is more fun than using it. It’s even better than playing games. Why? Learn how to program a computer and get the best games for free — your own. You’re in total control. It’s your idea, your logic, even your laws of physics. It’s like building a house but without paying anything for the building materials. You can build a mansion for free. Sharing is fun, too! To get new inspiration for your next project, read the programs of others. This will give you an idea of how they think and how they solve problems. Many great programmers share their best code with you. You can do the same and share your project - or just the prettiest parts of it - with other programmers. Watching somebody else use your work is one of the most satisfying things you will ever experience. It’s very fulfilling to see your tool serve a purpose it wasn’t built for. Elegant, creative solutions It’s very appealing to work so hard on your vision that everything unnecessary peels off. All these little ideas and fundamental insights suddenly fall into place. What’s remaining is the distilled truth, the result of an ambitious but rewarding thought process and when you write it down as a program you can see all the little pieces working together. This makes it so gratifying to figure stuff out on your own. Programming is about understanding a problem so thoroughly, that you can teach a piece of metal how to solve it. Even the way your program is structured can be a piece of art. It can be concise, witty and fast all at the same time. Talk to a machine It’s fascinating that something is understood by machines and humans using the same language. I’m baffled when I realize that these circuits can actually “understand” and interpret words - in a way. Standing on the shoulders of giants Talking to other programmers and watching them work is a fascinating inspiration. The very system you are using to read this text relies on their work. Even if you’re far apart, you can study their work on Open Source projects online. But if you get a chance, watch them giving talks at conferences and meet them at local user groups. Becoming part of a community is gratifying. To exchange ideas and to collaborate on projects helps you push your boundaries and learn something new every day. Have fun, forget the rest The machine is agnostic to your skin color. It doesn’t matter if you’re a twelve-year-old girl or a lecturer at University. If you keep making the same mistake for ten hours straight, your computer won’t scream at you. It won’t punish you. It will happily await your commands. Also, the hurdles of entry are pretty low. An old computer is enough; even pen and paper and a book will suffice to work on cool programming ideas. Get started! You choose your own projects; nobody else. Don’t let anybody tell you that you’re not smart enough for this stuff. Ever. Each program is a wonderful journey so join us and code the world around you. " }, { "title": "Tools", "url": "https://endler.dev/2011/tools/", "body": "For as long as I can think, religious flamewars have infected computer science. Having arguments about technical topics can be healthy, but flamewars are not. I’m sick of it. I’m fed up with people telling me that their work environment is oh-so better, faster and so on. That’s fine, but it doesn’t matter. Your equipment only plays a supporting role. You don’t even need a computer to do programming. 
Donald Knuth wrote algorithms on a notepad. Alan Turing wrote the first chess computer on a piece of paper. And it worked. Beat that! For an average user, the next best system is probably good enough. Just a few bucks and you get an excellent piece of hardware which is completely sufficient to surf the web, chat, archive photos, write documents, listen to music and watch movies. You can do that with a Pentium IV, 256 MB RAM and any recent Operating System (you will likely get that one for free). Heck, you can use your old Commodore for most of that. Computers have been mature and reliable enough to do all that for ages. There’s no need to upgrade your system for Farmville, just like there’s no reason to buy a new car if the old one works perfectly fine. When it comes to software, many of us still use Office 2000 or Photoshop 8 or VisiCalc without feeling the urge to upgrade. Professionals find themselves in a similar situation. Well, maybe we invest a bit more money, but still, our hardware is incredibly cheap compared to our salary (hopefully). Nothing is perfect, but most of the time it’s good enough. That compiler you were using a decade ago? Still does the job. We are still using slightly modified descendants of programming languages from computing stone-age. Even if you’re doing numerical computing for NASA, your primary work environment is a black box running a text editor or an IDE. I don’t care what you are using to get things done. Find an environment that suits your needs and be happy with it. Maybe you use Emacs on a Lemote Yeelong netbook (hello Richard Stallman) or Vim on your workstation. It’s the same thing: A text editor running on a piece of metal. You’re not a worse programmer for using Nano, ed or TextMate. Notepad works just fine, too. It loads files, saves files and lets you edit them in between. That’s a hell lot more functionality than Bill Gates and Paul Allen had when they wrote a BASIC interpreter for the Altair. If you find something you’re happy with, just stick with it but don’t start arguing. It isn’t worth your time. Don’t feed the trolls. When it comes to software, don’t fall into the old FreeBSD vs. Linux vs. Windows vs. mum cliche. Instead, talk about your code. Let’s look at your problem-solving skills. Let’s be pragmatic here. Talk is cheap. Show me the code. - Linus Torvalds I don’t care which programming language you are using. Java? Fine. Visual Basic? Great! Scala, Cobol, PHP, C++? All fine. Write in Assembler or lolcode. Don’t moan about the fact that language X is missing feature Y. Write a library or use something different. Stop saying JavaScript is a toy language. It just doesn’t fit your needs. Instead, show me your Lisp adventure game. Write an interpreter for Brainfuck. Do something. Move things. Concerning PHP, nir wrote on Hacker News: Any idiot can write a snarky comment about PHP. Very few get to write code that has anywhere near the impact it had. Will you fall off your chair when I admit that I like the PHP syntax? OK, it has its rough edges (do we really need the $ sign?) but what’s more important is how much I can get done with it. PHP was my long time go-to language for off the hook, one time scripts. It looks a bit ugly but it runs on any server and comes with an enormous amount of built-in functionality. It’s great for rapid prototyping and gluing things together. 
In fact, when you write a piece of software, what you should strive for is to produce quite good software and what you really need to accomplish is good enough software to make your users happy. Zed A. Shaw puts it quite nicely in the afterword to Learn Python the hard way I have been programming for a very long time. So long that it is incredibly boring to me. At the time that I wrote this book I knew about 20 programming languages and could learn new ones in about a day to a week depending on how weird they were. Eventually though this just became boring and couldn’t hold my interest. What I discovered after this journey of learning was that the languages didn’t matter, it was what you did with them. Actually, I always knew that, but I’d get distracted by the languages and forget it periodically. The programming language you learn and use does not matter. Do not get sucked into the religion surrounding programming languages as that will only blind you to their real purpose of being your tool for doing interesting things. Don’t get emotional for any tool you use. An iPhone - I’m sorry to disappoint you - is just a phone. No magic. No “think different”. “But it’s evil!”, the ether says, “it’s not open source”. Well, Android just exists because Google needed to rapidly develop a mobile platform. It’s simply part of their business. There is no moral behind that. Google is a yet another company just like Microsoft or Apple. My MacBook serves me as a solid tool, but if something “better” comes around, I will happily kick it out. I’ve ditched Firefox after five years just because Chrome is faster and I will get rid of Chrome when I find a worthy successor. Vim is quite good in my opinion but if there’s a faster way to do things I’m not afraid to dump it. Instead get your hands dirty and fix the problems or craft something new. " }, { "title": "Are you a Programmer?", "url": "https://endler.dev/2011/are-you-a-programmer/", "body": "My geography teacher once told the story of her first lecture at University. As an introduction, her professor asked the class to draw a map of Germany without any help and as accurate as possible. To her surprise, she was not able to fill the map with much detail. Even the shape of the country was a bit vague. She had seen thousands of images of Germany (her mother country) but wasn’t able to reproduce it from her blurry memory. She would have to look it up. Doesn’t this sound familiar? We rely on machines to manage large portions of our knowledge. There’s hard work involved to learn something by heart. Here is a similar test for programmers: Using a programming language of your choice, write a correct sorting algorithm with an average runtime complexity of O(n*log n) (Heapsort, Quicksort, Bucketsort, you name it) on a piece of paper without the help of any external tools. And by correct I mean it must be free of bugs without any modifications when you type it in. You would be surprised by the large percentage of professional software engineers who can’t pull this off. Some might argue that knowledge about details of programming language syntax is unimportant: “Why learn all the little nitpicks when you know how to use a search engine? Why start with a clean slate when you can easily copy, paste and modify an example from a tutorial? Every few years/months I have to completely relearn the syntax for a different language anyway.” But that is a myth. 
If you know only one programming language really well - even if it is something outdated like Fortran or COBOL - you could easily earn a fortune with that knowledge. Suppose you started with C in 1975. You could still use the same syntax today - almost four decades later. Same for text editors. Emacs and Vim are both decades old. They are battle-hardened. I don’t care which one you prefer, but you will spend a large part of your life with your tools so invest the time to master them. As a side note, it appears that very few people strive for perfection in anything they do. They happily settle for “good enough”. This can have many different reasons, and I’m not blaming anybody for not doing his homework but maybe I’m not alone with that observation. If you don’t know how to use your tools without a manual, you are a lousy craftsman. If you need a dictionary to write a simple letter, you will have a hard time becoming a writer because it would already be challenging for you to form elegant, fluent sentences — let alone engaging and original stories. I don’t want to read these books. What makes a programmer? She has at least one programming language she knows inside out. She can implement standard algorithms (i.e. for sorting, searching) and data-structures (i.e. trees, linked lists) which are robust and reasonably fast on the fly. She has at least a basic understanding of complexity theory and programming concepts like recursion and pointers. But, to be a good programmer, you should Be able to code in at least two fundamentally different programming paradigms (i.e. declarative, functional). Have experience with big software architectures. Be familiar with your programming environment like the operating system and a sophisticated text editor of your choice. Preferably one, that is easily extendable. And that is just the tip of the iceberg. “There’s too much to learn!”, I hear some of you say. Start slowly. You need only three commands to start with Vim: i, ESC, :wq. That’s enough for day one. I realize that most of these essentials won’t be taught during lectures. You have to learn a vast portion on your own. But let’s face it: If you don’t know this stuff, you are not a programmer, you’re a freshman. " }, { "title": "On Hard Work", "url": "https://endler.dev/2011/on-hard-work/", "body": "Great people get shaped by their achievements There’s Thomas Edison who developed countless prototypes before selling a single light bulb. The unemployed Joanne K. Rowling writing Harry Potter in a Cafe while caring for her child. Steve Wozniak creating the first personal computer in his spare time while working at HP. What do they have in common? They all lived through frustration and contempt but still reached their goals, even though the chances for success were low. These people are stemming their strong will from an intrinsic curiosity. Dedication Sure, I love what I do. I want to be a programmer for the rest of my life, but sometimes it seems simply too hard to finish a project. I get scared by the big picture and fear that I won’t finish on time. What I need is a different mindset. Dhanji R. Prasanna, a former Google Wave team member made this observation And this is the essential broader point–as a programmer you must have a series of wins, every single day. It is the Deus Ex Machina of hacker success. It is what makes you eager for the next feature, and the next after that. 
While Google Wave has not been commercially successful, it sure was a technical breakthrough — and it was a drag to push it out into public. We always have to see our goal right in front of us, as we take a billion baby steps to reach it. This is true for any profession. Winners never give up. Direction Today it is easier to accomplish something meaningful than ever before. If you are reading this, you have access to a powerful instrument — a computer with an Internet connection. We live in a time where a single person can accomplish miracles without hard physical labor. A time where billions of people can grow a business from their desk, get famous in minutes, publish books in seconds and have instant access to large amounts of data. The most potent development over the last 100 years has been the reduction of communication costs. Transferring a bit of information to the other end of the world is virtually free and takes fractions of a second. While proper education was a privilege of a lucky few well into the 20th century, learning new things is now mostly a question of will. Nevertheless, learning is still a tedious task, requiring patience and determination. As the amount of information has increased, so have the ways of distraction. Losing focus is just a click away. Devotion Everybody can start something. Few will finish anything. That’s because getting things done is hard, even if you love what you’re doing. (Watch the beginnings of There Will Be Blood and Primer for a definition of hard work.) No matter what they tell you, achieving anything sustainable means hustling. It means making sacrifices. It means pushing through. It means selling something even though it isn’t perfect. Your beautiful project might turn into an ugly groundhog in the end. Put makeup on it and get it out the door. On a report about Quake’s 3D-Engine, developer Michael Abrash says: By the end of a project, the design is carved in stone, and most of the work involves fixing bugs, or trying to figure out how to shoehorn in yet another feature that was never planned for in the original design. All that is a lot less fun than starting a project, and often very hard work–but it has to be done before the project can ship. As a former manager of mine liked to say, “After you finish the first 90% of a project, you have to finish the other 90%.” It’s that second 90% that’s the key to success. A lot of programmers get to that second 90%, get tired and bored and frustrated, and change jobs, or lose focus, or find excuses to procrastinate. There are a million ways not to finish a project, but there’s only one way to finish: Put your head down and grind it out until it’s done. Do that, and I promise you the programming world will be yours. That last part has influenced me a lot. The dedication, the urgency to reach your aims must come from within you. It’s your raw inner voice speaking — don’t let it fade away. And when you are close to giving up, stop thinking so hard. Just try to push forward and make a tiny step in the right direction. Ship it! " }, { "title": "Overkill – Java as a First Programming Language", "url": "https://endler.dev/2010/overkill-java-as-a-first-programming-language/", "body": "I recently talked to a student in my neighborhood about his first programming experiences. They started learning Java at school, and it soon turned out to be horrible. A lot of us learned to code in languages like BASIC or Pascal. 
There was no object orientation, no sophisticated file I/O and almost no modularization… and it was great. In BASIC you could just write

PRINT "HELLO WORLD"

and you were done. This was actually a running program solving a basic and recurring problem: output some text on a screen. If you wanted to do the same thing in Java you just write:

public class Main {
    public static void main(String[] args) {
        System.out.println("Hello, world!");
    }
}

Do you see how much knowledge about programming you must have to achieve the easiest task one could think of? Describing the program to a novice programmer may sound like this: Create a Main class containing a main method returning void, expecting a string array as a single argument, using the println method of the out object of class PrintStream, passing your text as a single argument — and please just don’t forget your brackets. This way, your first programming hours are guaranteed to be great fun. OK. So what are the alternatives? I admit that nobody wants to write BASIC anymore because of its lack of a sophisticated standard library for graphics (Java doesn’t have one either) and its weak scalability. The language has to be clean and straightforward. It should be fast enough for numerical tasks but not as wordy as the rigid C-type bracket languages (sorry, C++ guys). It should have a smooth learning curve and provide direct feedback (compiled languages often suck at that point). It should encourage clean code and reward best practices. One language that provides all that is Python. And Python has even more: hundreds of libraries that help you with almost everything, good integration into common IDEs (PyDev in Eclipse, IDLE…), and a precise and elegant syntax. Here is our program from above, written in Python:

print("Hello World")

There’s no need to know about object orientation, scopes, and function arguments at this point. No housekeeping or bookkeeping. Yes, it’s an interpreted language, but that’s not a deal breaker for beginners. If you aren’t convinced yet: printing and formatting text output in Java is relatively easy for an advanced programmer, but the gruesome stuff begins with file input:

import java.io.BufferedReader;
import java.io.FileNotFoundException;
import java.io.FileReader;
import java.io.IOException;

public class FileIO {
    public static void main(String[] args) {
        String filename = "test.txt", line;
        try {
            BufferedReader myFile = new BufferedReader(new FileReader(filename));
            while ((line = myFile.readLine()) != null) {
                System.out.println(line);
            }
        } catch (FileNotFoundException e) {
            e.printStackTrace();
        } catch (IOException e) {
            e.printStackTrace();
        }
    }
}

I hear you say: “Dude, file I/O is pretty complex. It’s just the way it is.” That’s true… internally. But a beginner should get an easy interface. Python shows how it’s done:

file = open("test.txt")
text = file.read()
print(text)

The code goes hand in hand with the natural understanding of how the process works: “The computer opens a file, reads it and prints it.” Even a five-year-old kid can understand that.
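Python isn’t alone here, by the way. For comparison (my example, not from the original article), Rust also reads a file in essentially one line:

use std::fs;

fn main() {
    // Open, read, and close the file in a single call.
    let text = fs::read_to_string("test.txt").expect("could not read test.txt");
    println!("{}", text);
}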
Nobody would start to explain: “Before you can read a file you need a BufferedReader that works on a FileReader…”, even if this is precisely how it works internally. You want to explain the big picture first: the elementary principles of teaching a computer how to do useful stuff. Otherwise, you will start frustrating beginners and fool them into thinking that they are not bright enough for programming. Programming is fun, and starting with it is the most crucial step. So don’t spoil that experience with layers of unneeded abstraction. Links Response to this article (almost ten years later): Why Kotlin may be better than Java and Python as the first programming language " }, { "title": "Howto Sort a Vector or a List in C++ using STL", "url": "https://endler.dev/2010/howto-sort-a-vector-or-a-list-in-c-using-stl/", "body": "A little code snippet that people need very often.

/*
 * Howto sort a vector or a list in C++ using STL
 */

#include <algorithm> // Needed for sort() method
#include <vector>    // STL vector class
#include <list>      // STL list class
#include <iostream>  // Needed for cout,endl

using namespace std; // Save us some typing

/*
 * This is a comparison function. It can be used to tell sort()
 * how to order the elements in our container (the vector or list).
 * You can write a comparator for every data type (i.e. double, string...).
 */
bool comp(const int& num1, const int& num2) {
    return num1 > num2;
}

int main() {
    // SORTING WITH VECTORS //

    // A vector containing integers
    vector<int> v;

    // Insert some values
    v.push_back(5);
    v.push_back(12);
    v.push_back(1);

    // The generic STL sort function uses three parameters:
    //
    // v.begin()  Iterator pointing at the _beginning_ of the container
    // v.end()    Iterator pointing at the _end_ of it
    // comp       [Optional] A comparison function (see above)
    //
    // The above mentioned iterators must be random access iterators because
    // sort() takes advantage of clever tricks that require direct access to
    // all elements of the vector. This makes it really fast.
    // (Currently introsort is used with O(n*log n) even in the worst case.)
    sort(v.begin(), v.end(), comp);

    cout << "Vector: ";

    // Iterate over vector elements
    vector<int>::iterator vIt;
    for (vIt = v.begin(); vIt != v.end(); vIt++) {
        // Print current element to standard output
        cout << *vIt << " ";
    }
    cout << endl;

    // SORTING WITH LISTS //

    // A list containing integers
    list<int> l;

    // Insert some values
    l.push_back(5);
    l.push_back(12);
    l.push_back(1);

    // Here is the major difference between vectors and lists in general:
    // Vectors offer fast random access to every element,
    // but inserting a new element at the beginning or in the middle is slow.
    // On the other hand, inserting into a list is fast, but searching for
    // a specific element is slow.
    //
    // Vectors behave much like an array, while lists only allow slow
    // sequential access. Therefore we need a different function to sort
    // all elements, one that does not need random access iterators.
    //
    // comp [Optional] A comparison function (see above)
    //
    // Note that sort() here is specific to the list and is implemented as a
    // member function of list<>. This feels more object-oriented than the vector.
    l.sort(comp);

    cout << "List: ";

    // A pointer to a list element
    list<int>::iterator lIt;
    for (lIt = l.begin(); lIt != l.end(); lIt++) {
        cout << *lIt << " ";
    }
    cout << endl;

    return 0;
}

Compilation and execution Save the above code inside a file, e.g. list_vector.cpp, and compile it like so:

clang++ list_vector.cpp

To run it, execute the resulting binary.

./a.out

Program output

Vector: 12 5 1
List: 12 5 1

" }, { "title": "Why I Love Text Files", "url": "https://endler.dev/2010/why-i-love-text-files/", "body": "Text files are the single most important way we can communicate with computers. It’s no coincidence that they are also the most vital way to interact with other human beings. What we can achieve with text files is invaluable: Write it once and refer to it whenever you want to get the message across in the future.
" }, { "title": "Why I Love Text Files", "url": "https://endler.dev/2010/why-i-love-text-files/", "body": "Text files are the single most important way we can communicate with computers. It's no coincidence that they are also the most vital way to interact with other human beings. What we can achieve with text files is invaluable: Write it once and refer to it whenever you want to get the message across in the future. Write a program (it's just text), save it, and let the machine execute it whenever you like. Write another text file which contains the rules for the execution of your program, and the computer runs your application exactly as you specified (cron files do that on Unix). Text files can be structured in any way you can imagine. Some flavours are JSON, Markdown, and SVG. It's all just text. There are a billion programs and algorithms to access, modify, and distribute text files. You can write them with Emacs, print them on a terminal, pipe them through sed, and send them via email to a friend who publishes them on the web. Because text files are so important, we have good support for them on any computing system. On Unix, everything is a file, and HTML is just structured text. It's a simple and powerful tool for making a contribution to society that outlasts our lives. I have a single text file in my Mac Dock which is called TODO.txt. I open it every day, and after years of experimenting with different task management apps, from simple command-line tools to sophisticated online information storage systems, I always came back to plain text files. The explanation is simple: if humanity is still around a thousand years from now, chances are that plain text files will be among the very few file formats still readable. They are an integral part of how we can modify our environment without even leaving our desk. They have no overhead and can contain a single thought or the complete knowledge of our species. Distributing textual information is so vital for us that we permanently develop faster distribution networks, the fastest by now being the internet. On the web, you have instant access to a virtually endless amount of information and data distributed as plain text files. New web services have made accessing the data even easier by offering APIs and feeds. You can pull down the data from their servers and make statistics with a programming language of your choice. As you may have noticed, my affinity for text files partially comes from my programming background. As Matt Might correctly points out on his blog: The continued dominance of the command line among experts is a testament to the power of linguistic abstraction: when it comes to computing, a word is worth a thousand pictures. Whenever you like a text on the web, just link to it and create a wonderful chain of ideas. Want to read it later or recommend it to a friend? Just share the text or print it on paper. The fact that we all take such things for granted is a testament to the power of text files and their importance for the information age.

Links

Unix Text Processing (PDF) " }, { "title": "Running Legacy Code", "url": "https://endler.dev/2009/running-legacy-code/", "body": "This short article deals with a severe problem in software development: bit rot. When switching to a new platform (for instance from Windows XP to Windows Vista/7), programmers need to make sure that old bits of code still run flawlessly. There are several ways to achieve this goal, discussed in the next paragraphs:

Porting the code

This is generally considered a hard path to follow. For non-trivial legacy code blocks, chances are high that they contain side effects and hacks to make them work in different environments. Porting code means replacing the parts of the program that use functions and methods that no longer exist with new ones that make use of the modern libraries and routines of the new platform. The significant advantages are maintainable software and sometimes faster-running programs. But it may be necessary to hack the new platform's libraries in order to preserve the full functionality of an old application. And when an algorithm inside legacy code is changed, the ported version may become unstable. Thus, there may be better ways of maintaining obsolete code today.

Emulators

Emulators work much the same way as porting the code: you replace old function calls with new ones to make everything work again. However, you don't alter the old codebase itself (you may not even have the source code available); instead, you create a compatibility layer that "translates" between the underlying operating system (our new platform) and our old software. Emulation can be very fast and run stably for many years, but writing an emulator can be even harder than porting the code, because educated guesses may be needed to figure out how the program works internally. Additionally, the emulator itself may become obsolete in the future and might eventually be replaced by a new one.
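To make the compatibility-layer idea concrete, here is a minimal sketch (all names are hypothetical, not from any real platform): the legacy call site stays untouched, and a thin shim translates the obsolete call into the new platform's API.

    #include <iostream>
    #include <string>

    // --- The new platform's API (hypothetical) ---
    void platform_print(const std::string& text) {
        std::cout << text << std::endl;
    }

    // --- Compatibility layer: the "emulator" idea in miniature ---
    // The obsolete function is re-implemented as a thin shim that
    // translates the old call into the new API.
    void legacy_print(const char* text) {
        platform_print(text);
    }

    // --- Unmodified legacy code ---
    int main() {
        legacy_print("Hello from 1999!");
        return 0;
    }

A real emulator does this for an entire API surface, and often for binary code rather than source, but the principle is the same: translate at the boundary and leave the old software alone.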
Virtual machines

In recent years, a new approach has been gaining popularity. The idea is simple: don't touch anything. Take the whole platform and copy it in order to run the old software. The old software runs on top of the old operating system within a virtual machine that runs on the new platform. From a sane software developer's point of view, this method is ridiculous: a lot of resources are wasted along the way, and the system is busier switching contexts between the old platform and the new one than running the actual legacy program. However, with cheap and capable hardware everywhere, this idea becomes more and more attractive. As Jeff Atwood put it: Always try to spend your way out of a performance problem first by throwing faster hardware at it. And he's right. Microsoft took a similar route on its NT 6.x platform (Vista, Windows 7, Windows Server 2008…): Windows 7's "XP Mode" runs a full copy of Windows XP inside a virtual machine. This way, everything behaves just as if the software ran on the old system. And by optimizing the performance bottlenecks (input/output, context switches), one gets a fast, stable, and easy-to-maintain product.

Every method has its advantages and disadvantages. It's up to the developer to select the appropriate strategy. " } ]