Old notes

July 21, 2025Aaron M. Cohen Leave a comment

When I was a child I had some cognitive challenges. My parents moved me to a school which used the Slingerland program. As such I got into the habit of taking notes and repeating information I wanted to remember. I still do that today. I am a big fan of using paper & pen. Yet I am now left with piles of old notes.

Since my wife moved in and the arrival of my two children my notes have moved many times. I have gone from a home office to a man cave, then a man corner, and now a cabinet. We are constantly reclaiming our home from times of yore. So what do I do these notes?

The easy answer is to scan it digitally and put it in an online store somewhere. But that just moves my notes from one cabinet that I ignore to another. So I plan to revisit each note and turn it into a blog post. Not only does that refresh the content in my memory but also shares it with others.

Lessons from porting Lucene: Starting over

July 17, 2025Aaron M. Cohen Leave a comment

So I took a break from my effort to port Lucene. Late one night I couldn’t figure out how to port logic from Java to Rust. I decided to sleep on it. A few years passed and now I am looking at my code with fresh eyes.

Rather than just pick up where I left off I asked myself what went wrong? Turns out I made a mistake that kills many software projects. I didn’t spend enough time starting the project correctly. The rush to code resulted in spending less time on project setup. I selected a Rust project scaffolding and tried to force a Java project into it. As result one project had to resolve too many problems which resulted in my getting stuck.

Every project needs ADRs

Architecture Decision Records (ADR) allow a team to document why something was done a certain way. Yet more importantly it forces teams to thoroughly explore and agree on decisions. Yes even as a one person team I need to come to the understanding this is the right decision. In this case, I would have discovered that the current project setup was inappropriate. This would have happened while addressing the “Consequences” section of the ADR.

Choosing the right Priorities

I recently read “Decisive” by Chip and Dan Heath. It highlighted the importance of having the right priorities to help resolve conflicts. For instance I wanted the port to have the same features as Lucene. I figured I can reuse Lucene’s unit tests to verify to guarantee that. Well… Lucene is a java library. So I was trying to make Rust behave like Java with JNI in the middle which made life difficult. I was putting a priority on supporting Lucene’s unit tests when it wasn’t always appropriate.

Know the usage

Often when porting code I recall the parable of the blind men and an elephant. I end up spending a good amount of time very focused on a specific part without understanding the big picture. For instance I found myself stuck trying to figure out how to achieve Lucene’s inheritance in a non-object oriented language. I really should have asked how is this part used? If it is used in only one place then inheritance doesn’t matter. I can just combine them.

Making Hot Sauce

December 24, 2022Aaron M. Cohen Leave a comment

I am a big fan of Hot Sauce. Every time my in-laws visit from Belize I ask them to bring Marie Sharps hot sauce. Yes it is spicy but it also has so much flavor. Recently I tried to make my own hot sauce based on a recipe from Joshua Weissman. It was good and tasted a little bit like Franks RedHot. I tried it a few more times with each being a new rift on the recipe. Here is my current recipe for my Garlic Hot Sauce.

Ingredients:

Mix of Peppers. (1 – 1.5 lbs)
3-5 tablespoons of salt.
Water
1/2 to 3/4 cup of Canola or Avocado oil
1/2 cup of distilled vinegar
1-2 head of garlic

For the peppers I started off with mostly tabasco, cayenne, and jalapeño peppers I grew in my garden. All red in color.

However since then I have changed the mix to be more green than red. My current mix is poblano, green jalapeño, and red fresno. Different peppers bring different flavors and I find a mix is good.

Steps

Separate half of the peppers out and place on a baking pan.
Place the baking pan under the broiler. Turn the peppers every so often so they get roasted on all sides. Then let cool.
De-stem peppers, slice lengthwise, and then place in a jar. Please use protection. I like to remove the seeds but it isn’t necessary.
Add a salt/water mix to the jar until full. The mix should be about 4-5% salt by weight.
Let sit for about 2 weeks. I don’t recommend the 3 year required for a Tabasco sauce. You will see it turn white and bubble. That is OK. However do let it air out from time to time otherwise it will explode you on. You are making hot pepper sauce not hot pepper spray.
Peel and slice 1-2 head soft garlic depending on the size of your garlic heads and how much garlic flavor you want added.
Cook the garlic on low heat in the oil until lightly golden brown and let cool and separate garlic from oil.
Separate peppers from brine.
Combine peppers & garlic in blender and blend until completely smooth.
Add oil, vinegar and 4 tablespoons of brine to the blender.

So what is next?

My wife just bought me Hot Sauce Cookbook: The Book of Fiery Salsa and Hot Sauce Recipes. So should have many new recipes to try and riff on in the future.

How porting Lucene made me care about bit operations…

September 4, 2021Aaron M. Cohen Leave a comment

I am going to be honest… I haven’t touched binary operations since I attended a university assembly class about 20 years ago. So when I came across the writeVInt and readVInt methods from DataOutput and DataInput base classes I thought this would be a good to brush up. I lost a good few days because I did not consider the difference between arithmetic and logical shifts.

Unlike Java Rust does not have separate operators for arithmetic and logical operators. In java >> is arithmetic and >>> is logical. However in the rust documentation there was a footnote I completely glanced over stating.

** Arithmetic right shift on signed integer types, logical right shift on unsigned integer types.
https://doc.rust-lang.org/reference/expressions/operator-expr.html#arithmetic-and-logical-binary-operators

So what does this mean in practice is a signed data primitive like i8 when shifted gets a 0 or 1 based on it”s sign. So -127 (10000000) becomes -64 (11000000). So what is wrong with this logic? The implementation of variable length quantity in Lucene uses prefix 0’s to determine how many 7 bit bytes to write. For instance a typical VByte encoding is:

Value	byte 1	byte 2	byte3
0	00000000
1	00000001
2	00000010
127	01111111
128	10000000	00000001
129	10000001	00000001
130	10000010	00000001
16383	11111111	01111111
16384	10000000	10000000	00000001

So for positive or unsigned numbers this logic transferred over easily. However a negative number would cause an infinite loop. To work around this I flip the sign after shifting the bits for the first run. So it now looks like this.

loop step	value	binary
1	-2147483648	0b10000000000000000000000000000000
2	16777216	0b00000001000000000000000000000000
3	131072	0b00000000000000100000000000000000
4	1024	0b00000000000000000000010000000000
5	8	0b00000000000000000000000000001000

After spending all that time making negative numbers working with variable length integers I realized that use case may never be used. The documentation for those methods specifically states

Negative numbers are supported, but should be avoided.
https://lucene.apache.org/core/6_4_0/core/org/apache/lucene/store/DataOutput.html

When I asked the lucene mailing list about this I got

They are fully supported, so you can write and read them.
The problem with negative numbers is that they need lot of (disk) space, because in two’s complement they have almost all bits set. The largest number is kinds of disk space is -1. Negative numbers appear in older index formats, so they can’t be prevented. Just take the comment as given: all is supported, but if you want to store negative numbers use a different encoding, e.g. zigzag.
http://mail-archives.apache.org/mod_mbox/lucene-java-user/202109.mbox/%3cBEFCBB73-0848-4EB6-80E8-249D3D14EE30@thetaphi.de%3e

Nevertheless I now know the difference between arithmetic and logical shifts in Rust.

The most carrot pasta I have ever made.

August 23, 2021August 24, 2021Aaron M. Cohen Leave a comment

During the year of Covid I started a garden. I’ve had plants in pots before but this was the first time I had a section of land dedicated to plants. It turned out to be a great education opportunity for my son. However now I need to actually do something with what I grew. I was amazed how much plant there was above the carrot and wanted to make a meal which used it.

Ingredients:

1+ carrots with top. Enough to make 2 cups of greens once the stems are removed.
zucchini (optional)
2 cups of baby spinach
3-4 cloves of garlic.
1 cup of roasted unsalted cashews
olive oil
Salt & pepper
1lb of pasta (penne works well)
Parmesan to taste

Instructions

Preheat oven to 425
Separate carrot(s) from top and wash/peel
Slice carrots and zucchini (optional) to somewhat equal size
Toss carrots with olive oil, salt, and pepper.
Place on sheet pan and put in the oven for about 20 minutes
Wash and separate the greens from the tough stems. Also remove anything that looks…ugly
Put greens, spinach, garlic, roasted unsalted cashews, and 1 cup of olive oil into a blender.
Pulse until smooth. Depending on your blender you may want to add the olive oil in parts.
Cook pasta until al-dente
Drain most the water but not all. roughly half a cup
Combine all and serve with parmesan.

Grill + Veggies + Pasta

July 15, 2021July 18, 2021Aaron M. Cohen Leave a comment

Two of our new favorites summer pandemic activities come together in this dish which is gardening and grilling. Grilling because I got a new grill and I intend to use it. We also joined our community garden and now have more squash than we can count.

Ingredients

Pasta: Penne works best but others like rigatoni or elbow macaroni will work to.

Veggies:

1-2 large Zucchini
1-2 large Summer squash
1 lb Asparagus
1 ln Bell peppers
Kosher salt & pepper to taste

Sauce:

1 package of goat cheese
1/3 cup olive oil
3 Tbsp balsamic vinegar
2 Tbsp mayonnaise
1/2 Tbsp Dijon mustard
1 clove garlic, minced
1/2 tsp dried basil
1/2 tsp salt
Pepper to taste

Steps

Warm up the grill to 400-500 degrees F
Toss asparagus with olive oil, salt & pepper
Cut squash evenly and toss with olive oil, salt, & pepper.
Place veggies on grill. You may want to do this in batches. Asparagus and peppers are usually done first. I usually rotate them every 2 minutes.squash is usually closer to 4 depending on thickness.
Cook past to slightly Al dente. Reserve 1/3-1/2 cup of pasta water and drain the rest.
Combine reserved water with the rest of the sauce ingredients minus the goat cheese.
Roughly chop veggies.
Combine pasta, sauce, & veggies.
Crumble in goat cheese and combine.

Kubernetes useful tricks: Creating a secret from a file in an image

June 18, 2021Aaron M. Cohen Leave a comment

Recently I had an interesting problem. The product I was working on needed to create a secret from a file. Now on one hand this is an easy thing as you can just have a job run within an image

kubectl create secret generic mysecret --from-file=./file.txt

Ah… However Kubernetes command line client is only compatible within one minor version of the Kubernetes api server. So if you want to support all major version for Redhat Openshift currently supported you have to support Kubernetes 1.11 to 1.21. So to get around this problem you have to curl against the kubapi server directly. Here is a sample script I created to demonstrate this method.

apiVersion: batch/v1
kind: Job
metadata:
  name: readfiletosecret
  description: "Example of reading a file to a kubernetes secret."
spec:
  template:
    spec:
      serviceAccountName: account-with-secret-create-priv
      volumes:
      - name: local
        emptyDir: {}
      initContainers:
      - name: get-file
        image: registry.access.redhat.com/ubi8/ubi-minimal:latest
        command:
        - "/bin/sh"
        - "-c"
        env:
        - name: UPLOAD_FILE_PATH
          value: "/root/buildinfo/content_manifests/ubi8-minimal-container*.json"
        args:
        - |
          cat $UPLOAD_FILE_PATH
          cp -vf $UPLOAD_FILE_PATH /work/
        volumeMounts:
        - name: local
          mountPath: /work
      containers:
      - name: create-secret
        image: registry.access.redhat.com/ubi8/ubi-minimal:latest
        command:
        - "/bin/sh"
        - "-c"
        args:
        - |
          ls /work/
          i#
          export CONTENT=$(cat /work/* | base64 )
          echo $CONTENT
          #Set auth info
          export SERVICEACCOUNT=/var/run/secrets/kubernetes.io/serviceaccount
          export NAMESPACE=$(cat ${SERVICEACCOUNT}/namespace)
          export TOKEN=$(cat ${SERVICEACCOUNT}/token)
          export CACERT=${SERVICEACCOUNT}/ca.crt
          export APISERVER="https://kubernetes.default.svc"
          # Explore the API with TOKEN
          curl --cacert ${CACERT} --header "Authorization: Bearer ${TOKEN}" -X POST -H 'Accept: application/json' -H 'Content-Type: application/json' -d @-  ${APISERVER}/api/v1/namespaces/$NAMESPACE/secrets <<EOF
          {
            "kind": "Secret",
            "apiVersion": "v1",
            "metadata": {
              "name": "example"
            },
            "data": {
              "file": "$CONTENT"
            }
          }
          EOF
          rm /work/*
        volumeMounts:
        - name: local
          mountPath: /work
      restartPolicy: Never
  backoffLimit: 4

Porting Lucene: Iteration 0

June 18, 2021June 18, 2021Aaron M. Cohen Leave a comment

Iteration 0 is often used to create a product backlog or setup technical foundation (code repos, build pipelines, etc..). I frankly thought the concept was odd. In Agile you work to dates not scope. So to have an iteration that defines scope is odd. Thus I have a different take on it.

Iteration 0 is when you “start to know what you don’t” and with each subsequent iteration you learn more and the plan changes; So with that said it is time to prime the product backlog. Let’s start with defining goals.

Goal #1: Port Lucene from Java to Rust

While it seems straight forward we need to understand what Java parts are in the Lucene project. So let’s start start with a…

git clone https://github.com/apache/lucene.git
cd lucene
ls                                            
LICENSE		build.gradle	dev-docs	gradle		gradlew.bat	lucene		versions.lock
README.md	buildSrc	dev-tools	gradlew		help		settings.gradle	versions.props

OK, So right off the bat I can see it is a Gradle project. Gradle supports multiple languages out of the box and supports plugins to support others like Rust. Here we have our first decision… Do we port the Lucene library code or do we port the project? Let’s revisit that later after we are done exploring.

The interesting thing is there are a good number of files that are very specific to how Apache runs their projects and performs releases. For instance I haven’t seen an RDF document in probably 10 years yet here is a DOAP document defining the project and all the releases per Apache standards. The two items which I should focus on is the Lucene directory with all the java files and dev-tools/scripts/smokeTestRelease.py. The later will be useful for the next goal.

Goal #2: It should pass existing test scripts

So I think this is something which will set the project apart from other ports. Providing a way to reuse existing test scripts will ensure compatibility over time. However doing so will require the ability to do 2 things.

Leverage JNI from RUST

It has been a while since I’ve used JNI so this will be a good refresher. Based on existing documentation it seems this is done all the time for Android but still an experiment is required to ensure there are not any unforeseen gotchas.

The ability to sync Tests with the parent project

This is going to be a hard one. Reading through dev-tools/scripts/smokeTestRelease.py the Release verification is more than just Unit Tests. It also verifies digest mismatch, documentation, and missing metadata. Not all of these verifications will be applicable. For instance verifying the jar “Implementation-Vendor”metadata would not apply. So valid parts of this script will need to be ported and maintained.

At first glance majority of the verification is in the form of unit tests. In fact it looks like roughly 33% of all the Java source files are unit tests. That being said the Gradle build handles preparing data which may be used in those tests.

find lucene/ |grep ".java" |wc -l
5505
find lucene/ |grep ".java" | grep test | wc -l
1844

So we probably need a mechanism to

pull the latest project from Lucene
Clear out the non-test related Java source files
Update the build scripts to leverage an external JAR
Run the tests

Of course this means the code for the port needs to be managed separately from Lucene. This is turning into more of a complex project than I expected. Going to need to spend some time understanding a good project setup which will allow this. However it is currently the end of this iteration so that will have to wait for the next one.

Building a boat in the basement….

June 14, 2021Aaron M. Cohen 1 Comment

It goes without saying that 2020 was a tough year. While 2021 is getting better we are still not out of the woods yet. Sometimes the best way to get through it is to have a project that isn’t related to work but helps exercise the mind. Now I am not actually building a boat. My wife would kill me. However I am starting a project of similar ambition.

While granted I am a manager I still like to keep my development skills sharp. This is why I jump in to write code that helps my team reach their goals. This can range from everything from GO to Python. So for a personal project I don’t want to program in any of those.

Thus I am going to take a few open source projects and port them to Rust. I am gong document my experience and what works/doesn’t. Now I intent to do something a bit different for this port. I am going to use JNI to ensure the port is 100% compatible with the old one.

Choosing the right project

It goes without saying that there are plenty of open source projects. So here is the criteria I am using for selecting the right project to port.

It should be an established project which is not radically changing. This will make maintenance of the port easer over time.
It should have existing ports and the community should be open to new ports.
It should be large but not so large that it can never be completed.
It should be a project I am familiar with.

Based on this criteria I have decided to port the Lucene project. I have worked with Lucene on multiple projects back in the day. It will be good to revisit it and understand better how it works under the hood.

Applying Behavior Driven Development practices to infrastructure.

September 1, 2017Aaron M. Cohen Leave a comment

Earlier this summer I worked with my team to apply Behavior Driven Development practices to infrastructure that we deployed our products to. Prior to this the DevOps team simply identified toil and implemented solutions. Unfortunately this meant the reasons behind the changes would get lost over time. Having GHERKIN files with your solution means that information would not get lost. Interesting enough we were able to reduce the total size of the source code considerably because some of the user stories were no longer relevant.

I wrote a tutorial based on this experience that can be found here.

Aaron Cohen

Thoughts on software, development, & life

Category Uncategorized

Old notes

Lessons from porting Lucene: Starting over

Every project needs ADRs

Choosing the right Priorities

Know the usage

Making Hot Sauce

Ingredients:

Steps

So what is next?

How porting Lucene made me care about bit operations…

The most carrot pasta I have ever made.

Ingredients:

Instructions

Grill + Veggies + Pasta

Ingredients

Steps

Kubernetes useful tricks: Creating a secret from a file in an image

Porting Lucene: Iteration 0

Goal #1: Port Lucene from Java to Rust

Goal #2: It should pass existing test scripts

Leverage JNI from RUST

The ability to sync Tests with the parent project

Building a boat in the basement….

Choosing the right project

Applying Behavior Driven Development practices to infrastructure.