Serverless Over Kubernetes - The Clean Way


TL;DR: In this article, we present our implementation of serverless computing over Kubernetes. You will learn a new, lean way to implement serverless without any external dependencies. After reading this, you will be able to create your own serverless architecture inside a Kubernetes cluster. If you’re looking for a solution for your on-prem environment, want to stop using remote serverless providers, or just don’t want to pay them for maintenance, this is the place for you.

Implementing Serverless

Serverless computing has become a common term among software development companies. It is a valid architectural solution for teams trying to use cloud services instead of setting up their own servers. Most major cloud platforms have their own solution for this type of development. Azure has Azure Functions, AWS offers AWS Lambda and so on.

Photo by Taylor Vick on Unsplash

What if we want to use the logic behind serverless computing without depending directly on these cloud platforms?

A viable solution is to apply the serverless idea while using Kubernetes as the platform to do so.

In this article, I will take you through the thought process behind the decision to use serverless over kubernetes, and of course the implementation. Whether you are a developer looking for a solution for your project, or just an enthusiastic reader looking to learn something new, I believe there’s something here for everyone.

What is serverless?

Most developers, both seasoned and beginners with some experience, have probably faced the agony of starting up a server just to test their work. These servers need to be maintained, which in most cases takes a lot of time. If you disagree, you are probably not the person responsible for doing this in your company, and trust me, someone’s hair is getting grayer with every server failure.

Serverless architecture solves this exact issue. It takes the problem of server maintenance out of the dev team’s hands, and strips it down to the main functionality needed to execute a specific action. That means I don’t have to wait for a server to “wake up” to execute my code. All I need to do is send a message using a predefined API (an HTTP request in my case) and the server will execute my request. No dependencies, no configurations, no wait time. Easy.

So why roll our own?

Although serverless computing over the cloud is very promising, it does have some limitations. At Valid Network, we develop a security platform for companies that use blockchain technology. We allow our clients to analyze their code for potential vulnerabilities. Since we want this to be repeatable, we used AWS’ serverless technology, Lambda.

While using Lambda we started having issues scanning some customers’ projects. Quick research uncovered that Lambda’s tmp folder has a limit of 512MB. On top of that, AWS API Gateway has a timeout of 29 seconds, so if our process takes longer than that, tough luck.

This was about the time we started thinking of becoming available on-premise as well. Some clients weren’t willing to publish their Git repositories to our SaaS platform for security reasons, so we had to come up with a solution for them as well.

All of the above led us to start looking into implementing serverless on our own, so we could keep the advantages of serverless logic while adapting them to our needs.

Assumptions

An important thing to note is that this is not a beginner’s guide. The technologies used here are simply a result of our prior knowledge; the specific tech stack is not a constraint. If you’re developing in a different language and/or database, you should be able to implement the same logic.

It’s recommended to familiarize yourself with the following tech stack before jumping into the next section, to get a better understanding of the implementation.

Tech Stack

  • Kubernetes — used as the orchestrator
  • RabbitMQ — queue service to manage jobs
  • Node.js — JavaScript runtime for backend development
  • TypeScript — optional; type safety for Node.js
  • MongoDB — the database used to deliver data

General Terms

  • HTTP requests — a basic understanding of how HTTP requests work
  • Object oriented programming

The Solution

Photo by Olav Ahrens Røtne on Unsplash

To implement serverless successfully, we used two different Kubernetes deployments running Node.js, RabbitMQ as a queue service, and a MongoDB database instance.

This can be achieved with most available queue services and NoSQL databases.

The Kubernetes deployments are divided in two: the request pod and the execution pod.

Request pod

Responsible for sending the requests. Should be stable, and can have many instances. Can use a persistent volume.

Connects to the queue service and adds requests to the queue. Other services that wish to run serverless requests send HTTP requests directly to the request service.

The pod caches every job it creates in order to monitor responses, using a separate channel in the queue where it functions as the recipient. As soon as it is notified that a job it is responsible for is done, it accesses the database and requests the response data for that specific job.

When the data is successfully retrieved, the pod sends a response back to the service that made the HTTP request, with the requested data.

Execution pod

Responsible for executing requests. Functions as a one-time pod. Dies as soon as it finishes processing the request, and cannot be connected to a persistent volume. Also, the image used to create the pod should be as slim as possible.

The execution pod registers to the queue and waits for a job request. Once it pulls a job with all its data, it executes it. As soon as it finishes, the result of the job is processed on two different channels:

  • A new entity is added to the database with the job id and the data generated from completing the request (if any).
  • A new channel is opened to send information back to the request pod through the queue service, alerting it that the job is done, along with the completion status (done, timeout or fail).

When the execution pod has served its purpose, it dies. The execution service is responsible for creating a new execution pod, which immediately registers to the queue and waits for new jobs.

High-level design of the implementation


Deep Dive

Photo by Jeremy Bishop on Unsplash

Let’s go over, piece by piece, what needs to be done to successfully implement serverless computing.

RabbitMQ

We’re using RabbitMQ here since it is very easy to set up. All you have to do is create a service and a replication controller for RabbitMQ.

Here’s a YAML configuration example:

  • RabbitMQ Service

apiVersion: v1
kind: Service
metadata:
  labels:
    component: rabbitmq
  name: rabbitmq
spec:
  ports:
    - port: 5672
  selector:
    app: taskQueue
    component: rabbitmq
  type: ClusterIP

  • RabbitMQ Replication Controller
kind: ReplicationController
apiVersion: v1
metadata:
  labels:
    component: rabbitmq
  name: rabbitmq
spec:
  replicas: 1
  template:
    metadata:
      labels:
        app: taskQueue
        component: rabbitmq
    spec:
      containers:
        - image: rabbitmq
          name: rabbitmq
          ports:
            - containerPort: 5672
          env:
            - name: RABBITMQ_DEFAULT_PASS
              value: [password]
            - name: RABBITMQ_DEFAULT_USER
              value: [username]
  • You can set any username and password you want
  • Notice that the number of replicas should be just one. Since RabbitMQ functions as a sort of “proxy” that forwards messages, there shouldn’t be any issue with overloading it

Request Entity

We’re adding a deployment to our kubernetes cluster to function as the request endpoint. The purpose of this endpoint is to receive http requests from other services, create jobs out of these requests and send them to rabbitmq.

  • Request Service
apiVersion: v1
kind: Service
metadata:
  name: request-entity
spec:
  ports:
    - port: [port number]
      protocol: TCP
      targetPort: [target port number]
  selector:
    app: requestEntity
  type: ClusterIP
  • Request Deployment
apiVersion: apps/v1
kind: Deployment
metadata:
  labels:
    app: requestEntity
  name: request-entity
spec:
  replicas: [replica number]
  strategy:
    type: RollingUpdate
  selector:
    matchLabels:
      app: requestEntity
  template:
    metadata:
      labels:
        app: requestEntity
    spec:
      containers:
        - name: request-entity
          image: [image url]
          command:
            - node
          args:
            - index.js
          ports:
            - containerPort: [port number]
          volumeMounts: [add volumes here]
          env:
            - name: rabbitmqPassword
              value: [password]
            - name: rabbitmqUsername
              value: [username]
  • You can define as many replicas as you want. This implementation supports scaling
  • Username and password should match the ones set previously in rabbitmq configuration

The Code

We use several npm package dependencies:

  1. amqplib — to connect to RabbitMQ using the AMQP protocol
  2. express — to receive http requests
  3. typescript — optional, for type safety

Using the express library, we receive HTTP requests and add jobs to RabbitMQ to execute them. We then wait for a response; once we receive a finished-job response with the same job id, we access our database (MongoDB) to extract the data.

  • We only forward the job id through RabbitMQ, since forwarding messages larger than 128MB via RabbitMQ is not recommended, and in our use-case we sent over 1GB of data
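The article does not show the HTTP wiring itself, so here is a minimal sketch of what the request entity’s endpoint could look like. To keep the sketch dependency-free it uses Node’s built-in http module rather than express, and the body shape `{ functionName, params }` is an assumption; `handleRequest` is the method described later in this section, injected here as a callback.

```typescript
import * as http from 'http';

// Stand-in signature for the request entity's handleRequest method:
// resolves with the job's result data, or throws on timeout/error.
type HandleRequest = (functionName: string, params: any) => Promise<any>;

export function createRequestServer(handleRequest: HandleRequest): http.Server {
  return http.createServer((req, res) => {
    let body = '';
    req.on('data', (chunk) => (body += chunk));
    req.on('end', async () => {
      try {
        // Parse the incoming job request and delegate to handleRequest,
        // which resolves once the job's result is available in the database
        const { functionName, params } = JSON.parse(body);
        const data = await handleRequest(functionName, params);
        res.writeHead(200, { 'Content-Type': 'application/json' });
        res.end(JSON.stringify({ data }));
      } catch (e: any) {
        // Timeouts and execution errors surface as a 500 to the caller
        res.writeHead(500, { 'Content-Type': 'application/json' });
        res.end(JSON.stringify({ error: e.message }));
      }
    });
  });
}
```

Any service in the cluster can then call this endpoint with a plain POST and get the job result back synchronously.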
Global variables
private readonly writeQueue: string = 'assignments';
private readonly readQueue: string;
private readonly exchange: string = 'responses';

// The channel in charge of sending job requests
private writeChannel: any;

// The channel in charge of receiving completed jobs
private readChannel: any;

// A hash table allowing us to track the jobs requested by this
// pod, supporting scaling with multiple replicas
// The Observable class is a helper class allowing to track
// timed-out jobs
private runningJobs: {
  [jobId: string]: {
    params: any,
    finished: Observable;
  }
} = {};

// A mongodb collection contains info about requested data
private mongoCollection: any;
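The Observable helper referenced above is not shown in the article. A minimal sketch of what it could look like, assuming a boolean flag that async callers can await with a timeout:

```typescript
// Hypothetical sketch of the Observable helper: holds a boolean value,
// lets callers await a change to an expected value, and rejects on timeout.
export class Observable {
  private waiters: Array<(value: boolean) => void> = [];

  constructor(private value: boolean) {}

  setValue(value: boolean): void {
    this.value = value;
    // Wake up everyone waiting on this flag
    for (const resolve of this.waiters) resolve(value);
    this.waiters = [];
  }

  // Resolves when the value becomes `expected`; rejects after `timeoutMs`
  waitForValueToChange(expected: boolean, timeoutMs: number): Promise<void> {
    if (this.value === expected) return Promise.resolve();
    return new Promise((resolve, reject) => {
      const timer = setTimeout(
        () => reject(new Error(`Job timed out after ${timeoutMs}ms`)),
        timeoutMs
      );
      this.waiters.push((value) => {
        if (value === expected) {
          clearTimeout(timer);
          resolve();
        }
      });
    });
  }
}
```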
Initialization method
let username = process.env.rabbitmqUsername;
let password = process.env.rabbitmqPassword;
let connectionString = `amqp://${username}:${password}@rabbitmq:5672`;
let connection = await require('amqplib').connect(connectionString);

this.writeChannel = await connection.createChannel();
await this.writeChannel.assertQueue(this.writeQueue);

this.readChannel = await connection.createChannel();
await this.readChannel.assertExchange(this.exchange, 'topic');
await this.readChannel.assertQueue(this.readQueue, {
  exclusive: true,
  autoDelete: true,
  durable: false
});

// We bind the read queue with handleResponse method so that every
// response message will go directly there
await this.readChannel.bindQueue(this.readQueue, this.exchange, 'rabbitjob.responses');
this.readChannel.consume(this.readQueue, async (msg: any) => {
  let queueResponse: QueueResponse = JSON.parse(msg.content.toString());
  await this.handleResponse(queueResponse);
  this.readChannel.ack(msg);
});
sendJob(functionName: string, params: any) method
// Generate unique random jobId
let jobId = IdGenerator.generate();
let queueRequest: QueueRequest = { jobId, params, functionName };
this.runningJobs[jobId] = { params, finished: new Observable(false) };
console.log(`Sending job request: ${JSON.stringify(queueRequest)}`);

// Send to queue as buffer object
await this.writeChannel.sendToQueue(
  this.writeQueue,
  Buffer.from(JSON.stringify(queueRequest))
);
return jobId;
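IdGenerator is likewise not shown in the article; any source of collision-resistant unique ids will do. A minimal stand-in using Node’s built-in crypto module:

```typescript
import { randomUUID } from 'crypto';

// Hypothetical stand-in for the article's IdGenerator: produces a unique
// id per job so responses can be matched back to their requests.
export class IdGenerator {
  static generate(): string {
    return randomUUID();
  }
}
```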
handleRequest(functionName: ExternalFunctions, params: any) method
let jobId = await this.sendJob(functionName, params);

// Set 2 minutes timeout
let timeout: number = 1000 * 60 * 2;
try {
  // The Observable helper class throws an error when the timeout is
  // reached and there was no value change
  await this.runningJobs[jobId].finished.waitForValueToChange(true, timeout);
} catch (e) {
  this.log.error(e);
  throw new Error('timeout');
}

let data = (await this.collections.queueCollection.get(jobId)).data;
return data;
handleResponse method
let jobId = response.jobId;

// Make sure the job was generated using this specific pod
if (this.runningJobs[jobId]) {
  if (response.status === 'error') {
    this.runningJobs[jobId].finished.setValue(true);
    // Throw error in case an error is returned
    throw new Error(JSON.stringify(response.error));
  }
  this.runningJobs[jobId].finished.setValue(true);
}

Execution Entity

Now we need to add another deployment to our Kubernetes cluster to function as the execution endpoint. The purpose of this endpoint is to read jobs directly from RabbitMQ, execute them, add the results to MongoDB and publish a completion message to a separate response queue via RabbitMQ.

  • Execution Service
apiVersion: v1
kind: Service
metadata:
  name: execution-entity
spec:
  ports:
    - port: [port number]
      protocol: TCP
      targetPort: [target port number]
  selector:
    app: executionEntity
  type: ClusterIP
  • Execution Deployment
apiVersion: apps/v1
kind: Deployment
metadata:
  labels:
    app: executionEntity
  name: execution-entity
spec:
  replicas: [replica number]
  strategy:
    type: RollingUpdate
  selector:
    matchLabels:
      app: executionEntity
  template:
    metadata:
      labels:
        app: executionEntity
    spec:
      containers:
        - name: execution-entity
          image: [image url]
          command:
            - node
          args:
            - index.js
          ports:
            - containerPort: [port number]
          volumeMounts: []
          env:
            - name: rabbitmqPassword
              value: [password]
            - name: rabbitmqUsername
              value: [username]
  • You can define as many replicas as you want. This implementation supports scaling
  • Username and password should match the ones set previously in rabbitmq configuration

The Code

As in the request entity, we use the same package dependencies here:

  1. amqplib — to connect to RabbitMQ using the AMQP protocol
  2. express — to receive http requests
  3. typescript — optional, for type safety
Global variables
private readonly writeQueue: string = 'responses';
private readonly readQueue: string = 'assignments';
private readonly exchange: string = 'responses';

// The channel in charge of sending responses to requested jobs
private writeChannel: any;

// The channel in charge of receiving job requests
private readChannel: any;

// A mongodb collection to be populated with job result data
private collections: any;
Initialization method
let username = process.env.rabbitmqUsername;
let password = process.env.rabbitmqPassword;
let connectionString = `amqp://${username}:${password}@rabbitmq:5672`;
let connection = await require('amqplib').connect(connectionString);

this.writeChannel = await connection.createChannel();
await this.writeChannel.assertExchange(this.exchange, 'topic');
await this.writeChannel.assertQueue(this.writeQueue);

this.readChannel = await connection.createChannel();
await this.readChannel.assertQueue(this.readQueue);

// Activate the handleRequest method when a new job is received
this.readChannel.consume(this.readQueue, async (msg: any) => {
  try {
    console.log('Received message from rabbitmq');
    let queueResponse = JSON.parse(msg.content.toString());
    await this.handleRequest(queueResponse);
  } finally {
    // Notify rabbitmq the job is done
    await this.readChannel.ack(msg);
    // End the process so that kubernetes will create a new one
    process.exit(0);
  }
});
  • We’re ending the process to allow a brand-new one to be created without any “memory” of previous results (hence we’re not using persistent volumes)
  • This is why it’s highly recommended to have several instances of the execution pod, allowing other pods to manage the workload while the new one is booting
handleRequest(request: any) method
let jobId = request.jobId;
console.log(`Analyzing job ${jobId}. Function name: ${request.functionName}`);
let response: any;
try {
  switch (request.functionName) {
    // Here you will add your own list of function names to run and
    // do whatever you wish
    case 'foo': {
      let data = foo();
      // Add the result to the database
      await this.collections.queueCollection.add({
        createdAt: new Date(),
        jobId,
        data
      });
    }
  }
  // This is the response sent to the queue;
  // we're notifying the request pod the job is done
  response = { jobId, status: 'finished' };
} catch (e) {
  response = { jobId, status: 'error', error: e.message };
}
console.log(`publishing result: ${JSON.stringify(response)}`);

// Publish the result to the response queue
await this.writeChannel.publish(
  this.exchange,
  'rabbitjob.responses',
  Buffer.from(JSON.stringify(response))
);
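The queueCollection wrapper used by both entities is not shown in the article; its assumed contract is just `add` and `get` keyed on the job id. Here is an in-memory sketch of that contract, useful for local testing; in the cluster this would be backed by a MongoDB collection (e.g. insertOne and findOne on a `jobId` field):

```typescript
// Assumed shape of a stored job result
export interface JobResult {
  jobId: string;
  createdAt: Date;
  data: any;
}

// The contract both the request and execution entities rely on
export interface QueueCollection {
  add(result: JobResult): Promise<void>;
  get(jobId: string): Promise<JobResult | undefined>;
}

// In-memory stand-in for local testing; a MongoDB-backed implementation
// would use insertOne/findOne keyed on jobId instead of a Map
export class InMemoryQueueCollection implements QueueCollection {
  private store = new Map<string, JobResult>();

  async add(result: JobResult): Promise<void> {
    this.store.set(result.jobId, result);
  }

  async get(jobId: string): Promise<JobResult | undefined> {
    return this.store.get(jobId);
  }
}
```

Keeping the contract this small is what lets large payloads flow through the database while only job ids travel through RabbitMQ.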

Summary

In this article, we’ve created our own serverless solution over Kubernetes. Although many solutions out there solve the same problem, they all depend on other services, which essentially makes the deployment process dependent on external providers. The solution presented here is lean, and can be implemented with little to no external dependencies.

