Tuesday, September 19, 2017

Alpine Linux install

If you start working with Docker these days, you will inevitably run into Alpine Linux at some point. It offers roughly 1/10 the size of a Debian-based container (5 MB vs. 50 MB), and many application stacks already offer an Alpine-based image to derive your microservices from. Debian can be slimmed down pretty well too, and it has a loyal following for containers and servers alike.
To try it out I decided to build a VM based on Alpine so I can evaluate it as a container image base.
Adventures in the Alpines so far:
  1. There is a pretty good starting guide, with some caveats:
    1. use a bridged connection instead of NAT if you want to see your VM from your host or other machines on the network; it performs better too
    2. use the latest-stable URL to enable the community repository, otherwise you could get some ugly kernel incompatibility errors
  2. Minimalist samba install (loosely based on this wiki page)
    1. apk add samba
    2. mv /etc/samba/smb.conf /etc/samba/smb.conf.bak
    3. nano /etc/samba/smb.conf
    4. Enter the text below into the editor, save and exit
[global]
workgroup = WORKGROUP
netbios name = server1
security = user
map to guest = Bad User
dns proxy = no
  3. Start/enable the service
    1. rc-update add samba
    2. rc-service samba start
  4. Enable ssh access
    1. nano /etc/ssh/sshd_config
    2. #PermitRootLogin prohibit-password => PermitRootLogin yes
    3. rc-service sshd restart
  5. Docker install is fairly simple
    1. apk add docker
    2. rc-update add docker boot
    3. rc-service docker start
  6. Some other utilities to install:
    1. apk add git
    2. apk add nodejs
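To illustrate the point of the exercise, deriving a microservice image from an Alpine-based stack image can look like the sketch below. The `node:alpine` tag and the application layout are assumptions, not part of the setup above:

```dockerfile
# node's official Alpine-based variant; the tag is an assumption
FROM node:alpine
WORKDIR /app
# install dependencies first so this layer is cached between builds
COPY package.json .
RUN npm install --production
COPY . .
CMD ["npm", "start"]
```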

Saturday, September 16, 2017

CentOS howto V - Custom systemd service

  1. nano /etc/systemd/system/cloning-vat.service
ExecStart=/usr/bin/npm start

  2. systemctl daemon-reload
  3. systemctl enable cloning-vat
  4. systemctl start cloning-vat
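The ExecStart line above is only a fragment; a unit file needs at least [Unit], [Service] and [Install] sections before it can be enabled. A minimal sketch, where the description and the /opt/cloning-vat working directory are placeholders:

```ini
[Unit]
Description=cloning-vat node application
After=network.target

[Service]
# WorkingDirectory is an assumption; point it at the directory containing package.json
WorkingDirectory=/opt/cloning-vat
ExecStart=/usr/bin/npm start
Restart=on-failure

[Install]
# lets `systemctl enable cloning-vat` wire the unit into the normal boot target
WantedBy=multi-user.target
```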

CentOS howto IV - Poking holes in security (in a meaningful way)

  1. Add an existing service
    1. firewall-cmd --permanent --zone=public --add-service=https
    2. firewall-cmd --reload
  2. Create a new service (to be added as above). In this example we will be using livereload's default port, 35729
    1. firewall-cmd --permanent --new-service=live-reload
    2. firewall-cmd --permanent --service=live-reload --set-description="live reload"
    3. firewall-cmd --permanent --service=live-reload --set-short="live reload"
    4. firewall-cmd --permanent --service=live-reload --add-port=35729/tcp
  3. Enabling an application (node.js in this example) to bind ports <1024
    1. setcap 'cap_net_bind_service=+ep' /usr/bin/node
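For reference, the service-creation commands in step 2 end up as a service definition that firewalld stores under /etc/firewalld/services/live-reload.xml. It looks roughly like this (a sketch; the exact header and attribute order firewalld writes may differ):

```xml
<?xml version="1.0" encoding="utf-8"?>
<service>
  <short>live reload</short>
  <description>live reload</description>
  <port protocol="tcp" port="35729"/>
</service>
```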

CentOS howto III - Advanced samba

  1. Create shared folder and grant permissions
    1. mkdir /home/shared/
    2. chmod -R 0777 /home/shared/
    3. chown -R nobody:nobody /home/shared/
    4. chcon -t samba_share_t /home/shared/
  2. Add the text below to /etc/samba/smb.conf
[shared]
path = /home/shared
browsable = yes
guest ok = yes
read only = no
  3. Restart samba
    1. systemctl restart smb.service
    2. systemctl restart nmb.service
  4. Test samba
    1. testparm /etc/samba/smb.conf

CentOS howto II - Shared folders

  1. Install Guest additions on the guest
    1. Prerequisites on the guest OS side:
      1. yum install dkms kernel-devel
      2. yum groupinstall "Development Tools"
    2. and now the additions themselves:
      1. Devices / Insert Guest Additions CD image...
      2. mount /dev/sr0 /mnt
      3. cd /mnt
      4. ./VBoxLinuxAdditions.run
    3. Create a shared folder:
      1. Devices / Shared Folders / Shared folder settings
      2. Add new..
      3. Select "Auto mount" and "Permanent" options
      4. you can find the shared folder in /media/sf_{SHARE_NAME}
  2. Host side configuration
    1. {VBoxManage} setextradata {VM_NAME} VBoxInternal2/SharedFoldersEnableSymlinksCreate/{SHARE_NAME} 1 (enables symlink creation in the share)
      1. {VBoxManage} is the VirtualBox management executable (usually in C:\Program Files\Oracle\VirtualBox)
      2. {VM_NAME} is the name of the VM
      3. {SHARE_NAME} is the name of the share as set in the shared folder settings, i.e. in /media/sf_{SHARE_NAME}
    2. Run the VM as administrator. Simplest way to get that going is:
        1. In VirtualBox UI, right click on the VM and "Create Desktop shortcut"
        2. Right click on the desktop icon, properties, advanced, check "Run as administrator"
    3. Start the VM
    4. Watcher configuration: the guest OS has no idea about any changes made outside of its jurisdiction. In this setup that makes watching files from the Linux guest side nearly impossible, unless you use polling. Below is an example taken from my project's browser-sync configuration; something similar should work for any other chokidar based framework as well.
    var sync = require("browser-sync").create();
    sync.init({
      server: 'dist',
      port: 80,
      watchOptions: {
        usePolling: true
      }
    });

    CentOS howto I - basic VM

    1. Install Oracle VirtualBox
    2. Download CentOS DVD ISO: https://www.centos.org/download/
    3. Create a new VM
      1. mount DVD image as optical drive
      2. swap the network adapter type to Bridged network
    4. Start the VM, Install CentOS with "Minimal install"
      1. Start with partitioning, defaults should be good
      2. Go to network
        1. Make sure the adapter is configured to start automatically
        2. Rename the host
      3. Set time zone and ntp
    5. Log in (as root) and make sure the adapter is set to autostart
      1. nmtui
      2. Edit a connection / enp0s8 (check the name in the VM settings)
      3. check Automatically connect
      4. Exit the ui
      5. systemctl restart network
    6. Disable ipv6 (it is sooo slow)
      1. Append the lines below to /etc/sysctl.conf:
    net.ipv6.conf.all.disable_ipv6 = 1
    net.ipv6.conf.default.disable_ipv6 = 1

      2. sysctl -p
      3. nano /etc/ssh/sshd_config
      4. Append this to the config file:

    AddressFamily inet

      5. systemctl restart sshd
    7. Do a minimal samba installation

      1. Configure firewall
        1. firewall-cmd --permanent --zone=public --add-service=samba
        2. firewall-cmd --reload
      2. Install samba
        1. yum install samba 
        2. mv /etc/samba/smb.conf /etc/samba/smb.conf.bak
        3. nano /etc/samba/smb.conf
        4. Enter the text below into the editor, save and exit
    [global]
    workgroup = WORKGROUP
    netbios name = server1
    security = user
    map to guest = Bad User
    dns proxy = no
      3. Enable and start samba
        1. systemctl enable smb.service
        2. systemctl enable nmb.service
        3. systemctl restart smb.service
        4. systemctl restart nmb.service
      4. Test samba
        1. testparm /etc/samba/smb.conf
        2. At this point you should be able to log in with PuTTY as well
      5. Update your new server
        1. yum makecache fast
        2. yum install epel-release
        3. yum update
        4. yum install nano (unless you prefer vi)

    Thursday, July 27, 2017

    The case against automatic memory management (a.k.a. Garbage Collection)

    The state of the memory...

    Microservices are good for scalability: they are small, and if one of your servers becomes hot, you get the option to move them around. They solve the smallest meaningful problem and communicate nicely with each other. CPU, memory and other hardware related issues are things of the past, since we have our machine learning based automated service deployment utility. Life is good... for most of us. Some are cursed with problems less easily "micro-serviced".
    Consider the world's arguably easiest-to-parallelize problem: ray tracing. In ray tracing you can potentially calculate every pixel with a separate service and combine the results into the whole picture at the end of the processing. The problem is that, ideally, you load all the different components of the scene (objects, textures, etc.) into each service's memory. If you are hard pressed, you can start with the wireframe models and load textures on demand. Optimizations exist (e.g. bounding boxes instead of complete objects), but at the end of the day you need to know what the current ray hits, with some precision. A complicated scene might cause out-of-memory problems on all of your rendering microservices at the same time.

    ... and the way we manage it

    Looking at the top 10 programming languages in the Tiobe Index, 75% of the most popular languages utilize some kind of memory management that works independently from what we specify in the code. The notable exceptions are C and C++, where garbage collection is an available but optional part of life: the C++11 specification allows for an optional GC mechanism.
    This means that in 75% of cases we let somebody else's code take care of the memory, while we are becoming increasingly particular about what the CPU is supposed to be doing.
    Considering how easily a memory problem turns into a CPU problem due to increased GC activity, it looks like we tend to play favorites with the CPU while trying to ignore memory as much as we can.

    Any better way to do this?

    I've been playing with Rust for quite some time now - nothing valuable, just some hello world and dice rolling. Rust is not an easy language to begin with; lots of things are done in completely different ways than in, say, Java. Yes, memory management is one of them.
    At its heart, every memory manager (or garbage collector) is a reference counter. We say: "I don't care who owns this stuff, figure out when it is not being used and get rid of it", and the GC tries to do that as unobtrusively as possible.
    In Rust, you do have to figure out who owns your data (variables, references). When you refer to the same data with a new variable, you have to state explicitly whether you want to transfer ownership (read/write access) to it. If yes, the old variable is no longer available!
    This has significant advantages not only for memory management (when the owning variable goes out of scope, the data is deleted); it helps a lot with sharing data across threads as well. Combined with this deterministic cleanup behavior, in Rust you always know what's going on with memory. It is less comfortable than outsourcing the job to some other thread, but definitely more dependable.
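A minimal sketch of the ownership transfer described above (plain Rust, no external crates; the variable names are made up for illustration):

```rust
fn main() {
    let scene = String::from("teapot"); // `scene` owns the heap-allocated string
    let owner = scene;                  // ownership moves to `owner`

    // println!("{}", scene);           // compile error: value used after move

    println!("{}", owner);              // prints "teapot"
}   // `owner` goes out of scope here; the memory is freed deterministically
```

Uncommenting the first println! makes the compiler reject the program outright: the "old variable is no longer available" rule is enforced at compile time, not at runtime.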

    Sunday, July 9, 2017

    How to contribute to github projects?

    Just another "note to self" type entry about open source contribution.
    1. Find a good, welcoming project i.e. here: http://up-for-grabs.net/#/
    2. Fork the GitHub repo to your local repositories: on GitHub, press the fork button. Now you have a copy of the source code (the original repo will be called the upstream repo from now on)
    3. Get the code down to your developer machine
      1. To obtain clone url, go to your repo's page on GitHub and click "clone or download"
      2. git clone https://github.com/vizmi/sequelize.git
    4. Now you have your nice, isolated copy of the original repo. Time to sync it up with the upstream repo
      1. Navigate to the original repo (there's a link to it under the project name)
      2. Get the clone url (click "clone or download")
      3. Set-Location .\sequelize\
      4. git remote -v shows the currently configured remotes. Ideally you have two entries (fetch and push), both pointing to your own repo. We are about to add two more with a single command
      5. git remote add upstream https://github.com/sequelize/sequelize.git
    5. Time to sync the local repo to the upstream repo (to avoid merge conflicts later)
      1. git fetch upstream
      2. git checkout master
      3. git merge upstream/master

    Sunday, January 22, 2017

    What is wrong with scaffolding?

    This is a strongly opinionated post, reflecting my personal standpoint only.
    The point of scaffolding is to get you up and running real quick, which is a good thing, right? Well, not always.
    Let's say you are trying to get something done in a fairly new way (this is what we love about javascript development: every month there's a new way to do the same thing). You found the perfect yeoman generator for it, downloaded it and had a half dozen config files generated for you. Now you are ready to develop the next killer app, except you have no idea which files belong in the git repo, what changes are needed for a production-ready config, and many more similar questions. Now you have to backtrack through whatever template you got, understand why each piece is there and what it actually does, and possibly spend more time on that than doing it step by step.
    The point is: if you are doing something for the first time, it is probably worth doing it without scaffolding.
    For the nth similar javascript project scaffolding is just great!