Ops

Classic Shell Scripting Reading Notes (2)

April 22, 2021 · May 5, 2021
Ops

Sorting text with sort

How locale affects sorting

French words that differ only in where the accents fall

$ cat french

côte

cote

coté

côté

Look up the octal values of the accented characters in ISO 8859-1

$ man iso_8859_1

Oct Dec Hex Char Description

────────────────────────────────────────────────────────────────────

351 233 E9 é LATIN SMALL LETTER E WITH ACUTE

364 244 F4 ô LATIN SMALL LETTER O WITH CIRCUMFLEX

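Sorting with the traditional C (ASCII) locale and with the Canadian-French locale (which must be installed on the system first) gives different results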

$ LC_ALL=C sort french

cote

coté

côte

côté

$ LC_ALL=fr_CA.utf8 sort french

cote

côte

coté

côté
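The comparison above can be reproduced without a file. This is a minimal sketch: the function name sort_c is made up for illustration, and the fr_CA.utf8 run is guarded because that locale may not be installed.

```shell
# Sort the four words in the C locale, i.e. in plain byte order.
sort_c() {
    printf '%s\n' côte cote coté côté | LC_ALL=C sort
}
sort_c

# Only try the Canadian-French collation when that locale is installed.
if locale -a 2>/dev/null | grep -Eqi '^fr_CA\.utf-?8$'; then
    printf '%s\n' côte cote coté côté | LC_ALL=fr_CA.utf8 sort
fi
```

In the C locale every accented letter sorts after the plain ASCII letters, because its UTF-8 bytes are larger than any ASCII byte.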

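Watch out for whitespace

When specifying sort fields with -k:

If no delimiter is given with -t, fields are separated by whitespace, and leading and trailing whitespace is ignored
If -t is given, leading and trailing whitespace is not ignored. For example, " -X- " split with -t " " yields three fields: an empty field, "-X-", and another empty field
If only one field number is given, the key runs from the start of that field to the end of the line
-k{n},{m} means "from the start of field n to the end of field m"
-k{n.i},{m.j} means "from character i of field n to character j of field m"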
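The key formats above can be tried on a small data set. This is only a sketch: the name:uid records and the function names are hypothetical.

```shell
# Hypothetical sample data in name:uid form.
records() {
    printf '%s\n' 'carol:100' 'alice:20' 'bob:3'
}

# Key is field 2 only (-k2,2), compared numerically (n), fields split on ':'.
by_uid() {
    records | LC_ALL=C sort -t: -k2,2n
}

# Key runs from character 2 to character 3 of field 1 ("ar", "li", "ob").
by_name_chars() {
    records | LC_ALL=C sort -t: -k1.2,1.3
}

by_uid
by_name_chars
```

Ending the key explicitly (2,2 rather than just 2) keeps later fields from leaking into the comparison.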

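Stability

If records whose sort fields are all equal come out in a different order from the input, the sort is not stable. GNU sort, part of the coreutils package, makes up for this with the --stable option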

$ sort -t_ -k1,1 -k2,2 << EOF

> one_two

> one_two_three

> one_two_four

> one_two_five

> EOF

one_two

one_two_five

one_two_four

one_two_three
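For comparison, here is the same input sorted with and without stability. A sketch assuming GNU sort, where -s is the short form of --stable; the function names are made up for illustration.

```shell
# The same four records as in the example above.
input() {
    printf '%s\n' one_two one_two_three one_two_four one_two_five
}

# Default: records with equal keys fall back to a whole-line comparison.
unstable() {
    input | LC_ALL=C sort -t_ -k1,1 -k2,2
}

# With -s, records with equal keys keep their input order.
stable() {
    input | LC_ALL=C sort -s -t_ -k1,1 -k2,2
}

unstable
stable
```

All four records share the keys "one" and "two", so the stable run simply reproduces the input order.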

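Removing duplicates

sort -u removes duplicates based on the sort key, not on the whole record. To remove duplicates by whole record instead, pipe the sorted output to uniq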
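The difference between the two approaches shows up on a small made-up data set. The records and function names here are hypothetical.

```shell
# Hypothetical records: a key followed by a value.
data() {
    printf '%s\n' 'a 1' 'a 2' 'b 1' 'a 1'
}

# -u with -k1,1 keeps one record per distinct first field.
unique_by_key() {
    data | LC_ALL=C sort -k1,1 -u
}

# sort | uniq keeps one copy of each distinct whole record.
unique_by_record() {
    data | LC_ALL=C sort | uniq
}

unique_by_key
unique_by_record
```

The first function prints two lines (one per key), while the second prints three, because "a 1" and "a 2" are different records that share a key.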

Classic Shell Scripting Reading Notes (1)

March 20, 2021 · May 5, 2021
Ops

Getting started

printf

printf "The first program always prints '%s, %s!'\n" Hello world

A format specification is a placeholder made up of 1) a percent sign (%) and 2) a specifier. The most common specifiers are %s for strings and %d for decimal integers
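A small sketch that mixes both specifiers; the function name report and its arguments are made up for illustration.

```shell
# %s consumes a string argument, %d a decimal integer argument.
report() {
    printf '%s has %d files\n' "$1" "$2"
}

report /tmp 3
```

Arguments are consumed in order, one per format specification.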

tr

Example: convert a DOS file to UNIX

tr -d '\r' < dos-file.txt | sort > UNIX-file.txt

tr -d: delete the characters listed in source-char-list from stdin
\r: ASCII carriage return
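To see what tr -d '\r' actually removes, the bytes can be inspected with od. A minimal sketch; the function name dos_to_unix is hypothetical.

```shell
# Strip every carriage return from stdin.
dos_to_unix() {
    tr -d '\r'
}

# Two DOS-style lines go in; od -c shows that only \n line endings come out.
printf 'one\r\ntwo\r\n' | dos_to_unix | od -c
```

Note that this deletes every carriage return in the stream, not only those directly before a newline.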

/dev/tty

Example: read a password via /dev/tty

printf "Enter new password: "

stty -echo

read pass < /dev/tty

stty echo

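When a program opens /dev/tty, UNIX redirects it to the real terminal associated with the program, which can be 1) a physical console 2) a serial port 3) a pseudoterminal
stty (set tty) controls the terminal settings; the echo / -echo options turn the automatic echoing of typed input on and off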

i18n and l10n

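Internationalization (i18n) is the process of designing software so that it can be used by specific groups without modifying or recompiling the code
Localization (l10n) is the process of designing software so that a specific group of users can use it, which includes translating output text and adapting currency, date, time, and unit formats
For users, the feature that controls which language or cultural environment is in effect is called the locale
Apart from C and POSIX, locale names are not standardized
According to the book, BSD and Mac OS X do not support locales at all
Locale support is still immature: shell scripts are often affected by the locale, and on most UNIX systems it is hard to tell from locale documentation and tools which characters a character class or equivalence class actually contains, or which collating symbols are available
Shell script authors should understand how locales affect their code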

List all locales

$ locale -a

C.UTF-8

POSIX

en_US.utf8

zh_TW.utf8

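Get information about a specific locale

Set LC_ALL to override the default locale. Categories that can be queried include LC_TIME (date and time formats) and LC_MONETARY (currency formats)

$ LC_ALL=en_US.utf8 locale -ck LC_TIME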

深入理解 Nginx (Understanding Nginx in Depth) Reading Notes (Chapter 2)

October 8, 2020 · May 5, 2021
Dev, Ops

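Relationship between processes

Nginx can serve requests with only a single process (the master)
The usual deployment uses one master process to manage multiple worker processes
With as many workers as CPU cores, the cost of process switching is lowest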
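As a rough sketch of the "one worker per core" rule, the following prints a worker_processes line sized to the machine. The function name worker_directive is made up, and the snippet assumes getconf _NPROCESSORS_ONLN is available, falling back to 1 when it is not.

```shell
# Print a worker_processes directive matching the number of online CPU cores.
worker_directive() {
    cores=$(getconf _NPROCESSORS_ONLN 2>/dev/null || echo 1)
    printf 'worker_processes %d;\n' "$cores"
}

worker_directive
```

Nginx also accepts worker_processes auto; which lets it detect the core count itself.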

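Benefits of using multiple processes

The master process focuses purely on management and provides command-line services to the administrator (start, stop, reload configuration, upgrade)
The master process needs fairly broad privileges and is usually started as the root user
When one worker process fails, the other workers can still serve requests normally
It makes full use of the SMP (symmetric multiprocessing) multi-core architecture, achieving truly concurrent multi-core processing at the micro level
Workers normally do not go to sleep: each can handle many requests at the same time, unlike Apache, where each process handles only one request at a time and process switching is therefore expensive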

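Configuration syntax

Each module has its own directives of interest, and most modules are enabled only after a particular directive is read from nginx.conf. For example, ngx_http_module is enabled only when http {…} is configured, and only then can the modules that depend on it work properly

Block directives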

http {
    ...
    gzip on;
    server {
        ...
        location /webstatic {
            gzip off;
        }
    }
}

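A block directive consists of a name and a pair of braces; http, server, and location are all block directives
The parameters it accepts depend on the module that parses the block directive
The braces mean that the directives inside them take effect together
Blocks can be nested, and an inner block inherits directly from the outer one
When inner and outer settings conflict, which level wins depends on the module that parses the directive, as with the gzip switch in the example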

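Directive syntax

name param1 param2;

The name must be valid (one that some Nginx module wants to handle)
The accepted parameters depend on the module that parses the directive
If any parameter contains a space, wrap it in single or double quotes
The directive ends with a semicolon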


深入理解 Nginx (Understanding Nginx in Depth) Reading Notes (Chapter 1)

September 25, 2020 · October 16, 2020
Dev, Ops

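Why choose Nginx

Faster: 1) a single request gets a quicker response 2) it responds faster than other servers at peak load
Highly extensible: 1) built from modules with very low coupling 2) all modules are compiled into the binary and run there
Highly reliable: 1) stable modules 2) relatively independent processes 3) a failed worker is replaced quickly
Low memory use: 1) 10,000 inactive HTTP Keep-Alive connections consume only 2.5 MB
High concurrency: 1) a single machine supports more than 100,000 connections
Hot deployment: 1) built on the separation of the master and worker processes 2) the executable can be upgraded, the configuration reloaded, and log files switched without interrupting service
BSD license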

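Development prerequisites

Required

Linux kernel 2.6 or later (epoll is needed to handle high concurrency)
GCC, to compile the C code

Optional

G++, to compile C++ when writing HTTP modules in C++
PCRE (Perl Compatible Regular Expressions), to use regular expressions in the configuration file; pcre-devel is what is needed for development against PCRE
zlib, to gzip-compress HTTP content and reduce network traffic
OpenSSL, to support the SSL protocol or to use MD5 or SHA hashes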

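Directory layout

Source code directory
Intermediate build files (placed under the source directory, named objs)
Deployment directory (defaults to /usr/local/nginx)
Log directory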

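Linux kernel parameter tuning

Kernel parameters need to be adjusted so that Nginx can reach higher performance
Tuning usually follows the workload: a content server, a reverse proxy, and a thumbnail server each call for different settings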


HBase Basics

July 22, 2020 · July 27, 2020
Ops

Apache HBase is an open source, scalable, consistent, low latency, random access data store

Source: Infinite Skills

Features

Horizontally Scalable

Linear increase in servers results in linear increases in storage capacity and I/O operations

CAP Trade off

In terms of the CAP theorem, HBase is closer to a CP system

Consistency: ACID (atomicity, consistency, isolation, durability) guarantees on rows
Availability: Response time of 2-3 ms from cache, 10-20 ms from disk
Partition Tolerance: Failures don’t block the system. It might take longer to respond in order to maintain consistency

Dependencies

Apache ZooKeeper

Used for distributed coordination of leaders for high availability
Optimized to be highly available for reads
Not designed to scale for high write throughput

Apache Hadoop HDFS

Provides data durability and reliability
Optimized for sequential reads and writes of large files
Does not provide random updates, only a simple API for random reads
Cannot scale to tens of billions of small entities (less than a few hundred MB)

Both systems have their strengths but do not individually provide the same properties as HBase

Random Access

Optimized for small random reads

Entities indexed for efficient random reads

Optimized for high throughput random writes

Updates without requiring read
Random writes via Log Structured Merge (LSM)

Short History

Inspired by Google’s Bigtable

Bigtable: A Distributed Storage System for Structured Data (2006)

BigTable

Datastore for Google’s Web Crawl Table

Store web page content
Web URL as key
Use MapReduce to find links and generate backlinks
Calculate page rank to build the Google index

Later, it was also used as the backend for Gmail, Google Analytics, Google Earth, etc.

Hadoop HDFS

Inspired by Google distributed file system GFS

Timeline

Since 2009, many companies (Yahoo, Facebook, eBay, etc.) have chosen HBase for large-scale production use cases

In 2015, Google announced Bigtable with an HBase 1.0 compatible API for its Compute Engine users

2017, HBase 2.0.0

2020, HBase 3.0.0

Although HBase is bucketed into the NoSQL category of data stores, some interesting projects are moving NoSQL back toward SQL by using HBase as the storage engine for SQL-compliant OLTP database systems.

Use case

HBase’s strengths are its ability to scale and sustain high write throughputs

Many HBase apps are:

Ports from RDBMS to HBase
New low-latency big data apps

How to port an RDBMS to HBase?

Many RDBMSs are painful to scale
Scaling up is no longer practical for massive data
Data inconsistency is not acceptable when scaling reads
Operations get more complicated as the number of replicas increases
Operational techniques are not sufficient when scaling writes

To make it easier to scale, we need to give up some fundamental features that an RDBMS provides, such as:

text search (LIKE)
joins
foreign keys and constraint checks

Change the schema so that it contains only denormalized tables; that way, sharding the RDBMS does not incur replication I/O

At that point, porting the RDBMS to HBase is relatively straightforward

Why choose HBase instead?

When your apps need high write and read throughput
When you are tired of the RDBMS’s fragile scaling operations

Data Volumes

Entity data: information about the current state of a particular person or thing
Event data (or time series data): records of events that are generally spaced over many time intervals

Data volume explodes when we need both of them

HBase or Not

Q: Does your app expect new data to be available immediately after an update?

Yes: Use HBase
- When data queried, must reflect the most recent values
- Expect query responses in milliseconds
No: No need for HBase

Q: Is your app analytical or operational?

Analytical: Not optimal for HBase
- Looks at large sets of data
- Often filters for a particular time range
- Better to choose Hadoop
Operational: Use HBase
- Looks for a single entity or a small set of entities

Q: Does your app expect updates to be available immediately after an update?

Yes: Use HBase
- Frequently modified
- Pinpoint deletes
- Updates must be reflected within milliseconds
No: No need for HBase
- Data is append-only
- Deletes in bulk or never
- Updates can be ignored until the next report is run

Comparison

Workload	HBase	Hadoop
Low Latency	1 ms from cache, 10 ms from disk	1 min via MR/Spark, 1 s via Impala
Random Read	Rowkey is the primary index	The small file problem
Short Scan	Sorted and efficient	Bespoke partitioning can help
Full Scan	Possible but non-optimal; improved perf with MR on snapshots	Optimized with MR, Hive, Impala
Updates	Optimized	Not supported

Networking

Networking for Linux Basics

Network Switch

A switch is a device that connects other devices together within a computer network; it only enables communication inside that one network

Host A(192.168.1.10)[eth0] <–> Switch(192.168.1.0) <–> [eth0]Host B(192.168.1.11)

# On Host A

$ ip link

$ ip addr add 192.168.1.10/24 dev eth0 # assign an IP address to interface eth0

# On Host B

$ ip link

$ ip addr add 192.168.1.11/24 dev eth0

# Test

$ ping 192.168.1.11

Router

A router is a device/service that provides the function of routing IP packets between networks

Switch(192.168.1.0) <–> [192.168.1.1]Router[192.168.2.1] <–> Switch(192.168.2.0)

Route/Gateway

A gateway, in networking terms, is the router that provides connectivity from one network to another

# On hosts in Network A

$ ip route add 192.168.2.0/24 via 192.168.1.1

# On hosts in Network B

$ ip route add 192.168.1.0/24 via 192.168.2.1

Default Gateway

If none of these forwarding rules in the routing table is appropriate for a given destination address, the default gateway is chosen as the default router of last resort

$ ip route show default

Forwarding packets between interfaces

By default, Linux does not forward packets from one interface to another, for security reasons

Explicitly allow it

echo 1 > /proc/sys/net/ipv4/ip_forward

Persist the setting in /etc/sysctl.conf

net.ipv4.ip_forward = 1
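Persisting the setting can be scripted so that the line is added only once. This is a sketch under assumptions: the function name persist_ip_forward is made up, it takes the target file as an argument (defaulting to /etc/sysctl.conf), and the demo writes to a temporary file rather than to the real configuration.

```shell
# Append "net.ipv4.ip_forward = 1" to the given file unless it is already there.
persist_ip_forward() {
    conf=${1:-/etc/sysctl.conf}
    if ! grep -Eq '^[[:space:]]*net\.ipv4\.ip_forward[[:space:]]*=[[:space:]]*1' "$conf" 2>/dev/null; then
        echo 'net.ipv4.ip_forward = 1' >> "$conf"
    fi
}

# Demo against a scratch file; the second call is a no-op.
demo=$(mktemp)
persist_ip_forward "$demo"
persist_ip_forward "$demo"
cat "$demo"
rm -f "$demo"
```

On a real host this would be run as root against /etc/sysctl.conf, followed by sysctl -p to load the file without rebooting.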

DNS

Host names can be translated to IP addresses by adding entries to /etc/hosts

When an environment has too many entries and IP addresses are not persistent, we need a DNS server

$ cat /etc/resolv.conf

nameserver 192.168.1.100

The host looks up an entry in /etc/hosts first, then in DNS. This order can be changed in the configuration file /etc/nsswitch.conf

$ cat /etc/nsswitch.conf

passwd: files

group: files

shadow: files

gshadow: files

hosts: files dns

networks: files

protocols: db files

services: db files

ethers: db files

rpc: db files

netgroup: nis
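To check the lookup order on a given machine, the hosts line can be pulled out of the file. A small sketch; the function name hosts_order is made up, and it accepts an alternative file path so it can be tried on a copy.

```shell
# Print the sources listed on the "hosts:" line, in lookup order.
hosts_order() {
    awk '$1 == "hosts:" {
        for (i = 2; i <= NF; i++) printf "%s%s", $i, (i < NF ? " " : "\n")
    }' "${1:-/etc/nsswitch.conf}"
}

# Show the order on this machine, if the file exists.
if [ -r /etc/nsswitch.conf ]; then
    hosts_order
fi
```

With the file shown above this prints "files dns", meaning /etc/hosts is consulted before the DNS server.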

You can configure the DNS server to forward unknown host names to public name servers on the Internet, for example to reach www.google.com

private DNS → Root DNS → .com DNS → google DNS → cache the result

When looking for a host in the same domain, we want to use just the host name rather than the full name, such as web instead of web.mycompany.com, so we specify the domain name to append in /etc/resolv.conf

$ cat /etc/resolv.conf

search mycompany.com

DNS stores records of specific types:

A: host name to IPv4 address
AAAA: host name to IPv6 address
CNAME: name-to-name mapping

You can use tools like nslookup and dig to debug. Note that nslookup only queries DNS, not local files such as /etc/hosts

There are plenty of DNS solutions, such as CoreDNS. Besides configuring entries from files, CoreDNS supports other ways of configuring DNS entries through plugins, such as the kubernetes plugin

Network Namespace

A namespace is a way of scoping a particular set of identifiers

Linux provides namespaces for networking and processes. A process running within a process namespace can only see and communicate with other processes in the same namespace

Linux starts up with a default network namespace

Each network namespace has its own routing table and has its own set of iptables

# Create namespaces

ip netns add red

ip netns add blue

# List namespaces

ip netns list

# List interfaces on the host

ip link

# List interfaces in a namespace

ip netns exec red ip link

# or

ip -n red link

Connect namespaces together using a virtual Ethernet pair (or virtual cable, pipe)

# Create veth pair

$ ip link add veth-red type veth peer name veth-blue

# Attach each interface to the appropriate namespace

$ ip link set veth-red netns red

$ ip link set veth-blue netns blue

# Assign an IP address to each namespace

$ ip -n red addr add 192.168.15.1/24 dev veth-red

$ ip -n blue addr add 192.168.15.2/24 dev veth-blue

# Bring up the interface for each device within the respective namespace

$ ip -n red link set veth-red up

$ ip -n blue link set veth-blue up

# List the ARP table to see neighbors

$ ip netns exec red arp

# Ping across namespaces

$ ip netns exec red ping 192.168.15.2

When more namespaces need to be connected, use a virtual switch to create a virtual network. There are a few solutions:

Linux Bridge
Open vSwitch

# Create a virtual switch interface

$ ip link add v-net-0 type bridge

# Bring the interface up

$ ip link set dev v-net-0 up

# Create cables for each namespace to connect to the bridge

$ ip link add veth-red type veth peer name veth-red-br

$ ip link add veth-blue type veth peer name veth-blue-br

# Attach one end to the appropriate namespace

$ ip link set veth-red netns red

$ ip link set veth-blue netns blue

# Attach the other end to the bridge

$ ip link set veth-red-br master v-net-0

$ ip link set veth-blue-br master v-net-0

# Assign an IP address to each namespace

$ ip -n red addr add 192.168.15.1/24 dev veth-red

$ ip -n blue addr add 192.168.15.2/24 dev veth-blue

# Bring up the interface for each device within the respective namespace

$ ip -n red link set veth-red up

$ ip -n blue link set veth-blue up

# Bring up the bridge-side ends of the cables as well

$ ip link set veth-red-br up

$ ip link set veth-blue-br up

# Assign an IP address to the bridge (since it’s just another interface on the host)

$ ip addr add 192.168.15.3/24 dev v-net-0

# Ping across namespaces

$ ip netns exec red ping 192.168.15.2
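Since the same steps repeat for every namespace, they can be wrapped in a function. This is only a sketch: run and attach_ns are hypothetical helper names, the /24 prefix and the step that brings up the bridge-side end are my additions, and by default the commands are only printed (DRY_RUN=1) because running them requires root, an existing namespace, and the v-net-0 bridge.

```shell
# Print the command when DRY_RUN is 1 (the default here), otherwise execute it.
run() {
    if [ "${DRY_RUN:-1}" = 1 ]; then
        echo "$*"
    else
        "$@"
    fi
}

# attach_ns NAME ADDR [BRIDGE]: cable a namespace to the bridge and configure it.
attach_ns() {
    ns=$1 addr=$2 bridge=${3:-v-net-0}
    run ip link add "veth-$ns" type veth peer name "veth-$ns-br"
    run ip link set "veth-$ns" netns "$ns"
    run ip link set "veth-$ns-br" master "$bridge"
    run ip -n "$ns" addr add "$addr" dev "veth-$ns"
    run ip -n "$ns" link set "veth-$ns" up
    run ip link set "veth-$ns-br" up
}

attach_ns red 192.168.15.1/24
attach_ns blue 192.168.15.2/24
```

Setting DRY_RUN=0 as root would execute the commands instead of printing them.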

When a private virtual network needs to reach an outside network, it needs a gateway; here the host is the gateway

$ ip netns exec red ip route add 192.168.1.0/24 via 192.168.15.3

For the destination network to be able to respond, enable NAT on the host acting as the gateway.

Add a new rule to the POSTROUTING chain of the nat table to masquerade, that is, to replace the source address on all packets coming from the source network 192.168.15.0/24 with the host's own IP address.

Thus anyone receiving these packets outside the network will think that they are coming from the host and not from within the namespaces

$ iptables -t nat -A POSTROUTING -s 192.168.15.0/24 -j MASQUERADE

Add a default route through the gateway to reach the outside world

$ ip netns exec red ip route add default via 192.168.15.3

For the outside world to reach a namespace in the private network, add a port forwarding rule with iptables saying that any traffic coming to port 80 on the host is forwarded to port 80 on the IP assigned to the namespace

$ iptables \
-t nat \
-A PREROUTING \
-p tcp \
--dport 80 \
-j DNAT \
--to-destination 192.168.15.1:80
