2018-01-06

Emacs Coding-Systems

Emacs

suffixは、いろいろあるっぽい

...-unix
...-dos
...-mac

https://www.gnu.org/software/emacs/manual/html_node/emacs/Coding-Systems.html

2018-01-06

JSON_AS_ASCII

Python

JSON_AS_ASCII

JSONの中の文字(non ASCII)をASCIIにエンコードしない。

By default Flask serialize object to ascii-encoded JSON. If this is set to False Flask will not encode to ASCII and output strings as-is and return unicode strings. jsonify will automatically encode it in utf-8 then for transport for instance.

http://flask.pocoo.org/docs/0.12/config/

なのだけど、例のUnicodeDecodeErrorでコケた。

Flask 0.12.2
Python 2.7.13

2018-01-06

Emacs let*

Emacs

Emacs let*

letとlet*はちがうよって話。

(setq x 1)
;;; letは同時にバインドされるので外側のxを参照する
(let ((x (+ x 3))
      (y (+ x 2)))                      ; この時点でのxは1
  (+ x y))                              ; => 7
;;; let*は直前のローカル変数代入の影響を受ける
(let* ((x (+ x 3))
       (y (+ x 2)))                     ; この時点でのxは4
  (+ x y))                              ; => 10

2018-01-05

memo pub sub link

開発

memo pub sub link

https://ja.wikipedia.org/wiki/%E5%87%BA%E7%89%88-%E8%B3%BC%E8%AA%AD%E5%9E%8B%E3%83%A2%E3%83%87%E3%83%AB

https://en.wikipedia.org/wiki/Messaging_pattern

https://msdn.microsoft.com/en-us/library/aa480027.aspx

2018-01-05

loose coupling

開発

loose coupling

疎結合の英語。

https://en.wikipedia.org/wiki/Loose_coupling

2018-01-04

memo: GET website including non-ASCII in request.el

Emacs

概要

request.elで、日本語とかのASCIIじゃない文字列が混ざったページに対してGETすると、curlの結果が文字化けする。

(require 'request)
(request "http://rubikitch.com/"
         :parser 'buffer-string
         :complete (function*
                    (lambda (&key data &allow-other-keys)
                      (switch-to-buffer "*request-result*")
                      (erase-buffer)
                      (insert data))))

理由

このプルリク。elispについて、サブプロセスでcurlを実行しているけど、実行するバッファのエンコードが binary になっている。 Win/Mac/Linux全部に対応しないといけないライブラリなので、binaryにすることで動かなくなるのを回避しているのだと思う。

このプルリクの修正が直接の原因。 github.com

でも、binaryにするとASCIIじゃない、具体的には日本語のサイトでGETすると文字化けする。 github.com

対策

こう書いて、dataのエンコードをutf-8とかにする。

(require 'request)
(request "http://rubikitch.com/"
         :parser 'buffer-string
         :complete (function*
                    (lambda (&key data &allow-other-keys)
                      (switch-to-buffer "*request-result*")
                      (erase-buffer)
                      (insert (decode-coding-string data 'utf-8)))))

ってあるけど、正直知らないとわからないから、ドキュメントやテストには明記したいね。

2018-01-04

hive insert overwrite

Hive

hive insert overwrite

Hiveではinsertは追加ではなく上書きで、partitionが動的に作成されている

INSERT OVERWRITE TABLE テーブル名
[PARTITION (項目名=値, …)]
SELECT文 FROM 元テーブル名

https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DML

http://www.ne.jp/asahi/hishidama/home/tech/apache/hive/insert.html#INSERT_OVERWRITE

2018-01-04

set-process-coding-system

Emacs

set-process-coding-system

サブプロセスに対して送るencodeの指定

(set-process-coding-system PROCESS &optional DECODING ENCODING)

Set coding systems of PROCESS to DECODING and ENCODING.
DECODING will be used to decode subprocess output and ENCODING to
encode subprocess input.

M-x describe-function で見られる説明はこれだけで、どんなエンコードを設定できるかは書いていない。

2018-01-03

request.el parser

Emacs

request.el parser

https://tkf.github.io/emacs-request/manual.html

request.elのparserについて。レスポンスボディをどうやってパースするかを決める。 json-readを使う場合は、JSONのデータ構造を決めることができる。この例の場合だと、json-readしたデータをplistにすることが出来る。

(request
 "http://..."
 :parser (lambda ()
           (let ((json-object-type 'plist))
             (json-read)))
 ...)

すべてのレスポンスボディをstringにしたいなら、 buffer-string にする。

(request
 "http://..."
 :parser '(buffer-string)
 ...)

2018-01-03

JSON example

JSON

JSON example

JSONの公式サイトなので。

http://json.org/example.html

2018-01-03

aws-cli 1.14.18

AWS

aws-cli 1.14.18

あと少し。

Release 1.14.18 · aws/aws-cli · GitHub

Update rds command to latest version

botocore 1.18.22にアップデート。

Merge branch 'release-1.14.18' into develop · aws/aws-cli@121a51f · GitHub

2018-01-02

read-string

Emacs

read-string

read-string prompt &optional initial

ミニバッファから文字列を読んで、それを返す

2018-01-02

y-or-n-p

Emacs

y-or-n-p

ユーザーに問い合わせ、エコー領域で入力を待ち、yを打てばtを、nを打てばnilを返す。spcをy delをnとみなす。それ以外の応答だとyかnを入力しろと怒られる。

2018-01-02

mapred.job.reuse.jvm.num.tasks

Hadoop

mapred.job.reuse.jvm.num.tasks

If you have very small tasks that are definitely running after each other, it is useful to set this property to -1 (meaning that a spawned JVM will be reused unlimited times). So you just spawn (number of task in your cluster available to your job)-JVMs instead of (number of tasks)-JVMs.

タスクがものすごく小さかったら、-1にして常に再利用するのをすすめる。 1にすると再利用しない。

Amazon EMR は mapred.job.reuse.jvm.num.tasks の値を 20 に設定しますが、これはブートスラップアクションによってオーバーライドすることができます。値を -1 にすると 1 つのジョブ内でいつまでも再利用が行われ、1 にするとタスクは再利用されません。

タスク間でJVMを共有してフレームワークのオーバーヘッドを低下させる意図。JVMの起動にはコストがかかるので、多くの小さいファイルを処理する場合はJVMを何度も再利用して起動のコストを下げる。処理に時間がかかる場合は、すべてのメモリが確実に解放されるようにJVMを再利用しないようにする。

2018-01-02

SaxParseException

Hadoop

SaxParseException

なんのことかよくわからない

by shigemk2

当面は技術的なことしか書かない

Emacs Coding-Systems

JSON_AS_ASCII

Emacs let*

memo pub sub link

loose coupling

memo: GET website including non-ASCII in request.el

概要

理由

対策

hive insert overwrite

set-process-coding-system

request.el parser

JSON example

aws-cli 1.14.18

read-string

y-or-n-p

mapred.job.reuse.jvm.num.tasks

SaxParseException