Generating pprof profiling data

pprof

pprof is a visualization tool for profiling data and all the backend services in a Sourcegraph instance have the ability to generate pprof profiling data.

Exposing the debug port to generate profiling data

Follow the instructions below to generate profiling data. We will use the Sourcegraph frontend and a memory profile as an example (the instructions are easily adapted to any of the Sourcegraph backends and any profiling kind).

Sourcegraph with Docker Compose

See expose debug port in Docker Compose.

Sourcegraph with Kubernetes

If you're using the Kubernetes cluster deployment,you need to port-forward 6060 from the frontend pod (if you have more than one, choose one):

kubectl get pods
kubectl port-forward sourcegraph-frontend-xxxx 6060:6060

Single-container Sourcegraph

See expose debug port in single-container Sourcegraph.

Generating profiling data

Once the port is reachable, you can trigger a profile dump by sending an HTTP request: (in the browser or with curl, wget or similar):

curl -sK -v http://localhost:6060/debug/pprof/heap > heap.out

Once the heap.out file has been generated, share it with Sourcegraph support or your account manager for analysis.

Downloading the binary to use with `go tool pprof`

If you want to use the downloaded profile with go tool pprof you need the binary that produced the profile data.

You can use kubectl to download it:

kubectl cp sourcegraph-frontend-xxxx:/usr/local/bin/frontend frontend-bin

Then you can use go tool pprof:

go tool pprof frontend-bin heap.out

Debug ports

This is a table of Sourcegraph backend debug ports in the two deployment contexts:

	kubernetes	docker
frontend	6060	6063
gitserver	6060	6068
searcher	6060	6069
symbols	6060	6071
repo-updater	6060	6074
zoekt-indexserver	6060	6072
zoekt-webserver	6060	3070

Profiling kinds

allocs: A sampling of all past memory allocations
block: Stack traces that led to blocking on synchronization primitives
cmdline: The command line invocation of the current program
goroutine: Stack traces of all current goroutines
heap: A sampling of memory allocations of live objects. You can specify the gc GET parameter to run GC before taking the heap sample.
mutex: Stack traces of holders of contended mutexes
profile: CPU profile. You can specify the duration in the seconds GET parameter. After you get the profile file, use the go tool pprof command to investigate the profile.
threadcreate: Stack traces that led to the creation of new OS threads trace: A trace of execution of the current program. You can specify the duration in the seconds GET parameter. After you get the trace file, use the go tool trace command to investigate the trace.

Adapting the URL will generate different profile data, for example

curl -sK -v http://localhost:6060/debug/pprof/profile > profile.out

will generate a CPU profile.