Git

This page is about the version control software Git and how to use it on the client side. This wiki page covers the server side.

References

Homepage: http://git-scm.com/
Pro Git book: http://book.git-scm.com/
Git User Manual: http://www.kernel.org/pub/software/scm/git/docs/user-manual.html
Git - SVN Crash Course: http://git-scm.com/course/svn.html
Git Magic: http://www-cs-students.stanford.edu/~blynn/gitmagic/index.html
Link collection: http://git-scm.com/documentation
man pages: git, gittutorial, gittutorial-2, gitcore-tutorial, gitglossary

GUI clients

GitX used to be a promising Mac OS X client. Apparently it was abandoned upstream. The project was subsequently forked by enthusiasts, but I have not followed their progress.
These days I use SourceTree by Atlassian. The app is free to use, although you need to register to get the free license.
GitHub also has a Mac OS X app, but I don't really use it except once in a while for doing something that is too complicated with SourceTree or on the command line (I forget, though, what exactly that is)

Creating and configuring local repositories

git init: Administration of repositories

Create a new non-bare repository in .git in the current directory:

git init

(set the GIT_DIR variable to create the repository in a directory named differently, or in a different location; by default, GIT_DIR points to .git)

Create a new bare repository:

mkdir repodir
cd repodir
git init --bare

Difference between bare and non-bare repositories:

A non-bare repository has a working tree and a hidden directory .git containing the version control information
A bare repository just contains the version control information and no working tree. All the contents of the .git directory are placed in the main directory itself
Only bare repositories can be the target of a push
The purpose of bare repositories is for having a central (usually remote) repository that a number of people can push to
To convert a bare into a non-bare repository: Clone the bare repo, then delete the original
To convert a non-bare repo into a bare one:

git clone --bare -l /path/to/non/bare/repo /path/to/new/bare/repo

Working Tree vs. Working Copy

In Subversion, users check out one revision from the central, shared repository into a directory that is then called the "working copy". The working copy therefore contains one, and only one, revision (this is simplified, but I need it to be so that I can make a useful comparison to git).

In git, the "working copy" is called "working tree". However, the directory space that contains the working tree at the same time also stores the git repository. As opposed to Subversion where each directory has its own .svn directory, the git working tree has exactly one .git directory in its root folder: That folder contains the entire git repository with all the branches.

A single git repository can track an arbitrary number of branches, but your working tree is associated with just one of them (the "current" or "checked out" branch), and HEAD points to that branch (= the tip, or head, of that branch).

Configuration file

The git configuration file contains a number of variables that affect the behaviour of git commands. A non-comprehensive list is available on the man page of git config. Configuration files exist on three levels:

.git/config file for each repository is used to store the information for that repository
$HOME/.gitconfig is used to store per user information
The file /etc/gitconfig can be used to store system-wide defaults

To write to a configuration file, use the command

git config <section.variable> <value>

Using the --global argument writes to the user specific configuration file, --system writes to the system-wide defaults. With no argument, the repository specific configuration file will be written to.

Important settings, or settings that I like to have are:

User name and email address in ~/.gitconfig. These are used for things like git-commit

git config --global user.name "Patrick Näf"
git config --global user.email herzbube@herzbube.ch
git config --global user.signingkey 3FF38573

User specific gitignore patterns:

git config --global core.excludesfile "$HOME/.gitignore"

Use colors for git-status and git-diff

git config --global color.status true
git config --global color.diff true

Don't store a backup file with .orig extension after a successful merge

git config --global mergetool.keepBackup false

Ignoring files

Files generated by a build process (e.g. object files), or by the operating system (.DS_Store), or whatever, should not be versioned. git ignores those files if you tell it their names. You do so by specifying so-called gitignore patterns, either on the command line of certain git commands, or in so-called gitignore files.

Patterns are read from various sources in the following order (this list is taken almost verbatim from the man page of gitignore(5):

Patterns read from the command line for those commands that support them.
Patterns read from a .gitignore file in the same directory as the path, or in any parent directory, with patterns in the higher level files (up to the root) being overridden by those in lower level files down to the directory containing the file. These patterns match relative to the location of the .gitignore file. A project normally includes such .gitignore files in its repository, containing patterns for files generated as part of the project build.
Patterns read from $GIT_DIR/info/exclude.
Patterns read from the file specified by the configuration variable core.excludesfile; you would set that variable by saying, for instance:

git config --global core.excludesfile "$HOME/.gitignore"

Example for $HOME/.gitignore:

.DS_Store
.svn/

Complaining about whitespace

Almost all editors I have encountered add unnecessary whitespace at the end of a line in certain situations, usually when they try to help with line indentation. Some editors have an option that removes trailing whitespace when a file is saved, but most do not. Fortunately, git has support for checking for common whitespace problems.

The option core.whitespace in ~/.gitconfig allows to define which whitespace problems should be noticed whenver a whitespace check is run. The default already enables checking for the most important problem, blank-at-eol, so usually you will not have to modify your .gitconfig file.

To enable whitespace problem checks in a repository (local or remote), you can enable the default pre-commit hook:

cd /path/to/repo
cd .git/hooks
mv pre-commit.sample pre-commit

Basic operations

git add: Adding files/directories or making changes to existing files/directories

Files:

a new file or directory needs to be added using git add
a file whose content has changed needs to be added using git add
when git add is run it looks at the file's current content and determines what needs to be added; the content is said to be staged for inclusion in the next commit
when a file's content changes after git add has been run, git add needs to be run AGAIN because the new content is NOT automatically staged for inclusion

Directories:

it seems that a new empty directory can NOT be added using git add; I was unable to do this, and so far I did not find information about this special (mis)behaviour of git
if a directory contains files, it is sufficient to git add the directory; the operation will then recursively iterate over the files; if another file is later added to the directory, the new file is NOT automatically staged for inclusion - git add needs to be run AGAIN

git mv: Renaming or moving files/directories

Existing files and directories can be renamed or moved to a new location using git mv. The result must still be committed.

Note: If a directory becomes empty due to a move operation, the next commit will remove it from source control. If people pull the change, the directory will disappear on their side, too. If a puller has a local change in the directory, the directory will not be deleted, though.

git rm: Removing files/directories

Existing files can be removed using git rm. The result must still be committed.

Note 1: Directories are not normally removed, unless the -r option is specified.

Note 2: If a directory becomes empty due to a remove operation, the same rules apply as with git mv.

git status/diff: See local changes

git status prints out

which changes will be committed next time git commit is run
which changes have not been staged for committing yet; Note: Empty directories do not appear here; directories only appear if they have at least one file inside

git diff prints out

the changes that have not been staged yet
in other words: the difference between the working tree and the index

git diff --cached prints out

the changes that have been staged and will be included in the next commit
in other words: the difference between the index and the HEAD of the current branch (usually "master")

Show all changed files between two commits:

git diff --name-only SHA1 SHA2
git diff --name-only TAG1 TAG2

git reset: Undo changes

git reset can be used to undo all sorts of changes, including destroying commits already made. The command is rather dangerous and you must know what you are doing or you may damage your repository...

Unstage all files that have been staged with git add, keeping all local changes:

git reset

Unstage a single file:

git reset foo.c

Discard unstaged changes for a single file. Surprisingly, this does not require git reset. Warning: There is no warning, the changes are immediately discarded!

git checkout foo.c

Throw away all local changes that have not been committed yet (this is useful after a merge, e.g. to throw away the merge results because of too many conflicts):

git reset --hard

Discard the last commit(s) from the repository, including all changes that were made in that commit (SO question shows a way to get the commit back before 90 days have elapsed):

git reset --hard HEAD^   # discard last commit
git reset --hard HEAD^^  # discard last 2 commits

Discard the last commit from the repository, but keep the changes that were made in that commit in the working tree. This is useful, for instance, if the commit was incomplete, or just not quite right, and you want to redo the commit with a few changes. Note that git commit --amend is simpler if you just need to edit the commit message, or add a file that was forgotten.

git reset HEAD^          # leave changes in the working tree, but not the index; the old head is stored in .git/ORIG_HEAD
git reset --soft HEAD^   # leave changes in the working tree AND in the index
<do some changes>
git add .
git commit -c ORIG_HEAD  # redo commit, re-using the previous commit message (can still be edited)

git clean: Remove untracked files from the working tree

Remove everything not under version control. The -x option makes sure that ignored files are removed as well. This is useful to remove, for instance, build results. In conjunction with git reset --hard (which discards all local changes that have not been committed yet) this command can be used to clean up a repo to get it into a state similar to after it is cloned.

git clean -dfx

git stash: Temporarily stash all local changes

Sometimes one needs to interrupt the current work and do something else. A useful workflow is this:

Temporarily stash all local changes and revert to a clean working tree
Do something else, probably commit
Get back the changes that have been stashed away and resume the original work

The commands for this are:

git stash
# do some work
git commit
git stash apply   # the stash is kept
git stash pop     # the stash is applied and then thrown away

It is possible to have multiple stashes. Useful commands:

git stash blablabla          # create new stash with a message
git stash list               # list all stashes
git stash show -p stash@{1}  # display diff (-p = patch format) between named stash and its original parent
git stash pop stash@{1}      # apply the named stash
git stash drop stash@{1}     # throw away the named stash
git clear                    # throw away all stashes

git commit: Make changes to the repository

Commit staged changes:

git commit -m "bla bla bla"

Notes:

Author name and email address are taken from ~/.gitconfig.
As a convenience, the -a option can be used to automatically stage files that have been modified and deleted. New files are not staged, though.

To fix the commit message of the last commit:

git commit --amend

To add another file to the last commit, or make additional changes to a file already in the commit:

git add <file>
git commit --amend

See git reset for a more sophisticated example of how to modify the last commit.

git show: Display information about commits and other stuff

Note: The man page for git-show is totally incomplete, for instance it does not show the --name-only option :-(

Display the files that changed in a commit:

git show --name-only 356da73

Also display diffs:

git show 356da73

git tag: Working with tags

git tag -s -m "tagging release 0.8.5" 0.8.5 356da73

Creates a tag named "0.8.5"
The tag refers to commit object 356da73
The message specified by -m is associated with the tag
Using GnuPG, the tag is PGP-signed, using the PGP key that matches the committer's email address; although I have not formally researched this, I presume that the committer's email address would be the one that has been defined in ~/.gitconfig under the option "user.email".
The tag created is an "annotated" tag, i.e. a tag that carries with it more information than just the tag name (in this case the additional information consists of a message, the tagger's name and email, and a PGP signature)

To use a specific PGP key, i.e. not the default one that matches the committer's email address, one has to set the "user.signingkey" option, either in the repository's configuration file, or the global configuration file. For instance:

git config --global user.signingkey 3FF38573   # use the key ID

Set the GIT_COMMITTER_DATE environment variable to create a tag with a given date instead of the current date (useful to backdate date, e.g. after populating a Git repository with content from another SCM). For instance:

GIT_COMMITTER_DATE="2009-06-14 12:58:50" git tag -s -m "tagging release 0.3" 0.3 ed598d1f3d6fac50b67daac2c191798c451cc962

Delete an existing tag:

git tag -d 0.1

Note: If the tag has already been pushed to the server, this must be done both on the client and on the server (a tag-delete cannot be pushed). THIS IS NOT RECOMMENDED!!! See the man page for git-tag for details.

List all tags that exist in the repository:

git tag

Verify the signature of a tag:

git tag -v 0.1

Find tags that contain the given commit:

git tag --contains 356da73

Checkout a specific tag:

git checkout 0.1
git checkout tags/0.1  # in case there is a branch that is also named 0.1

Find out which tag you are on

git describe --tags   # --tags is required to also find tags that are not annotated

git log: Information on the history

This somewhat looks like what I am used from svn log:

git log --name-status

Another abbreviated version of the history:

git log --stat --summary

Commits since v2.5 which modify Makefile:

git log v2.5.. Makefile

Commits between v2.5 and v2.6:

git log v2.5..v2.6

Commits made on the current branch (which is not master), all the way back since it was branched

git log ^master HEAD
git log master..HEAD        # equivalent, but apparently the more common shorthand
git log HEAD --not master   # equivalent, but here it's important to place --not at the end because --not affects all of the subsequent arguments (not just 1)

Working with remote repositories

git clone

Get a copy of a remote/upstream repository:

git clone /path/to/repo

Notes:

The copy is created in the current directory in a folder named "repo"
The same branch is checked out that is currently active in the remote/upstream repository
The origin is set to the remote/upstream repository; it is said that we are tracking that remote/upstream repo; the origin is later going to be used by pull and fetch commands

git pull

Important: Pulling is NOT what you want if you need to get at a branch that was newly created in the remote repository. You first need to create a local tracking branch with either "git branch" or "git checkout".

Pull all changes in the "master" branch from the remote repository that is our origin, into the local repository:

git pull master

Notes:

The changes are not only pulled, but the changes in the remote current branch are also merged immediately into the working tree
It is therefore a good idea to commit local changes before pulling
In addition, the local current branch should somehow match the remote current branch
Instead of pulling, which means an immediate merge, one could first do a "fetch" and then inspect the remote changes

git fetch /path/to/repo master
git log -p HEAD..FETCH_HEAD    # shows remote changes since histories forked
git log -p HEAD...FETCH_HEAD   # shows remote AND local changes since histories forked

git fetch

To fetch the content of a remote branch into a local branch:

git fetch origin work-for-0.4:work-for-0.4

Notes:

"origin" is an alias for a repository URL that has previously been set with "git remote"
On the left-hand side of the ":" is the name of the remote branch
On the right-hand side of the ":" is the name under which the branch should be stored locally.

Warning: Although the local branch is created if it does not exist, the local branch will NOT track the remote branch. Refer to "git branch" or "git checkout" for examples how to create a tracking branch, or convert a non-tracking into a tracking branch.

git push

Push local changes in the currently checked out branch into the remote repository that is tracked by the branch (the default remote is origin):

git push

Tags are not affected by the above command. To also sync tags:

git push --tags

Push local changes for a named repository or branch:

git push origin           # push changes in all branches that track origin
git push origin mybranch  # push changes only in mybranch

The second example above also creates a branch in the remote repository if it did not exist before. However, after the push the local branch is not tracking the remote branch. This can be fixed by using the -u command line option. A tracking branch is useful because in the future you can simply type git push or git pull to sync the local with the remote branch, and vice versa.

git push -u origin newbranch

Local changes can be pushed only if they result in a fast-forward in the remote repository. This is a problem when you have changed history locally (e.g. remove a commit) and want to push these changes. It is possible to force the push by prefixing the branch name with a "+" character:

git push origin +master

git remote: Manage remote ("tracked") repositories

Show a list of existing "remotes", i.e. remote repositories whose branches are tracked in the local repo (the "-v" option tells git to be verbose and also list the remote URL):

git remote -v

Add a new remote named "foo" that points to the repository at the given URL

git remote add foo git://linux-nfs.org/pub/linux/nfs-2.6.git
git remote add foo gitolite-user:scjd.git                         # the server gitolite-user is defined in .ssh/config

Once a remote repository has been set, its content can be fetched:

git fetch foo   # fetch all branches from remote "foo"

Rename a remote

git remote rename old new

Remove a remote

git remote rm foo

Working with local and remote branches

git branch: List/create/delete branches

List all branches that exist:

git branch     # local branches only
git branch -r  # remote branches only
git branch -a  # both local and remote branches

Create a new branch (but don't check it out). The first example splits off at the head of the currently checked out branch, the second splits off at the named commit.

git branch work-for-0.5
git branch work-for-0.5 7a8c9912

Create a new tracking branch, so-called because it is connected to and "tracks" a remote branch. The branch is not checked out!

git branch --track experimental origin/experimental

Connect an existing local branch to an existing remote branch, i.e. convert a local non-tracking branch into a tracking branch:

git branch -u origin/experimental experimental

Rename a branch:

git branch -m old new

Delete a branch:

git branch -d mybranch             # delete locally
git push origin --delete mybranch  # delete remote branch (local branch remains untouched)

git checkout: Switch working tree

Switch to another branch

Change the working tree to point to a different branch:

git checkout newbranch

Create a new branch and check it out immediately. The first example splits off at the head of the currently checked out branch, the second splits off at the named branch, the third creates a tracking branch.

git checkout -b newbranch
git checkout -b newbranch oldbranch
git checkout -b experimental origin/experimental

If you have local changes, the checkout command will fail unless you specify one of the following:

git checkout --merge newbranch     # merge changes
git checkout -f newbranch          # discards changes

Notes:

The merge works regardless of whether the changes have been added to the index or not
Conflicts are not reported in any way, though, you have to detect these by yourself :-(((( A conflicted file will contain markes such as this one: "<<<<<<< master:doc/ChangeLog"
Files that are present but matched by .gitignore are retained - it has not been verified yet if this applies to any file that is not version-controlled

Switch to an earlier commit

To go back in history and get the repository's state as it was in a specific commit:

git checkout 7a8c9912

After this you are no longer on a branch, this can be verified as follows: nargothrond:~/Documents/dev/littlego --> git branch

* (no branch)
  master

To return to HEAD of the master branch:

git checkout master

git merge: Merge changes from another branch

The following example pulls changes from a source branch and merges them with the current HEAD and working tree. If possible, git will do a so-called "fast-forward". Fast-forwarding is nicely explained in this section of the "Pro Git" book.

git merge sourcebranch

If you delete the source branch after fast-forwarding was applied, history will not show that the source branch ever existed. Instead it will appear as if all of the commits were made directly inside the target branch. To make sure that this does not happen, fast-forwarding can be disabled:

git merge --no-ff sourcebranch
git merge --no-ff --no-commit sourcebranch  # same, but do not auto-commit

Without fast-forwarding, history will always reflect that there was a source branch, and which commits were made on that branch, even if the branch itself has been deleted. The integration point will be marked by a merge commit that is a giant cumulative "patch" of all the commits made on the source branch. The drawback is that the "blame" command will now show a source line to have changed in the merge commit, not in the actual originating commit.

The following is similar to --no-ff: All changes in the source branch are squashed together into a single "patch" which is then applied to the target branch. Unlike --no-ff, however, no relationship between the source and target branch is visible in the history after the "patch" has been committed. Once the source branch has been deleted, there will be no record of the individual commits, and it will appear as if the "patch" commit has been developed as a single change.

git merge --squash sourcebranch

Further notes:

Do not merge while you have uncommitted local changes unless you are sure that the merge will not result in conflicts, or you are able to resolve conflicts
If a merge aborts due to a conflict that you cannot (or do not want to) resolve, you can recover by discarding the local changes in the working tree with git reset

git rebase

git rebase re-applies commits, one by one, in order, from your current branch onto another. Most of this section, however, only shows how the function can also be used to modify older commits of the current branch.

Interactive rebase

The --interactive option must be specified so that an editor (e.g. vi) pops up that lets you specify what you want to do, exactly. Presumably without --interactive you need to specify how to do the rebase via command line options. This is not covered here.

Interactive rebase is a multi-step operation:

In the initial command you specify which commmits you want to include in the operation.
In an interactive step you specify which commits you actually want to modify, and what modifications you want to perform.
Optionally you then perform changes on individual commits, and commit the changes to actually modify the commit.
The last step may be repeatable, in that case you have to explicitly state when you are ready to continue with the rebase to the next commit.

Specify which commit should be modified

This command specifies that you want to modify the last 3 commits:

git rebase --interactive HEAD~3

TODO: What does this do, exactly?

git rebase --interactive 96f7a7f^

Specify which commits to modify, and how

After you run the rebase operation an editor pops up that lets you interactively specify which commits to modify, and how.

The editor shows a list of those commits that you have selected for rebasing. The order of the commits is oldest-to-newest, i.e. reverse to what you get when you use git log. This is important when you use the "squash" or "fixup" commands.

Every commit has a command in front of it that tells Git what you want to do with the commit. The default command is "pick" - this simply uses the commit as-is without making any changes to it. Example:

pick 00a37f2 install certbot Debian package
pick ae00f87 letsencrypt staging server account accidentally created
pick 86a3523 add systemd timer for certbot

Reorder commits

One of the simplest things you can do in a rebase is changing the order of the commits: Don't change the "pick" command, just shift the lines around until you are happy, then save and close the editor to perform the reorder.

Since you don't actually modify any of the commits' content you don't have to do anything more, the rebase is already finished.

Change a commit message

Use the "reword" command to modify a commit's commit message. Save the buffer and exit the editor. Another editor instance will immediately pop up that lets you rephrase the commit message. When you are happy with the message, save and close the editor to perform the change.

Since you don't actually modify the commit's content you don't have to do anything more, the rebase is already finished.

Squash commits

Use the "squash" command to merge a commit with the previous commit. Important: The order of commits is reversed to what you're used to from git log. Squashing a commit merges it with the commit above it.

The squashed commit's commit message is added to the previous commit's commit message and you are then prompted in another editor instance to edit the final merged commit's commit message.

Note: Use the "fixup" command instead of "squash" if the log message of the commit to be squashed can be discarded. You will not be prompted to edit the merged commit's commit message.

Since you don't actually modify any of the commits' content you don't have to do anything more, the rebase is already finished.

Edit a commit

Use the "edit" command to modify a commit. Save the buffer and exit the editor. Rebase will now stop just after the commit you want to edit.

Make whatever changes you want, then stage the changes as usual (git add ...), then modify the commit with

git commit --amend

Conclude the rebase (or continue to the next commit to be edited):

git rebase --continue

Split an old commit into several commits

Start the interactive rebase in the usual way so that you include the commit you want to split. Mark the commit you want to split with the "edit" command. Save the buffer and exit the editor.

Rebase will now stop just after the commit you want to edit. Issue this command:

git reset HEAD~

Now commit the pieces individually in the usual way, producing as many commits as you need.

When you're finished, conclude the rebase (or continue to the next commit to be edited):

git rebase --continue

Note: Information in this section comes from this StackOverflow question/answer.

Rebase local commits onto remote commits pushed by someone else

You have a local branch that is tracking a remote branch. You pulled, then started to work, creating some local commits. Someone else did the same, and they were faster than you in pushing. Now when you want to push you must first integrate your version of the code with their version that is already pushed.

If your work and their work is in different areas of the code base, the simplest way to resolve the situation probably is to rebase your commits onto what they have already pushed.

While you have your local tracking branch "foo" with your local commits checked out, run these commands:

# Optionally create a backup of your branch first
git checkout -b foo_backup

# Switch back to your original branch
git checkout foo

# Update all tracking branches, including "foo"
git fetch origin

# Rebase your commits
git rebase origin/foo

If everything goes well, then the rebase should go through without a hitch. If there are conflicts you can either try to resolve them, or abort the rebase. Once you're done you can delete the backup branch "foo_backup" (if you created one).

Conflict handling

Note: Stuff in this chapter has been extracted from the man page of git merge.

Throw away all local changes (e.g. too many conflicts):

git reset --hard

Show different versions of files that are in conflict (usually 3 versions: 1 = common ancestor, 2 = HEAD version, 3 = remote version):

git ls-files -u

Show each one of these three versions of a conflicted file:

git show :1:filename   # common ancestor
git show :2:filename   # HEAD version
git show :3:filename   # remote version

Run graphical merge tool (on Mac OS X usually launches FileMerge via the opendiff cmdline utility):

git mergetool

Remove deleted remote branches and tags from local repository

Deleting a branch in the remote repository does not automatically remove it from a local repository. This must be done manually in two steps.

First, remove the references to all branches that no longer exist the remote repository:

git fetch --prune

Add the --prune-tags option to also remove references to tags that no longer exist:

 git fetch --prune --prune-tags

Second, if the branch still exists locally because you have checked it out in the past, you have to delete it with this command. This is necessary because the pruning command above only removes references, not actual branches.

git branch -d mybranch

Working with partial clones

Shallow clones

The --depth argument can be used when cloning a repository to determine how much of a repository's history should be cloned. This helps with limiting the amount of data that is transferred during cloning. A prime use case for this is during automated pipeline runs when only the most recent commit is needed for building. For example, to get only the most recent commit:

git clone --depth 1 --branch <foo> <repository-url>

Note: Even with a shallow clone it is possible to commit and push to the remote, given that on the remote side no changes occurred since cloning.

Cloning with a filter

The --filter argument can be used when cloning a repository to limit the data that is being transferred during the clone operation. For instance, the following will filter out all blobs (= file contents) when cloning:

git clone --filter=blob:none --branch <foo> <repository-url>

Sparse checkout

Git can be configured to not checkout the full working tree. The sparse checkout behaviour can be configured in two ways:

By directly manipulating the so-called "sparse-checkout file", which is located here: .git/info/sparse-checkout
By issuing a number of git sparse-checkout commands; these modify the sparse-checkout file for you.

To initialize the sparse-checkout file:

git sparse-checkout init

This results in only the files in the repository's root folder to be checked out. As a result, the sparse-checkout file looks like this:

$ cat .git/info/sparse-checkout 
/*
!/*/

The same result can be achieved when cloning a repo, by specifying the --sparse argument:

git clone --sparse [...]

By default, the sparse-checkout file uses the same syntax as .gitignore. Patterns usually define what should be included, but an exclamation mark ("!") at the beginning marks negative patterns that define what should be excluded.

To replace the content of the sparse-checkout file:

git sparse-checkout set <patterns>

To add to the content of the sparse-checkout file:

git sparse-checkout add <patterns>

Note: Not covered in this section are:

Interaction with Git submodules
Use of --cone and cone patterns
Interaction when filtering during a clone, i.e. git clone --sparse --filter=blob:none [...]. The only thing currently known is that this transfers only minimal data.

Other stuff

Generating and applying patches

A good overview is this: http://ariejan.net/2009/10/26/how-to-create-and-apply-a-patch-with-git/

Generating patches from commits

Generate a patch that contains one commit A only:

git format-patch -1 A

Note: The resulting file is placed in the current working directory and named after the first line of the commit message. For instance:

0001-final-changes-for-release-0.1.patch

Write the above patch to a different output directory:

git format-patch -1 A -o /tmp/patchdir

Generate a series of patches from commit A+1 up to HEAD:

git format-patch A -o /tmp/patchdir

Generate a series of patches from commit A up to HEAD:

git format-patch A^ -o /tmp/patchdir

Generate a series of patches from the beginning of history up to commit A:

git format-patch A -o /tmp/patchdir --root

Generate a series of patches from commit A to commit B:

git format-patch A^..B -o /tmp/patchdir

Generating patches from diffs

Patches created with git format-patch are useful to transmit a commit. If you just want a diff between two trees (i.e. a folder diff, or a single file diff, similar to what you get from diff), then use git diff.

Generate a patch that captures the difference between the working tree and the index:

git diff >patch-file

Generate a patch that captures the difference between the working tree version and the index (= staged) version of a single file:

git diff foo.cpp >patch-file

Applying patches

Get an overview of what is in the patch:

git apply --stat /path/to/patch

Test whether the patch applies cleanly. If no errors are printed, the patch applies cleanly.

git apply --check /path/to/patch

Apply the patch (without committing):

git apply /path/to/patch

Apply the patch and generate a "Signed-off-by" tag in the commit message. This tag is read by Github and others to provide useful info about how the commit ended up in the code.

git am --signoff /path/to/patch

Notes:

Working in a single-committer environment, I find the generated tag not so useful
Very useful, however, is that git am automatically uses the comment that is part of the patch file to generate a commit message AND even performs the commit for you. This allows to apply patches very fast - if they apply cleanly
To recover from a patch that did not apply, use this command

git am --abort

Cherry-picking

Cherry-picking is the term used for taking a commit anywhere in the repository and applying its content on top of your current working tree, with the result that a new, unrelated commit is being recorded for those changes.

The basic syntax is this:

git cherry-pick 9584c7c193772a892cb5209860278ea1a5ca3228

If the changes in the referenced commit can be applied cleanly, then a new commit will be created immediately, using the commit message of the original commit. If not you have to resolve the conflicts in the usual way, then invoke:

git cherry-pick --continue

If you get stuck somehow, the cherry-picking operation can be aborted:

git cherry-pick --abort

It can be useful to add a note to the commit message that states where the commit is coming from. Note that this information is useful only if the original commit is publicly visible.

git cherry-pick -x 9584c7c193772a892cb5209860278ea1a5ca3228

To provide a custom commit message:

git cherry-pick -e 9584c7c193772a892cb5209860278ea1a5ca3228

To cherry-pick an entire sequence of commits (for each source commit a new commit will be created):

git cherry-pick 9584c7c193772a892cb5209860278ea1a5ca3228..7c58f24d6aeb9fde7f4bbc047de08951753aa012

Submodules

Documentation

Some basics, and some more basics
A few advanced recipes

Add a new submodule to a project:

git submodule add git://github.com/herzbube/foo.git 3rdparty/foo

This has the following effects:

The remote repo is cloned into the local subfolder 3rdparty/foo.
The submodule name, local path and remote URL are recorded in the .gitmodules file. The .gitmodules file is created if this is the first submodule being added.
The submodule name and remote URL are also recorded in the .git/config file.
The Git repository is cloned into the .git/modules/3rdparty/foo folder.

By default the master branch will be checked out in the submodule. It may be desirable to checkout a different commit, for instance a specific tag:

cd 3rdparty/foo
git checkout 3.1  # or "git checkout tags/3.1", in case there is a branch that is also named 3.1
cd -

Commit the change. The exact commit at which the remote repo was cloned is recorded in the commit.

git commit -m "added submodule foo @ tag 3.1"

Cloning a repo with submodules

git clone ...
git submodule init     # initialize local config file
git submodule update   # check out the submodule's commit that is recorded in the superproject

If a submodule has other submodules, then the "init" and "update" operations both must be performed recursively. A real-world example that requires this is the "modularized boost" repository on GitHub. The following combines the two operations into one command:

git submodule update --init --recursive

Get remote changes into a repo with submodules

git merge origin/master   # merge changes from remote
git submodule update      # also merge changes in the submodule

Making changes to a submodule

cd submodule
# make changes
git commit
cd ..
git add submodule
# record in the superproject that it is now referencing a new submodule commit
git commit

The following commands completely remove a submodule. If the submodule has not been added yet to the superproject's history, some of the steps need to be made manually.

# Removes the section in the .git/config file and empties the folder 3rdparty/foo
git submodule deinit 3rdparty/foo
# Removes the section in the .gitmodules file and removes the empty folder 3rdparty/foo
git rm 3rdparty/foo
# Removes the Git repository clone
rm -r .git/modules/3rdparty/foo

Change the URL of a submodule (recipe from StackOverflow)

# Use any text editor to change the URL
vi .gitmodules
# This command updates the URL in .git/config so that it matches the URL in .gitmodules
git submodule sync

Other notes

A submodule is a full Git repository in its own right. Git commands in the submodule path therefoe operate on the submodule repo and not the superproject repo.
Any changes made in the submodule repo must be "published" (= committed/pushed) BEFORE the superproject changes are pushed, otherwise someone who merges the superproject will get a reference to a non-published commit in the submodule
If you want to make changes in a submodule, it is a good idea to create a branch first, otherwise you will work in a "detached head" environment (i.e. HEAD points directly to a commit, not to a symbolic reference), which may make your commits inaccessible if you merge updates from remote

Convert sub-directory into repository of its own

The following command "rewrites" a repository to look as if sub-directory foo has been its project root, and discards all other history. This effectively turns the sub-directory into a repository of its own. I don't pretend to understand in the least what this command does, but I got the magic from this stackoverflow.com question.

git filter-branch --subdirectory-filter foo -- --all

Important note: This action drastically modifies the repository!!! Perform this only on a clone, or push all changes first, or make a backup first.

Depending on how long the old repository has been in use before it was rewritten, the newly rewritten repository still contains quite a bit of overhead and hidden cruft from the old repository. Although there probably are other and better ways to do this, my way of cleaning up is to clone the newly rewritten repository.

The following is a full transcript of how I extracted my HTB repository from the Tools repository:

cd /tmp
git clone gitolite-user:tools.git   # get a fresh copy of the repository to convert
cd tools
git filter-branch --subdirectory-filter htb -- --all
cd ..
git clone file:///tmp/tools htb
du -sh tools htb
880K	tools
524K	htb

Further cleanup steps:

Get another fresh clone of the original repository and remove the sub-directory that has been extracted. I do this with a simple git rm -r foo. This leaves the sub-directory's history intact, but I am sure there is a way to destroy the history as well.
Add the newly rewritten repository to gitolite. The only problem here is that the rewritten repository already has a remote that is still connected to the original repository - this can be easily resolved by removing the remote first:

cd /tmp/htb
git remote rm origin
git remote add origin gitolite-user:htb.git

Replay commits into a different repository

The following steps are taken verbatim from this Stack Overflow question. I used these commands to setup fuego-on-ios a second time, after I had decided to use svn2git instead of svn git.

cd /path/to/destination/repo
git remote add temp file:///path/to/source/repo
git fetch temp
git checkout temp/master -b wip   # wip = work-in-progress
git rebase master                 # replays commits (rebase onto master)
git checkout master
git merge wip
# Cleanup
git branch -d wip
git remote rm temp

At the moment I have no idea why this works. The mysterious steps are "git rebase master", "git checkout master" and "git merge wip".

Diagnostics & error recovery

Check repository integrity:

git fsck --full

If a packed archive exists (pack files are normally located in GITDIR/objects/pack), extract the single objects within the pack and write them to the current repository (note: a pack file always has an accompanying .idx file whch probably must be present as well):

git unpack-objects </tmp/foo.pack

To see the type of an object (the example object would be located in GITDIR/objects/6c/8cae4994b5ec7891ccb1527d30634997a978ee):

git cat-file -t 6c8cae4994b5ec7891ccb1527d30634997a978ee

To see the content (pretty-printed) of an object with ID ID:

git cat-file -p ID

To see the content of a tree object with object ID T (is equivalent to the "cat-file" command if the object is a tree):

git ls-tree T

To see the content of a tree object that belongs to commit with object ID C:

git ls-tree C

To recursively list the content of a tree object (note: it is important to specify the -r option in front of the tree object ID, otherwise git will interpret the option as a pattern to match):

git ls-tree -r 6c8cae4994b5ec7891ccb1527d30634997a978ee

Recreate a tree object from ls-tree formatted text:

cd ~/git/backups/foo.git
git ls-tree 6c8cae4994b5ec7891ccb1527d30634997a978ee >/tmp/lstree.txt
cd ~/git/recovery/foo.git
git mktree </tmp/lstree.txt

Show information about a commit with object ID C:

git show C

Recreate a commit object from tree with object ID T, linking it to the parent commit object with object ID C (note that author name, email and date are taken from environment variables, or from configuration file items):

git commit-tree T -p C </tmp/changelog

Print out the object ID that a file would get if it were made into a blob:

git hash-object <doc/README

Recreate a blob object (and print its ID):

git hash-object -w <doc/README

Find out branch creator

The following command should be useful to find out who is probably the creator of a remote Git branch. In the list that is printed, the person who made the first (oldest) commit is probably the one who also pushed that commit and therefore the one who created the branch on the remote Git server.

git log --format='%h   %ci   %<(20)%cn   %s' remotes/origin/<source-branch>..remotes/origin/<branch-name>

Format options (described also on man page of git log)

%h = Abbreviated commit hash
%ci = Committer date, ISO 8601-like format
%<(20) = Reserve 20 characters for the next placeholder, padding it
%cn = Committer name
%s = Subject, i.e. first line of commit message

Working with Subversion

General information

The command to interact with a Subversion repository is

git svn

A Git repository that is connected to a Subversion repository stores the link in

.git/config

Cloning and tracking an upstream Subversion repository

This section shows how to clone a 3rdparty software project's Subversion repository, and how to track the project's upstream progress on an ongoing basis, but actually maintaining my own modifications in a separate branch of a Git repository that I control.

The following commands clone the upstream Subversion repo into a local Git repo. A word of warning: This clones the entire Subversion repo, including all branches and tags. The operation therefore might take quite a while, depending on the size of the upstream repo.

mkdir fuego-on-ios
cd fuego-on-ios
git svn init --stdlayout http://svn.code.sf.net/p/fuego/code/ .
git svn fetch

Notes:

This creates a master branch that tracks the upstream trunk
This also creates many remote-tracking branches, one for each tag and branch in the upstream repo. git branch -a lists them, for instance:

* master
  remotes/EGC2008
  remotes/OLYMPIAD2008
  remotes/VERSION_0_1_FIXES
  remotes/VERSION_0_2_FIXES
  remotes/VERSION_0_3_FIXES
  remotes/VERSION_0_4_1_FIXES
  remotes/VERSION_0_4_FIXES
  remotes/VERSION_1_FIXES
  remotes/tags/EGC2008_1
  remotes/tags/OLYMPIAD2008_1
  remotes/tags/PAMPLONA_2009
  remotes/tags/UEC_CUP_2013
  remotes/tags/VERSION_0_1
  remotes/tags/VERSION_0_1_1
  remotes/tags/VERSION_0_2
  remotes/tags/VERSION_0_2_2
  remotes/tags/VERSION_0_3
  remotes/tags/VERSION_0_3_1
  remotes/tags/VERSION_0_3_2
  remotes/tags/VERSION_0_4
  remotes/tags/VERSION_0_4_1
  remotes/tags/VERSION_1_0
  remotes/tags/VERSION_1_1
  remotes/trunk

The main integration branch that holds my own modifications cannot be named "master" because git svn has already taken that name for itself. For this reason I like to create a branch that is named after the repository:

git branch fuego-on-ios

Integrating upstream changes into a tracking Git repository

Download upstream revisions to the local object database, but do not create Git commits:

git svn fetch

Integrate upstream revisions into the current branch:

git svn rebase

Notes:

Performs git svn fetch first, i.e. downloads upstream revisions. To skip this step, i.e. to rebase only revisions that are already fetched: git svn rebase --local
Creates Git commits from all outstanding upstream revisions
Replays (rebases) all local commits that have not been committed back to the upstream Subversion repository on top of the latest Subversion revision commit
Performs a fast-forward if no local changes exist (the ideal case!)

Reconnecting a Git repository with upstream Subversion repository after a clone

(the solution to the following problem is a glorified copy of this Stack Overflow answer)

The problem:

Machine 1: You create a local Git repository that tracks an upstream Subversion repository
Machine 1: You make the local Git repository public, e.g. you push it to GitHub
Machine 2: You clone the public repository
Machine 2: The cloned repository is no longer connected to the upstream Subversion repository. For instance, it is not possible to say git svn info or sync with upstream with git svn rebase. The error message you receive is this:

Unable to determine upstream SVN information from working tree history

The first thing to fix is to add some necessary entries to .git/config. In the original Git repository that was created with git svn clone there is a section like this:

[svn-remote "svn"]
	url = http://svn.code.sf.net/p/fuego/code
	fetch = trunk:refs/remotes/trunk
	branches = branches/*:refs/remotes/*
	tags = tags/*:refs/remotes/tags/*

We need to replicate this section in the cloned repository. This can be done either by manually editing the .git/config file, or by issuing a number of commands:

git config svn-remote.svn.url http://svn.code.sf.net/p/fuego/code
git config svn-remote.svn.fetch trunk:refs/remotes/trunk
git config svn-remote.svn.branches branches/*:refs/remotes/*
git config svn-remote.svn.tags tags/*:refs/remotes/tags/*

This is not yet enough, though, git svn info still produces the error message from above. What we still need to do is to setup the remote named "trunk". In the original Git repository there is a file such as this:

cat .git/refs/remotes/trunk 
3318343b099ae9649fadfa5dd53a87adff095ed7

So we need to create the same file in the cloned repository with an appropriate hash:

In the cloned repository, look at the output of git log when you are on the master branch. Note down the hash of the most recent commit that represents a Subversion commit.
Create the file .git/refs/remotes/trunk and add a line to it with the hash you noted down in the previous step.

The final step is to restore the contents of the .git/svn folder. I prefer to do this with

git svn info

but other "git svn" commands such as git svn fetch should do as well.

External Tools

DiffMerge

DiffMerge is a freely available (though not open source) visual diff and merge tool. If the diffmerge command line utility was installed, DiffMerge can be integrated as merge tool into Git using the following configuration:

git config --global merge.tool diffmerge
git config --global mergetool.diffmerge.cmd 'diffmerge --merge --result="$MERGED" "$LOCAL" "$(if test -f "$BASE"; then echo "$BASE"; else echo "$LOCAL"; fi)" "$REMOTE"'
git config --global mergetool.diffmerge.trustExitCode true

To also configure DiffMerge as diff tool:

git config --global diff.tool diffmerge
git config --global difftool.diffmerge.cmd 'diffmerge "$LOCAL" "$REMOTE"'

GitHub

Overview

Signing up with GitHub provides a free (for open source projects) public place to host Git repositories. A few general notes:

To be allowed to commit, an account needs to be associated with one or more public SSH keys
Old versions of the GitHub API required the use of a secret API token if applications wanted to do special things on GitHub. This has changed with v3 of the GitHub API, the new way uses OAuth tokens.

Local Git configuration

GitHub requires "user.name" and "user.email" entries to be in your Git settings file ~/.gitconfig. The following commands will add the entries if they are not yet present:

git config --global user.name "Billy Everyteen"
git config --global user.email "me@here.com"

Note: Old versions of the GitHub API required the presence of "github.user" and, optionally, "github.token". Since the release of the GitHub API v3 these entries are no longer necessary.

Local SSH configuration

Add the following snippet to your ~/.ssh/config file:

Host github.com
UseKeychain yes
AddKeysToAgent yes
IdentityFile ~/.ssh/foo.id_rsa

Cloning a GitHub repository

git clone git@github.com:herzbube/reponame.git

Adding and removing branches to a GitHub repository

I did not find a way how to create or delete a branch using GitHub's web interface. Presumably the idea is that this must be done locally and then pushed to GitHub.

Create a new branch locally, then push it to GitHub. The push automatically creates the branch upstream. In addition, use -u to let the local branch track the remote branch.

git branch newbranch
git push -u origin newbranch

Delete a branch locally, then do the same remotely on GitHub:

git branch -d mybranch
git push origin --delete mybranch

Creating a GitHub repository with the content of an external repository

This task was needed after I had decided to move my Little Go repository from my own server to GitHub, including all branches, history and tags.

First create the desired repository on GitHub, then run the following commands. Note that this may not work with older versions of git - I don't recollect which version is the minimum, but if you have trouble, try the alternate command sequence further down. The following commands should work with newer versions of git (git 1.8 should do).

git clone --bare http://git.herzbube.ch/littlego.git
cd littlego.git
git push --mirror git@github.com:herzbube/littlego.git
cd ..
rm -rf littlego.git
git clone git@github.com:herzbube/littlego.git

This alternative command sequence should also work, but here you have to take care that you push all branches.

git clone --bare gitolite-user:littlego.git
cd littlego.git
git remote rm origin
git remote add origin git@github.com:herzbube/littlego.git
git push -u origin master
git push -u origin develop
git push -u origin --tags

Add a patch to a project where you don't have write access

Fork the project
Clone the project locally
Make changes, commit & push back to GitHub
On GitHub navigate to the forked project, then at the top of the screen click the button "Pull Request" (not the link "Pull Requests" which will display requests for your forked repository)
GitHub help on pull requests has all the details
Once the request has been sent, an issue will be created for the target (original) repository. The commits that were included in the pull requests are attached to the issue.

Sync a forked repo

The following section is based on this GitHub help section.

Syncing a forked repo with the upstream repo is done locally, i.e. it is not possible to do this directly in the GitHub web interface.

The first step is to locally clone the forked repo. The local clone will now have remotes labelled (by default) "origin" that point to the forked repo on GitHub. You can check the currently configured remotes like this:

git remote -v

The next step is to add a remote to the local clone that points to the upstream repo. The remotes are labelled "upstream" in the following example.

git remote add upstream https://github.com/original_owner/original_repository.git

Next, retrieve the content of the upstream repository into your local clone. Note that "upstream" is the label we used for the remote in the previous example.

git fetch upstream

Finally, merge changes in upstream branches into your forked branches. For the "master" branch this looks like this:

# Switch to forked "master"
git checkout master
# Merge
git merge upstream/master

Repeat for all branches that you want to sync. Obviously, to make the changes visible on GitHub you must then push them:

git push

GitHub Pages

GitHub Pages provides a convenient and easy way to create user and project websites. Everything is nicely documented here: https://help.github.com/categories/20/articles.

The help articles are none too obvious about what to do if you just want a standalone project page accessible under a custom subdomain. For instance, I wanted to have a page for the "Little Go" project to be accessible under littlego.herzbube.ch. It's actually very simple:

Add a CNAME for the subdomain to DNS and let it resolve to pages.github.com. For instance, I added the CNAME littlego.herzbube.ch.
Create the project's gh-pages branch. For Little Go, I went to the project's settings page and under "GitHub Pages" clicked the button "Automatic Page Generator". This triggers a wizard that you need to step through to create the branch. You can also choose from among several very nice layouts and the wizard will populate the branch with the files necessary to display your page in that layout.
Add a file CNAME to the gh-pages branch. The file's content is the name of the subdomain that the page should be accessible under. In my example this is "littlego.herzbube.ch"

Git

References

GUI clients

Creating and configuring local repositories

git init: Administration of repositories

Working Tree vs. Working Copy

Configuration file

Ignoring files

Complaining about whitespace

Basic operations

git add: Adding files/directories or making changes to existing files/directories

git mv: Renaming or moving files/directories

git rm: Removing files/directories

git status/diff: See local changes

git reset: Undo changes

git clean: Remove untracked files from the working tree

git stash: Temporarily stash all local changes

git commit: Make changes to the repository

git show: Display information about commits and other stuff

git tag: Working with tags

git log: Information on the history

Working with remote repositories

git clone

git pull

git fetch

git push

git remote: Manage remote ("tracked") repositories

Working with local and remote branches

git branch: List/create/delete branches

git checkout: Switch working tree

Switch to another branch

Switch to an earlier commit

git merge: Merge changes from another branch

git rebase

Interactive rebase

Specify which commit should be modified

Specify which commits to modify, and how

Reorder commits

Change a commit message

Squash commits

Edit a commit

Split an old commit into several commits

Rebase local commits onto remote commits pushed by someone else

Conflict handling

Remove deleted remote branches and tags from local repository

Working with partial clones

Shallow clones

Cloning with a filter

Sparse checkout

Other stuff

Generating and applying patches

Generating patches from commits

Generating patches from diffs

Applying patches

Cherry-picking

Submodules

Convert sub-directory into repository of its own

Replay commits into a different repository

Diagnostics & error recovery

Find out branch creator

Working with Subversion

General information

Cloning and tracking an upstream Subversion repository

Integrating upstream changes into a tracking Git repository

Reconnecting a Git repository with upstream Subversion repository after a clone

External Tools

DiffMerge

GitHub

Overview

Local Git configuration

Local SSH configuration

Cloning a GitHub repository

Adding and removing branches to a GitHub repository

Creating a GitHub repository with the content of an external repository

Add a patch to a project where you don't have write access

Sync a forked repo

GitHub Pages

Navigation menu

Search