simplex-chat/readme.md

# Federated chat system with E2EE and low risk of MITM attack

## Problems

The problems of existing chat solutions this system intends to solve:

- Dependency on a single company/server to access and use chat. That creates implications for chat privacy (see E2EE), resilience and data ownership.
- Existing [E2EE](https://en.wikipedia.org/wiki/End-to-end_encryption) implementations in chat platforms are easily compromised via the platform itself with [MITM attack](https://en.wikipedia.org/wiki/Man-in-the-middle_attack) - as the key exchange happens via the same channel, the key can be substituted.
- Dependency on DNS (e.g. with XMPP protocol). While DNS is useful for shared information access, connecting chat users to particular domains undermines users privacy and gives control over chat users communications to domain owners.
- Dependency on phone numbers. Most phone numbers either expose or allow to access private information about the user, including personal identity.
- User names. They must be unique, but they can violate existing trademarks and be reclaimed by trademark owners, lead to name squatting and can leak private information about the users.
- Using any user identifiers to route messages (whether usernames, phone numbers or DNS-based addresses) exposes the list of user connections to the chat system.
- Visiblity of user profile information to chat system and other chat users.


## Solution

- No phone numbers, user names and no DNS are used to identify to users (chat servers may choose to be accessible via DNS, but it is not required)
- Each user is connected to multiple servers to ensure message delivery even if some of the servers are compromised
- Using standard assymetric encryption protocol, so system users can create independent server and client implementations
- Open-source server implementations that can be easily deployed by any user with minimal expertise (e.g. on Heroku via web UI).
- Open-source mobile client implementations (including web client) so that system users can independently assess system security model.
- Only client applications store user profiles, connections to other user profiles and messages; servers do NOT have access to any of this information (and unless compromised, do NOT store even encrypted messages).
- Multiple client applications and devices can be used by each user profile to communicate and to share connections and message history.
- Key exchange and establishing connections between user profiles is done by sharing QR code via any independent communication channel (or directly via screen and camera), system servers are NOT used for key exchange - to reduce risk of key substitution in MITM attack. QR code contains the connection-specific public key and user's current servers.
- Servers do NOT communicate with each other, they only communicate with client applications.
- Unique public key is used for each user profile connection in order to:
  - reduce the risk of attacker posing as user's connection
  - avoid exposing all user connections to the servers
- Unique public key is used to identify each connection participant to each server.
- Public keys used between connections are regularly rotated to prevent decryption of the full message history ([forward secrecy](https://en.wikipedia.org/wiki/Forward_secrecy)) in case when some servers or middle-men preserve message history and the current key is compromised.
- Users can repeat key exchange using QR code and alternative channel at any point to increase communication security.
- No single server in the system has visibility of all connections or messages of any user, as user profiles are identified by multiple rotating public keys, using separate key for each profile connection.
- Servers only store hashes of public keys, so that nobody can encrypt messages from a pending user profile connection while the connection is established, other than the user who has this public key received via QR code (in addition to that, messages are signed once key exchange is done both ways).
- User profile (meta-data of the user including non-unique name / handle and optional additional data, e.g. avatar and status) is stored in the client apps and is shared only with accepted user profile connections.


## System components

- chat servers
- chat client applications


## System functionality

### Chat server

Chat servers can be either available to all users or only to users who have a valid server key.

Chat servers have to provide the following:

- receive messages to connected user profiles
- send messages to client apps of connected recepients
- identify recepients by hashes of public keys, unique for each recepient-sender connection (one-directional)
- verify recepients via signatures using recepients' server public keys for each connection.
- verify senders signatures using senders' server public keys for each connection (provided by recepient).
- servers never exchange messages with other servers


### Chat client application

Chat applications are installed by users on their devices

Client apps should provide the following:

- create and manage user profiles - user can have multiple profiles.
- share access to all or selected user profiles with other devices (optionally including existing connections and message histories).
- for each user profile:
  - generate and show QR code with connection-specific public key and profile servers that can be shown on the screen and/or sent via alternative channel.
  - track whether QR code was shared (assuming lower connection security if it wasn't read directly from the screen).
  - read QR code (via the camera) to establish connection with another user.
  - receive and accept connection requests (it would contain, in encrypted form (using recepient public key that was previously shared), requesting user public key to use to encrypt messages and user's servers).
  - exchange user profiles data once connection is accepted.
  - send messages to connected user profiles (message would contain hash of connection-specific public key of the recepient and encrypted message) via the servers that the recepient is connected to.
  - receive messages from connected user profile (message will be decrypted in the client using recepient's private key associated with the sender) via the servers that the recepient is connected to.
  - optionally define the servers they trust and will connect to, including the servers that require server keys to access.
  - store history of all conversations encrypted using user client app key with passphrase (or some other device specific encryption method).


## System design

Prepared with [mermaid-js](https://mermaid-js.github.io/mermaid-live-editor)


### Connections between user profiles

The system uses connections between user profiles (no information identifying users is stored on the servers) to send and to receive messages. Each connection is one-directional, from sender to recepient, without any correlation between connections and user profiles, and between 2 connections used to send and to receive messages to/from the same participants.

This design aims to reduce the risk of correlating communication of a user profile with other user profiles (to keep the list of connections private), and of correlating sent and received messages between the same participants.

Sockets used to send and receive connections can be used by an attacker for such correlation; to avoid it user profiles can be configured to use separate sockets for each one-directional connection (and in case of limited number of sockets, they can be closed and new sockets can be opened to avoid the possibility of any correlation, other than on network level). The probability of network level correlation could be further reduced with onion routing of messages (but it is not in scope).

Each one-directional connection is identified by two key pairs for each participant. Each connection participant registers on the server \<connection public key hash\> and \<server public key\> for each connection.

All keys are generated by the user. Server keys are used for communication between server and user (for server and user to sign and to encrypt). Connection keys are used for E2EE between users (for sender to encrypt using recepient's connection-specific public key and to sign using sender's connection-specific private key). For all keys, client app sends to all user's  servers \<server public key\> and \<connection public key hash\>.

For each connection two keys allow to subscribe to receive messages from connection (by sending \<connection public key hash\> signed by \<server private key\> - server will be able to match it with connections database and accept such request to deliver messages from connection via the subscribed socket) and to decrypt server-side encryption of messages sent from (and separately encrypted by) connection using the \<server private key\>.

Each connection also allow another user to send messages to connection (by signing messages, including  \<connection public key hash\>, with \<server private key\> - server will be able to match it with connections database and accept such messages) and to decrypt any service messages and responses using the \<server private key\>.

In this way the server will only have the list of connections via which messages can be sent or received (identified by \<connection public key hash\>), associated public keys to authenticate senders and recievers (\<server public key\>), with a separate server public key for each connection participant. The attacker will not be able to establish that any list of connections belongs to the same user (other than by analysing socket or network traffic).


### Establishing connection between user profiles

![Adding connection](/diagrams/connection.svg)


### Sending message

![Sending message](/diagrams/message.svg)
docs: readme 2019-12-21 13:06:18 +00:00			`# Federated chat system with E2EE and low risk of MITM attack`

			`## Problems`

			`The problems of existing chat solutions this system intends to solve:`

			`- Dependency on a single company/server to access and use chat. That creates implications for chat privacy (see E2EE), resilience and data ownership.`
			`- Existing [E2EE](https://en.wikipedia.org/wiki/End-to-end_encryption) implementations in chat platforms are easily compromised via the platform itself with [MITM attack](https://en.wikipedia.org/wiki/Man-in-the-middle_attack) - as the key exchange happens via the same channel, the key can be substituted.`
docs: updated readme 2019-12-22 10:27:57 +00:00			`- Dependency on DNS (e.g. with XMPP protocol). While DNS is useful for shared information access, connecting chat users to particular domains undermines users privacy and gives control over chat users communications to domain owners.`
docs: readme 2019-12-21 13:06:18 +00:00			`- Dependency on phone numbers. Most phone numbers either expose or allow to access private information about the user, including personal identity.`
			`- User names. They must be unique, but they can violate existing trademarks and be reclaimed by trademark owners, lead to name squatting and can leak private information about the users.`
docs: updated readme 2019-12-22 10:27:57 +00:00			`- Using any user identifiers to route messages (whether usernames, phone numbers or DNS-based addresses) exposes the list of user connections to the chat system.`
			`- Visiblity of user profile information to chat system and other chat users.`
docs: readme 2019-12-21 13:06:18 +00:00

			`## Solution`

docs: updated readme 2019-12-22 10:27:57 +00:00			`- No phone numbers, user names and no DNS are used to identify to users (chat servers may choose to be accessible via DNS, but it is not required)`
			`- Each user is connected to multiple servers to ensure message delivery even if some of the servers are compromised`
			`- Using standard assymetric encryption protocol, so system users can create independent server and client implementations`
			`- Open-source server implementations that can be easily deployed by any user with minimal expertise (e.g. on Heroku via web UI).`
			`- Open-source mobile client implementations (including web client) so that system users can independently assess system security model.`
			`- Only client applications store user profiles, connections to other user profiles and messages; servers do NOT have access to any of this information (and unless compromised, do NOT store even encrypted messages).`
			`- Multiple client applications and devices can be used by each user profile to communicate and to share connections and message history.`
			`- Key exchange and establishing connections between user profiles is done by sharing QR code via any independent communication channel (or directly via screen and camera), system servers are NOT used for key exchange - to reduce risk of key substitution in MITM attack. QR code contains the connection-specific public key and user's current servers.`
			`- Servers do NOT communicate with each other, they only communicate with client applications.`
docs: readme 2019-12-21 13:06:18 +00:00			`- Unique public key is used for each user profile connection in order to:`
			`- reduce the risk of attacker posing as user's connection`
			`- avoid exposing all user connections to the servers`
docs: updated readme 2019-12-22 10:27:57 +00:00			`- Unique public key is used to identify each connection participant to each server.`
			`- Public keys used between connections are regularly rotated to prevent decryption of the full message history ([forward secrecy](https://en.wikipedia.org/wiki/Forward_secrecy)) in case when some servers or middle-men preserve message history and the current key is compromised.`
docs: readme 2019-12-21 13:06:18 +00:00			`- Users can repeat key exchange using QR code and alternative channel at any point to increase communication security.`
			`- No single server in the system has visibility of all connections or messages of any user, as user profiles are identified by multiple rotating public keys, using separate key for each profile connection.`
docs: updated readme 2019-12-22 10:27:57 +00:00			`- Servers only store hashes of public keys, so that nobody can encrypt messages from a pending user profile connection while the connection is established, other than the user who has this public key received via QR code (in addition to that, messages are signed once key exchange is done both ways).`
docs: readme 2019-12-21 13:06:18 +00:00			`- User profile (meta-data of the user including non-unique name / handle and optional additional data, e.g. avatar and status) is stored in the client apps and is shared only with accepted user profile connections.`


			`## System components`

docs: updated readme 2019-12-22 10:27:57 +00:00			`- chat servers`
			`- chat client applications`
docs: readme 2019-12-21 13:06:18 +00:00

			`## System functionality`

			`### Chat server`

			`Chat servers can be either available to all users or only to users who have a valid server key.`

			`Chat servers have to provide the following:`

docs: updated readme 2019-12-22 10:27:57 +00:00			`- receive messages to connected user profiles`
			`- send messages to client apps of connected recepients`
			`- identify recepients by hashes of public keys, unique for each recepient-sender connection (one-directional)`
			`- verify recepients via signatures using recepients' server public keys for each connection.`
			`- verify senders signatures using senders' server public keys for each connection (provided by recepient).`
			`- servers never exchange messages with other servers`
docs: readme 2019-12-21 13:06:18 +00:00

			`### Chat client application`

			`Chat applications are installed by users on their devices`

			`Client apps should provide the following:`

			`- create and manage user profiles - user can have multiple profiles.`
			`- share access to all or selected user profiles with other devices (optionally including existing connections and message histories).`
			`- for each user profile:`
docs: updated readme 2019-12-22 10:27:57 +00:00			`- generate and show QR code with connection-specific public key and profile servers that can be shown on the screen and/or sent via alternative channel.`
			`- track whether QR code was shared (assuming lower connection security if it wasn't read directly from the screen).`
docs: readme 2019-12-21 13:06:18 +00:00			`- read QR code (via the camera) to establish connection with another user.`
docs: updated readme 2019-12-22 10:27:57 +00:00			`- receive and accept connection requests (it would contain, in encrypted form (using recepient public key that was previously shared), requesting user public key to use to encrypt messages and user's servers).`
docs: readme 2019-12-21 13:06:18 +00:00			`- exchange user profiles data once connection is accepted.`
docs: updated readme 2019-12-22 10:27:57 +00:00			`- send messages to connected user profiles (message would contain hash of connection-specific public key of the recepient and encrypted message) via the servers that the recepient is connected to.`
			`- receive messages from connected user profile (message will be decrypted in the client using recepient's private key associated with the sender) via the servers that the recepient is connected to.`
			`- optionally define the servers they trust and will connect to, including the servers that require server keys to access.`
			`- store history of all conversations encrypted using user client app key with passphrase (or some other device specific encryption method).`
docs: sending message diagram 2019-12-21 14:08:57 +00:00

docs: update connection diagram to include user-server authentication and encryption 2019-12-21 22:20:02 +00:00			`## System design`
docs: sending message diagram 2019-12-21 14:08:57 +00:00
docs: adding connection diagram 2019-12-21 15:04:51 +00:00			`Prepared with [mermaid-js](https://mermaid-js.github.io/mermaid-live-editor)`

docs: update connection diagram to include user-server authentication and encryption 2019-12-21 22:20:02 +00:00
			`### Connections between user profiles`

docs: updated readme 2019-12-22 10:27:57 +00:00			`The system uses connections between user profiles (no information identifying users is stored on the servers) to send and to receive messages. Each connection is one-directional, from sender to recepient, without any correlation between connections and user profiles, and between 2 connections used to send and to receive messages to/from the same participants.`
docs: update connection diagram to include user-server authentication and encryption 2019-12-21 22:20:02 +00:00
			`This design aims to reduce the risk of correlating communication of a user profile with other user profiles (to keep the list of connections private), and of correlating sent and received messages between the same participants.`

docs: updated readme 2019-12-22 10:27:57 +00:00			`Sockets used to send and receive connections can be used by an attacker for such correlation; to avoid it user profiles can be configured to use separate sockets for each one-directional connection (and in case of limited number of sockets, they can be closed and new sockets can be opened to avoid the possibility of any correlation, other than on network level). The probability of network level correlation could be further reduced with onion routing of messages (but it is not in scope).`
docs: update connection diagram to include user-server authentication and encryption 2019-12-21 22:20:02 +00:00
			`Each one-directional connection is identified by two key pairs for each participant. Each connection participant registers on the server \<connection public key hash\> and \<server public key\> for each connection.`

docs: updated readme 2019-12-22 10:27:57 +00:00			`All keys are generated by the user. Server keys are used for communication between server and user (for server and user to sign and to encrypt). Connection keys are used for E2EE between users (for sender to encrypt using recepient's connection-specific public key and to sign using sender's connection-specific private key). For all keys, client app sends to all user's servers \<server public key\> and \<connection public key hash\>.`
docs: update connection diagram to include user-server authentication and encryption 2019-12-21 22:20:02 +00:00
docs: updated readme 2019-12-22 10:27:57 +00:00			`For each connection two keys allow to subscribe to receive messages from connection (by sending \<connection public key hash\> signed by \<server private key\> - server will be able to match it with connections database and accept such request to deliver messages from connection via the subscribed socket) and to decrypt server-side encryption of messages sent from (and separately encrypted by) connection using the \<server private key\>.`
docs: update connection diagram to include user-server authentication and encryption 2019-12-21 22:20:02 +00:00
docs: updated readme 2019-12-22 10:27:57 +00:00			`Each connection also allow another user to send messages to connection (by signing messages, including \<connection public key hash\>, with \<server private key\> - server will be able to match it with connections database and accept such messages) and to decrypt any service messages and responses using the \<server private key\>.`
docs: update connection diagram to include user-server authentication and encryption 2019-12-21 22:20:02 +00:00
			`In this way the server will only have the list of connections via which messages can be sent or received (identified by \<connection public key hash\>), associated public keys to authenticate senders and recievers (\<server public key\>), with a separate server public key for each connection participant. The attacker will not be able to establish that any list of connections belongs to the same user (other than by analysing socket or network traffic).`


			`### Establishing connection between user profiles`
docs: adding connection diagram 2019-12-21 15:04:51 +00:00
			`![Adding connection](/diagrams/connection.svg)`

docs: update connection diagram to include user-server authentication and encryption 2019-12-21 22:20:02 +00:00
docs: sending message diagram 2019-12-21 14:08:57 +00:00			`### Sending message`

docs: adding connection diagram 2019-12-21 15:04:51 +00:00			`![Sending message](/diagrams/message.svg)`