FreeBSD Google Summer of Code Ideas
This page lists FreeBSD project ideas for Google Summer of Code (GSoC). If you plan to apply to work on a FreeBSD GSoC project, you should contact one or more mentors as soon as possible to get feedback. We want to ensure projects have a reasonable chance of being successfully completed.
Contents
-
FreeBSD Google Summer of Code Ideas
- How to use this idea list
- Skill levels
- Kernel Projects
-
Userland projects
- Port virtual_oss to base
- UFS4fuse: support FreeBSD's UFS2 with fusefs
- syzkaller improvements
- Capsicumization of the base system
- Network Configuration Libraries
- IPv6 support and cleanup of address family dependency in userland utilities
- Speed up the FreeBSD boot process
- Make installer more useful
- Port of libc SIMD enhancements to other architectures
- Tools
How to use this idea list
This list of project ideas should be regarded as inspiration, not an exclusive list. You can propose any FreeBSD-related project you want, but whether you apply using your own project idea or one here, we want to see that you thoroughly researched the project in coordination with one or more potential mentors and that you have a clear plan. If you are interested in working with a mentor who is not listed here, you can contact FreeBSD developers using one of these methods.
Send an email to the <freebsd-hackers AT FreeBSD DOT org> mailing list.
Join the #freebsd-soc IRC channel on the Libera.Chat network. We use this channel for communication during both the application period and the coding period. You can join the channel either using an IRC client like irssi, or using Libera.Chat's web client. You can read about how to connect to the Libera.Chat network. Due to timezone differences, there may be delays before questions are answered, so please be patient.
If you do not get a response on IRC or on the mailing list, you can email <soc-admin AT FreeBSD DOT org>.
We also have a few pages that list other project ideas: Ideas Page, Junior Jobs, and Wanted Ports. Note that many of these ideas may not be suitable GSoC projects as-is but may provide inspiration.
If you need even more inspiration, see SummerOfCode for projects from previous years.
Skill levels
Starting in 2024, projects can be small (~90 hours), medium (~175 hours), or large (~350 hours) and last from 8 weeks to 12 weeks or be extended up to a 22-week work term. Choose a project that you feel you can realistically complete within one of those timelines. Again, mentors can help you gauge how much effort and experience a given project may require.
C: For C-related ideas, at least some experience coding in C is necessary. Generally, it is easier to work on the userland than the kernel. Kernel programming and debugging are harder and can involve skills ranging from understanding operating system internals to computer architecture.
Kernel: The FreeBSD kernel is written in C, so at least a basic proficiency in C is required for a kernel project. Various levels of other skills may be necessary, but you can certainly assume that knowing how to do "Hello world" in the kernel and knowing how to recompile the kernel should be an essential starting point.
Network: Understanding TCP/IP will be required for these projects. It would be beneficial if you understood socket API for any programming language and understood how sockets work.
Security: For these tasks, you must have a general understanding of security issues, the UNIX permission model, and the basics of cryptography and its applications.
Scripting: For scripting projects, unless otherwise stated, we won't enforce a specific scripting language on you. However, there are certain factors to consider. If you are willing to work with /bin/sh (+awk, sed, etc.) or Lua, there is a chance that whatever you produce could be committed to the base system. If you pick something like bash, which is not part of FreeBSD's base system, your tool can only be considered for the ports tree, which is a repository with instructions to build third-party software for FreeBSD.
Kernel Projects
Unified kernel tracing interface
[last updated: 2024-03-15]
Mentor |
|
Skills |
C (advanced), Kernel |
Mid-term deliverable |
Implementation available for amd64 |
Duration |
350 hours |
Difficulty |
Hard |
Expected Outcome |
Existing consumers converted to use the new interface |
Description
The FreeBSD kernel contains several subsystems which add hooks to core pieces of the kernel. For example, the context switch code in the scheduler contains this snippet:
if (td != newtd) { #ifdef HWPMC_HOOKS if (PMC_PROC_IS_USING_PMCS(td->td_proc)) PMC_SWITCH_CONTEXT(td, PMC_FN_CSW_OUT); #endif SDT_PROBE2(sched, , , off__cpu, newtd, newtd->td_proc); #ifdef KDTRACE_HOOKS /* * If DTrace has set the active vtime enum to anything * other than INACTIVE (0), then it should have set the * function to call. */ if (dtrace_vtime_active) (*dtrace_vtime_switch_func)(newtd); #endif td->td_oncpu = NOCPU; cpu_switch(td, newtd, mtx); cpuid = td->td_oncpu = PCPU_GET(cpuid); SDT_PROBE0(sched, , , on__cpu); #ifdef HWPMC_HOOKS if (PMC_PROC_IS_USING_PMCS(td->td_proc)) PMC_SWITCH_CONTEXT(td, PMC_FN_CSW_IN); #endif
There are three hooks that may be called before switching into the new thread, and two hooks that may be called after the switch. They are used by DTrace and HWPMC to collect information about context switch events. These hooks are disabled most of the time, but each hook introduces overhead even when disabled since we must check whether it is enabled each time the code is executed.
The goal of the project is to identify useful kernel tracepoints, and design and implement a unified interface that can be used by different consumers to collect information about the event corresponding to a particular tracepoint. This would make it easier for new subsystems to collect information from existing tracepoints, rather than modifying the core kernel to add more hooks. An additional aim would be to ensure that such tracepoints have minimal overhead, probably by using hot-patching to enable and disable a particular tracepoint. FreeBSD-CURRENT now uses clang 10.0, which implements the "asm goto" construct that could be useful here.
Implement MPLS support for FreeBSD
[last updated: 2023-01-27]
Mentor |
Alexander Chernikov <melifaro AT FreeBSD DOT org> |
Skills |
C (intermediate), networking (intermediate) |
Mid-term deliverable |
MPLS forwarding works on static labels |
Duration |
350 hours |
Difficulty |
Medium |
Expected Outcome |
MPLS forwarding, encap/decap works, MPLS labels can be programmed via Netlink |
Multiprotocol Label Switching (MPLS) is an overlay network technology based on labels instead of IP addresses. The project would be to bring basic MPLS support for FreeBSD. Roughly, the implementation can be split into the following chunks:
- create MPLS routing tables, analogous to AF_INET[6] routing tables
- add the logic to handle MPLS packets (mpls_input(), mpls_forward(), mpls_output()) at various stages
- add the code to perform MPLS encap for IPv4/IPv6 routes, extending nexthops functionality
- add Netlink support for working with IP-MPLS, MPLS-MPLS and MPLS-IP routes
- Add userland support for working with MPLS routes to route(8) and netstat(8)
- [stretch] add fast fib lookup module to enable high-performance concurrent label lookups
The OpenBSD's MPLS stack or the NetBSD's MPLS stack overviews of other *BSD implementations can provide more datapoints.
Improve netgraph concurrency
[last updated: 2023-01-27]
Mentor |
Alexander Chernikov <melifaro AT FreeBSD DOT org> |
Skills |
C (intermediate), networking (intermediate) |
Mid-term deliverable |
Traffic is able to pass in a lockless fashion in 2-node netgraph topology |
Duration |
350 hours |
Difficulty |
Medium |
Expected Outcome |
Traffic is able to pass in a lockless fashion between ng_<ppp|car|tee|iface>, compatibility retained with non-converted nodes |
Netgraph is a traffic-processing subsystem based on the dynamically configured graph of nodes and directed edges. Each node apply a single specific manipulation to the packet. The core idea is similar to VPP. Currently netgraph uses topology lock and node/hook atomic refcounts to ensure safe passing of the packets between the nodes. The goal of the project is to make passing data between the nodes lockless. The necessary primitives like epoch-based object reclamation and lockless datastructures are available in the base system. The rough implementation sketch may look like the following:
- Enable delayed reclamation of netgraph hooks and nodes under existing NET_EPOCH
Make core API like ng_address_hook() leverage existing NET_EPOCH and avoid taking topology locks / refcounts
- Test the implementation with a number of stateless nodes like ng_patch or ng_tee and ng_ipfw
- Evaluate and convert nodes on per-node basis on their reliance on the topology lock
Add IPv6 scoped-address support in in-kernel ONC RPC and NFS
[last updated: 2024-01-31]
Mentor |
Hiroki Sato <hrs AT FreeBSD DOT org> |
Skills |
C (intermediate), Kernel (intermediate) |
Duration |
175 hours |
Difficulty |
Medium |
Expected Outcome |
NFS works with IPv6-scoped addresses |
FreeBSD’s NFS and other implementations relying on ONC RPC do not work with non-global IPv6-scoped addresses (e.g., link-local) because the RPC universal address format specified in RFC 5665 is used and it does not include the scope ID. This project aims to eliminate this limitation. The required steps are as follows:
- Implement a function handling text representation of an IPv6 address with the scope ID and replace inet_ntop() with it.
- Add support for IPv6-scoped address format to conversions between a taddr and a uaddr in the kernel.
Implement native OpenBFS for FreeBSD
[last updated: 2024-07-03]
Mentor |
Pedro Giffuni <pfg AT FreeBSD DOT org> |
Skills |
C (intermediate), filesystems (intermediate) |
Mid-term deliverable |
Basic read-only support |
Duration |
350 hours |
Difficulty |
Hard |
Expected Outcome |
BeOS file system support |
The Be File System was developed by Dominic Giampaolo and Cyril Meurillon to provide BeOS with a modern 64-bit-capable journalling file system. The project would be to bring BFS support for FreeBSD, perhaps considering some reuse of the open implementation made by Haiku-OS.
Userland projects
Port virtual_oss to base
[last updated: 2024-01-24]
Mentor |
Christos Margiolis <christos AT FreeBSD DOT org> |
Skills |
C (advanced), FreeBSD programming environment (intermediate), DSP (moderate), kernel (basic) |
Duration |
175 hours |
Difficulty |
Medium |
Expected Outcome |
virtual_oss part of the base system |
virtual_oss is an audio server written by Hans Petter Selasky for FreeBSD's sound system, OSS (Open Sound System), which provides support for (de-)multiplexing audio devices, switching audio devices on the fly, as well as bluetooth sound, among other features. FreeBSD currently lacks a built-in audio server, since virtual_oss is only provided as a port.
The goal of the project is to successfully port virtual_oss to the base system and integrate it in programs and scripts that might benefit from using it.
Even though this project is largely about integration, virtual_oss uses third-party libraries, namely libsamplerate and fftw3, which will most likely need to be replaced by rolling our own code in order to make virtual_oss completely standalone.
UFS4fuse: support FreeBSD's UFS2 with fusefs
[last updated: 2024-01-29]
Mentor |
Alan Somers <asomers AT FreeBSD DOT org> |
Skills |
C++ or Rust (expert), fusefs (expert) |
Mid-term deliverable |
TBD |
Duration |
350 hours |
Difficulty |
Hard |
Expected Outcome |
Read-only fusefs-based UFS2 |
FreeBSD's UFS is arguably the longest standing UNIX file system still under active development. It is well documented and has many interesting features. While UFS has been ported to Linux and Mac OSX, the ports are not very good or have been deprecated. Using fusefs as a userland option would let the filesystem be used on many other systems. Existing FreeBSD headers and code can be reused but a fresh design using modern C++ or Rust constructs would be interesting.
This project is challenging: any proposal must include a detailed coding plan and a very motivated contributor.
syzkaller improvements
[last updated: 2020-03-04]
Mentors |
|
Skills |
golang (intermediate), kernel (intermediate) |
Duration |
350 hours |
Difficulty |
Hard |
Expected Outcome |
Complete FreeBSD syzkaller extenstions described below |
syzkaller is a suite of tools that performs coverage-guided system call fuzzing. Originally targeting Linux, it can now fuzz many different operating system kernels and has been extremely effective at finding bugs, many with security implications. It creates, monitors and fuzzes a set of virtual machines running the target kernel. More information can be found in its documentation, and in these slides. Google kindly runs a public syzkaller instance targeting FreeBSD.
For a while it has been possible to run syzkaller on FreeBSD; that is, fuzz FreeBSD on FreeBSD. syzkaller makes use of ZFS and bhyve (or QEMU) to do so. This makes development and testing of FreeBSD-specific syzkaller features much easier.
Though syzkaller has found quite a few bugs in FreeBSD, it does not cover as much as it does on Linux, so it is virtually guaranteed that there are plenty of bugs waiting to be found. This project consists of several subtasks that would improve FreeBSD's coverage:
- Extend syzkaller's FreeBSD system call descriptions. For each system call, syzkaller requires a set of annotations that describe the system call's arguments. It is missing many of FreeBSD's system calls. syzkaller similarly needs to be taught about device-specific ioctls.
- Support fuzzing of FreeBSD's Linux system call compatibility layer. Since syzkaller can of course fuzz Linux natively, it should be straightforward to run a Linux fuzzer on FreeBSD.
- Support external injection of USB traffic.
- Support running the fuzzer in a jail, optionally with various resource limits in place.
- Test syzkaller with a ZFS root filesystem instead of UFS. Work with the syzkaller developers to get a FreeBSD+ZFS target running in syzbot.
Port support for automated patch testing and crash bisection to FreeBSD. Some details are listed here.
- Work with the project mentor to triage and possibly fix any kernel bugs found in the process.
Note: contributing to syzkaller requires signing the Google CLA. Please make sure that this is acceptable before attempting this project. Note also that working with syzkaller is probably easiest on a dedicated hardware system with a reasonably large amount of disk space (several hundred GB should be sufficient), ideally running FreeBSD on ZFS. syzkaller can instantiate VMs on Google Compute Engine and fuzz them, so this may be an option as well. Please contact the project mentors for details.
Capsicumization of the base system
[last updated: 2020-03-02]
Mentor |
Mariusz Zaborski <oshogbo AT FreeBSD DOT org> |
Co-Mentor |
Mark Johnston <markj AT FreeBSD DOT org> |
Skills |
C (intermediate), familiarity with the UNIX programming environment |
Mid-term deliverable |
Sandbox a few of the target applications |
Duration |
350 hours |
Difficulty |
Medium |
Expected Outcome |
Sandbox the complete list of target applications |
Capsicum is a sandboxing technology used in FreeBSD. It is complemented by Casper, a framework for defining application-specific capabilities. A large number of utilities in the FreeBSD base system have been converted to run under Capsicum and Casper, but many programs have yet to be converted. The project would consist of identifying several high-profile daemons and utilities in the base system or ports, and modifying them to run in capability mode. One good candidate is syslogd, the system logging daemon.
As part of this work it may be necessary or useful to add additional Casper services to the base system. For example, we do not yet have a Casper service which allows an application to make outbound network connections.
Note: Converting applications to run under Capsicum/Casper can take a lot of effort, especially when they are large or when they are not designed with privilege separation in mind. Some applications, like a shell, can't really be run in capability mode at all. Before attempting to sandbox a given application, take care to study the ways in which it interacts with the system. For example, does the application need to open any files? If so, are the file names statically defined or are they derived from user input? This will provide insight into the difficulties that will arise when sandboxing the application.
Network Configuration Libraries
[last updated: 2024-01-29]
Mentor |
Allan Jude <allanjude AT FreeBSD DOT org> |
Co-Mentor |
Christos Margiolis <christos AT FreeBSD DOT org> |
Skills |
C (intermediate), knowledge of networking and NAT |
Mid-term deliverable |
Library to configure IPFW NAT to allow a bhyve VM guest to reach the Internet |
Duration |
350 hours |
Difficulty |
Medium |
Expected Outcome |
A library to manage NAT configuration for VMs and Jails |
Deliverables: A simple tool to configure the network on a laptop to allow a bhyve VM to access the internet.
Use Cases:
- A bhyve VM running on a laptop NAT'd out the laptop's wifi connection
- A bhyve VM bridged to the hosts Ethernet network
Stretch goal: Extend the tool to support configuring network access for standard and VIMAGE jails
Build a libipfw to enable programmatic configuration of the firewall, implement basic functionality to add rules and configure NAT instances.
Optional: relocate most functionality available in the command line interface into the library and replace the replace the command line interface with a thin wrapper around the new library.
Build a libbsdnat that can be used by bhyve VM managers and Jail management tools to configure NAT on the host to allow the guest access to the internet via the host's network. This library will then be extended to handle creating 'port forwarding' rules to expose services in the guest to the public network via the host. Network mappings will be ephemeral and will need to be recreated by the management tools when the VM or jail is restarted.
Possible design for final tool:
- libifconfig - used to create and manage bridge interface for bhyve, epairs for jails
- libbsdnat - Configure NAT for outbound, and port forward rules for inbound.
- libipfw - Used to insert rules routing traffic via above NAT instances.
netlink - Used for network stack configuration
IPv6 support and cleanup of address family dependency in userland utilities
[last updated: 2024-02-11]
Mentor |
Hiroki Sato <hrs AT FreeBSD DOT org> |
Skills |
C (intermediate), networking (intermediate) |
Duration |
175 hours |
Expected Outcome |
More userland utilities with better IPv6 support |
Many of our tools are not IPv6 "clean", such as these tools:
- rwhod(8)
- Various yp(8) daemons
- rpc.rquotad(8)
This project could also include a broader survey of other network services in /usr/bin and /usr/sbin to make sure they're all IPv6 clean. Other possible work could involve
- remove/rework the about 200 gethostby*() calls all in the base system and contrib to use getaddrinfo().
- make more code to conditionally compile in INET or INET6 support.
Speed up the FreeBSD boot process
[last updated: 2024-01-12]
Mentor |
Colin Percival <cperciva AT FreeBSD DOT org> |
Skills |
C (intermediate), sh (intermediate), kernel (beginner) |
Mid-term deliverable |
Targets for boot speedup identified and some addressed |
Duration |
175 hours |
Difficulty |
Medium |
Expected Outcome |
FreeBSD will boot faster |
Using the TSLOG framework and one or more systems running FreeBSD, profile the boot process and identify targets for improving boot performance. This is likely to involve delving into multiple different parts of FreeBSD, ranging from the kernel to daemons launched from rc.d scripts; there are likely a large number of places where improvements can be made, and work will be guided by the amount of time which could potentially be saved and the likely complexity of making improvements. In many cases, a student will need to collaborate with other FreeBSD developers familiar with different parts of the system.
Make installer more useful
[last updated: 2024-01-30]
Mentor |
Pierre Pronchery <pierre AT FreeBSDFoundation DOT org>, Warner Losh <imp AT FreeBSD DOT org> |
Skills |
C (intermediate), sh (intermediate) |
Mid-term deliverable |
Part of the improvements |
Duration |
175 hours |
Difficulty |
Medium |
Expected Outcome |
FreeBSD Installer Disk will allow installing packages |
Many people use the FreeBSD installation media to repair malfunctioning systems, test out compatibility and other unconventional uses. The current installer has fallen behind what Linux systems can do in these areas, in no small part because people can add incrementally to the equivalent Linux system. In addition, pkgbase is coming, where the base system will be packaged into bits installable by pkg, FreeBSD's package manager. To those ends, allowing packages to be installed (either locally or via the network) would increase the flexibility of FreeBSD's installation media.
Port of libc SIMD enhancements to other architectures
[last updated: 2024-01-31]
Mentor |
Robert Clausecker <fuz AT FreeBSD DOT org> |
Skills |
C (intermediate), assembly (advanced) |
Mid-term deliverable |
Part of improvements |
Duration |
350 hours |
Difficulty |
Medium |
Expected Outcome |
Enhancement of all/most libc string functions on a new architecture. |
In 2023, the FreeBSD Foundation sponsored work to reimplement the libc string functions in assembly for better performance. As a result of this work, almost all string functions were reimplemented for amd64 with SIMD techniques, leading to a 5x performance improvement on average. However, this effort currently only benefits users of the amd64 architecture.
This project would entail taking these improvements and expanding or porting them to one or more other architectures, such as arm64, riscv64, powerpc64, or powerpc64le. Previous experience with assembly programming is a strict requirement for this project, but the mentor would help you get familiar with the use of SIMD techniques for string processing.
Tools
Improve LLDB on FreeBSD
[last updated: 2024-03-12]
Mentor |
|
Co-Mentor |
Ed Maste <emaste AT FreeBSD DOT org> |
Skills |
shell scripting (intermediate), Lua (intermediate), debugger use (basic) |
Mid-term deliverable |
crashinfo invokes LLDB and extracts backtrace, subset of Lua bindings enabled |
Duration |
350 hours |
Difficulty |
Medium |
Expected Outcome |
LLDB Lua bindings at feature parity with Python bindings, crashinfo script successfully uses in-tree debugger, lldb |
The LLVM debugger, lldb, is part of the FreeBSD base system. This project aims to extend lldb on FreeBSD in two ways.
1. Improve lldb Lua bindings
lldb supports scripting in C++, Python, and Lua. The Lua bindings are a more recent addition, and some features that would make them much more usable on FreeBSD are missing. The first goal of this project is to improve Lua binding support, add documentation, and bring the Lua bindings to parity with the Python bindings.
Example tasks:
- Enable Lua bindings to create new LLDB commands
- Generate Lua documentation similar to the Python documentation
- Write a tutorial for scripting LLDB with Lua
Add convenience methods to support common FreeBSD kernel debugging tasks from Lua, such as those described in https://freebsdfoundation.org/wp-content/uploads/2019/01/Debugging-the-FreeBSD-Kernel.pdf
2. Integration lldb into crashinfo
FreeBSD provides a script, /usr/sbin/crashinfo, which runs after a system (kernel) crash and extracts information from the core dump to store in a text file. See the crashinfo man page for more information. crashinfo currently makes use of the GNU debugger, gdb, which must be installed from the package collection or ports tree. The second goal of this project will be to add lldb support to crashinfo, providing the same information that crashinfo's gdb integration provides. (gdb support should be retained in the script and used when gdb is available).
Linuxulator on powerpc64
Mentor
Justin Hibbits <jhibbits AT FreeBSD DOT org>, Brandon Bergren <bdragon AT FreeBSD DOT org>
Skills
C (intermediate), kernel (intermediate)
Duration
175 hours
Difficulty
Medium
Expected Outcome
Simple Linux binary support on FreeBSD/powerpc64
FreeBSD provides optional binary compatibility with Linux, allowing users to install and run unmodified Linux binaries. The Linux binaries can be started in the same way native FreeBSD binaries can. They behave almost exactly like native processes and can be traced and debugged the usual way. The "linuxulator" is available for the i386, amd64, and arm64 architectures. This aim of this project is to add preliminary linuxulator support for FreeBSD/powerpc64.