Errfig Aymen

GET NEXT LINE: A 42 Project to Learn How to Deal with File Descriptors and System I/O

In C programming, managing input, output, and memory effectively is fundamental. get_next_line is a 42 project built to teach these concepts: you write a function that reads a file line by line through a file descriptor. Each call to the function returns the next line of the file, so you can process the entire content one line at a time.

Understanding File Descriptors and I/O in a System

What is a File Descriptor?

A file descriptor is a non-negative integer that identifies an open file within a process. When a program opens a file, the operating system returns a file descriptor that the program then uses to refer to that file in subsequent operations, such as reading, writing, or closing it. File descriptors are an abstraction the operating system uses to manage various I/O resources, including files, sockets, and pipes.

Each process has its own set of file descriptors: descriptors 0, 1, and 2 (standard input, standard output, and standard error) in Process A are independent of the descriptors with the same numbers in Process B. Thanks to this isolation, closing or reassigning a descriptor in one process does not change the descriptors of another.

file descriptor table

Each file descriptor indexes an entry in the process's file descriptor table, and that entry leads to the information the kernel keeps about the open file. This includes which file it is, the access mode it was opened with, and the current offset, which tracks the position within the file for read/write operations. This structure allows the operating system to manage multiple open files efficiently and ensure correct access and data manipulation.

Note that file descriptors 0, 1, and 2 are reserved by convention for the standard streams. File descriptor 0 is standard input (stdin), which typically represents input from the keyboard. File descriptor 1 is standard output (stdout), which represents output to the screen or terminal. File descriptor 2 is standard error (stderr), which also goes to the screen or terminal but is specifically intended for error messages. These reserved descriptors let basic input and output work consistently across different programs and environments. Because open returns the lowest-numbered descriptor that is not currently in use, a file opened while the three standard streams are still open gets descriptor 3 or higher and does not conflict with them.

how to open a file

example
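#include <fcntl.h>   /* open, O_RDONLY */
#include <stdio.h>   /* perror */
#include <unistd.h>  /* close */

int main(void)
{
    /* Open example.txt read-only; open returns -1 on failure. */
    int fd = open("example.txt", O_RDONLY);
    if (fd == -1) {
        perror("Error opening file");
        return 1;
    }
    close(fd); /* release the descriptor when done */
    return 0;
}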
code breakdown

A file descriptor, represented as an integer, is obtained with the open function, which takes two parameters here: the file name (or path) and flags that determine the access mode. For example, to read a file's content, we use the O_RDONLY flag (read-only). To read and write, we use the O_RDWR flag. While there are many flags available, we will use only O_RDONLY for this project. If the operation succeeds, open returns a non-negative integer, which is the file descriptor; otherwise, it returns -1 to indicate an error (for example, the file does not exist or you don't have permission to access example.txt). Note that both the open function and the access flags are declared in fcntl.h, while read and close are declared in unistd.h.

reading from a file descriptor

example
#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

#define BUFFER_SIZE 4

int main(void)
{
    const char *ordinal[] = {"1st", "2nd", "3rd", "4th", "5th"};
    char buffer[BUFFER_SIZE];
    int fd = open("example.txt", O_RDONLY);
    if (fd == -1) {
        perror("Error opening file");
        return 1;
    }
    for (int i = 0; i < 5; i++) {
        /* Read at most 3 bytes, keeping one slot for the terminating '\0'. */
        ssize_t n = read(fd, buffer, sizeof(buffer) - 1);
        if (n == -1) {
            perror("Error reading file");
            close(fd);
            return 1;
        }
        buffer[n] = '\0'; /* read does not null-terminate the buffer */
        printf("%s call : %s\n", ordinal[i], buffer);
    }
    close(fd);
    return 0;
}
breakdown

code result (assuming example.txt contains HELLO WORLD with no trailing newline)

1st call : HEL
2nd call : LO
3rd call : WOR
4th call : LD
5th call :

The read function, declared in unistd.h, reads data from a file descriptor. It takes three parameters: the file descriptor, a buffer to store the data, and the maximum number of bytes to read. It returns the number of bytes actually read, which can be fewer than requested; it returns 0 at end of file and -1 on error.

The entry behind each file descriptor records an offset, which keeps track of the current position within the file. Every time the read function is called, it reads data starting from the current offset and then advances the offset by the number of bytes read. This ensures that subsequent reads continue from where the last read left off.

how offset works

In our example:

  • The first call to read starts at the beginning of the file (offset 0), reads the first 3 bytes, and stores them in the buffer. The offset is then updated to 3.
  • The second call to read reads the next 3 bytes starting from the updated offset (3), then updates the offset to 6.
  • The third and fourth calls continue in the same way until the last bytes of the file have been read.
  • The fifth call finds no data left: read returns 0, which indicates end of file, and writes nothing into the buffer.

This process continues until all the data has been read from the file or an error occurs. Because read does not add a terminating '\0', the program must null-terminate the buffer itself after each read before printing it as a string.

THE PROBLEM

char *get_next_line(int fd) takes a file descriptor as its parameter and returns one line per call. When it reaches the end of the file, it returns NULL.

Parameters

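  • fd: File descriptor of the file to read from.
  • BUFFER_SIZE: The size of the buffer used to read chunks from the file. It is not an argument of the function but a constant fixed when the code is compiled, like the #define in the reading example.

Your program should have no memory leaks.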

Solution :

https://github.com/Its-JoeTheKing/get_next_line
