Source code
File Formats | > | Electronic File Formats | > | Source code |
Source code is the program code of a programming language as stored in a computer's memory or in a file or other storage medium (programs have been stored on cassettes, punched cards, and many other media). Except in interpreted languages (like BASIC) which execute the program directly from the source, source code needs to be compiled or assembled into executables in the target machine code (possibly passing through intermediate stages of object code needing to be linked or code in some intermediary language that is in turn compiled, assembled or interpreted).
Most of the time, program source code is stored as plain text (in a character encoding), so it can be viewed or edited in any text viewer or editor, though programmer-oriented development environments offer enhanced features such as language-specific syntax highlighting and integrated access to compilers. However, there are also some specialized source code formats that do not use plain text, instead doing some sort of tokenization to the keywords and syntactic elements of the language. This was more common on early computers that had much more limited memory, disk space, and bandwidth than the present ones.
Non-text-based source code formats
- Tokenized BASIC (.bas)
- Apple Integer BASIC
- Applesoft BASIC
- Atari BASIC
- Commodore BASIC
- GW-BASIC / BASICA (IBM PC and compatibles)
- TRS-80 BASIC
Text-based source code formats
The language can usually be identified by the file extension.
- asm - Assembler
- s - Assembler
- sh - Bourne shell script
- c - C
- cc - C
- h - C
- cpp - C++
- cxx - C++
- cs - C#
- j - Java
- jav - Java
- java - Java
- js - JavaScript
- m - Matlab
- pas - Pascal
- pcl - PCL -- DEC TOPS-20 Programmable Command Language
- pl - Perl
- pm - Perl
- php - PHP
- py - Python
- pyc - Python
- pyo - Python
- pyd Python