Документ взят из кэша поисковой машины. Адрес оригинального документа : http://www.arcetri.astro.it/science/Radio/alma/Report_4_2002.pdf
Дата изменения: Fri Apr 11 15:09:30 2003
Дата индексирования: Sat Dec 22 07:48:57 2007
Кодировка:

Поисковые слова: п п п п п п п п
Atacama Large Millimeter Array Demultiplexer design based on a custom GaAs chip
G. Collodi1,2 , G. Comoretto2 Arcetri Technical Report n. 4/2002

1)Mecsa Dipartimento di Elettronica e Telecomunicazioni ­ v. Lombroso 7 - Firenze 2) INAF - Osservatorio Astrofisico di Arcetri ­ largo E. Fermi 5 - Firenze

Arcetri Technical Report n. 4/2002 Firenze, June 2002


Summa ry
In the ALMA sampler the input data, sampled at 4 Gs/s, is transmitted to the fiber optic formatter in a parallel demultip lexed data stream composed of 16 x 250 MHz samples. Several different approaches have been proposed for the demultiplexer, as no commercial units are available at this frequency. We propose a scheme based on a custom digital GaAs chip. The chip architecture, the global demultiplexer architecture and results from simulations are presented.

Introduction
In the ALMA antenna backend electronics, signals from the receivers are converted to eight IF channels, each one covering 2 GHz of bandwidth in the 2-4 GHz range. Each IF signal is subsequently sampled at 4 GS/s, 3 bit, and sent to the correlator on a fiber link. The 8 IF channels are organized as 4 IF pairs, usually representing the two polarizations of the same RF band. Each pair is processed by a sample r. The sampler subsystem accepts two signals in the band 2-4 GHz, and delivers a signal on three optic fibers, at 10 GS/s. The sampled signal is time -demultiplexer at 250 MS/s before being formatted for fiber transmission. At the moment, there is not a clear proposal for the demultiplexer. Several schemes have been proposed, involving commercial deserializers for SONET standard[12], delay lines, or commercial discrete chips[9]. In either case, the demultiplexer must be synchronous with the sampler clock[13]. In particular, commercial chips for deserializing optical data present the problem that the demultiplexer clock is generated internally or derived from the data, and cannot be easily syncronized to an external reference. In this report we propose a scheme based on a custom GaAs demultiplexer chip, designed to overcome this problem. This approach presents several advantages in term of power consumption and system integration, and eases the problems of synchronizing the demultiplexer clock.

Global design
The sampler subsystem is composed of two sampler units and a fiber formatter. Each sampler unit accepts a signal at approximately 0 dBm, and must provide to the formatter 48 bits of data at 250 MHz, synchronous with the 125 MHz global clock. The sampler accepts a sampling clock, nominally sinusoidal at 0 dBm, with a frequency of 4 GHz and a variable phase, and a demultiplexer clock, obtained dividing the 4 GHz clock. The phase of the demux clock may vary in a limited range (nominally +/ -130 ps), following the 4 GHz clock, and its frequency is 250 MHz (1/16 demultiplexing). Both these clocks have to be generated in a single unit for each antenna, and are common to all samplers. Each sampler unit is composed by various subunits, that process the input signal in sequence. The global schematics for a single unit is shown in Fig. 1. The signal is sampled by an optional track/hold unit[5], digitized to three bits[4, 6, 7], and then parallelized to 16 signals at 250 MHz. Auxiliary circuits must distribute clock and power supplies to all the units, control the threshold levels in the digitizer, monitor the sampled signal statistics and provide some crude test signal. A single high speed FPGA (e.g. XCV2 family) can integrate all the control functions, and resynchronize the demux outputs to the system clock. In our proposal the optional T/H, the sampler chip and the demux are hosted in a multichip module, together with supply filters, clock power splitter, and termination resistor chips. The multichip substrate must have good electric properties up to 12 GHz, and provide a good thermal path for the high speed chips. Each chip can be provided either as a packaged component or as a bonded die. The first is preferable for ease of integration (especially for mass production), the latter for thermal and electric considerations. The module can be mounted on a conventional PCB, that hosts the supply regulators, threshold control circuits and the FPGA for control and monitor. Since the subunits composing the sampler are closely related, their interrelations are important and must be carefully considered in the system design. A proposal for the complete sampler system design will be the subject of a forthcoming report.


Fig. 1: Block diagram of a sampler unit Deserializer chip
Each bit from the sampler is independently deserialized by a single chip unit. The relative synchronization of the three bits is guaranteed by the common clock signals, and by the reduced dimensions of the multichip module. The block schematic of the chip is shown in fig. 2. It is composed by two sets of 16 identical D master-slave flip -flops, organized as a 4 GHz shift register ad a 250 MHz parallel latch. The circuit operates internally with custom logic levels, the input and output signals being converted by appropriate buffers. This gives full freedom on the logic levels to use. In particular, the sampler output levels are still to be decided, either as LVDS or LVPECL.

Fig. 2: Functional diagram of the deserializer chip


The 4 GHz clock is given as a 0 dBm sinusoidal tone, to avoid problems due to harmonic distortion. A passive splitter provides this signal for all subunits in the multichip module. Relative phasing must be determined to ensure proper setup and hold times between units. The 250 MHz clock can be provided in parallel as a LVDS signal, terminated externally after the last demultiplexer chip. The chip requires 1.2V for the logic core (shift register, latch), and 2.5V for the input/output buffers.

Technology
The process used to implement the demux is the ED02AH process from OMMIC. The main characteristic of the process are summarized in the following table

ED02AH Main Characteristics Interconnect. Met. layer(s): Poly layer(s): Spec. process char.: 2 No Pseudomorphic HEMT on GaAs 0.2 mm MMIC technology Frequencies 30 to 100 GHz Depletion and enhancement transistors Ft=63 GHz and NF=0.9 dB at 12 GHz Diodes, resistors, capacitors, selfs, air bridges, via holes Transistor grid parallel with x axis. Fixed die sizes: Die sizes are fixed and have the following: 1500 mic x 1000 mic 1500 mic x 2000 mic 3000 mic x 1000 mic 3000 mic x 2000 mic. Available I/O: Temp. range: Yes Max recommended: 150° C.

The process is accessible through Circuit Multi Project (CMP) which is available also for small scale production (thousand of units) Future development using CMOS technology, not available now, has been taken into account and will follow in a year.

Circuit Topology :
The demultiplexer has been implemented with a Master Slave D Flip Flop with a 4 GH z clock. Each flip flop make use of 8 NOR gates. Different topology has been evaluated in order to implement the NOR function [10]. In particular Source Coupled FET Logic (SCFL), Buffered FET Logic (BFL) and Super Buffered FET Logic (SBFL) has been taken into account. The performance of all these different topologies has been evaluated on the basis of simulations and a comparison of the results has been done. SCFL, BFL and SBFL NOR gates have all shown good performance in terms of speed and immunity from noise, but the first two have shown an elevate power consumption. Moreover the NOR gate using the BFL approach has also a large surface occupation due to the level shifter (it needs at least one diode to make the output of the gate compatible with the input of the following one). SBFL NOR topology has been the only one capable to match both the low power consumption and small surface occupation specifications, as well as high speed performance. Moreover the output of the gate does not need to use a buffer or level shifter but is fully compatible with the input of the gate built in the same technology (VOH =VIH and VOL=VIL). This translates in a simple layout design,


without the need of a diode such as in SCFL or BFL, and in a significant reduction in power consumption. The only drawback of this topology is the use of both enhancement and depletion devices in the same circuit. Schematics of SBFL NOR gate is shown in Fig. 3. Vdd is 1.2 V. and the gate uses positive logic with VOH = 0.7V and VOL = 0V.

Fig.3 ­ SBFL NOR Gate. Simulation Results:
As highlighted in the previous paragraph many different simulations have been carried out in order to evaluate the behavior of the whole system, as well as that of the elemental circuits. In particular, until now, both the performance of a single flip -flop and a of 1:8 complete demultiplexer have been tested. Simulations have been carried out using the SPICE Time Domain Simulator and the related OMMIC ED02AH Library. Simulations have shown that the D Master Slave Flip Flop implemented using SBFL NOR gate has a delay time of 110-120 pS, a setup time of 80 pS and a hold time of ­ 30 pS. These performances allow for a maximum clock frequency of 5GHz. With a 4GHz Clock the Flip Flop has a power consumption of about 7 mW.

Fig.4 ­ 1:8 DEMUX Simulation Results


A complete set of simulations for 1:8 demux with a pseudorandom input sequence has been carried out and the results are shown in Fig. 4. Simulations confirm an optimum behavior of the whole system. At the moment the buffer circuits for input and output matching with the LVDS Standard as well as the clock generator for 4GHz and 250 MHz clock were not included in the simulations. The power consumption of the 1:8 demux is 100 mW. Estimated power consumption for a LVDS buffer is 18 mW, and thus the total power budget for a 1-bit 1/16 demux is (18*16)+200mW = 480 mW.

Layout:
Layout of NOR gate is shown in Fig.5. NOR gate layout has been designed using the Ohmic Contact Sharing Technique (OEST) to minimize the surface. The elemental cell has the following dimension 100 x 70 µm2

Fig. 5 ­ NOR Gate Layout
Using the block diagram illustrated in Fig. 6 for the Flip Flop implementation a surface occupation of 250x350 µm for each flip flop has been estimated.

Fig. 6 - Flip Flop layout block diagram.
This results in a die area of 2000 x 1400µm2 for the shiftregister/latch, and 3000 x 2000 µm2 for the whole 1:16 demux including clock generator , buffer and level shifter.


Control FPGA
The output FPGA performs several tasks: · · · · Synchronizes the samples form the demultiplexer to a fixed-phase clock, derived internally from the 125 MHz system clock. Performs some statistics (mean, variance) of the sampled data, making them available to the standard interface (SPI AMBSI2 bus) Uses these data to adjust the sampler thresholds for symmetry /amplitude Drives the relatively long cable to the formatter unit

A single VIRTEX -II FPGA seems adequate to perform these tasks. The SPI interface uses a standard macro developed 2 by the ALMA team. The statistics analyzer computes x and x for each parallel sample using lookup tables, and integrate them using ripple counters. On every tick of the 48 ms clock, counts are latched and summed together. These values, compared to the theoretical ones, can be used to dynamically adjust the sampler threshold values.

Alternate design
At the moment, we are considering, besides the standard architecture of a direct 1:16 demultiplexer, a 1:8 demultiplexer followed by a final 1:2 stage in the FPGA. This option reduces the costs of the custom chips and the interconnectio n problems, at the expense of tighter speed requirements for the FPGA. A 1/8 demultiplexer requires a 500 MHz clock, instead of 250 MHz. A dual data rate structure can be used, together with a 250 MHz clock, to implement in a simple way the 1:2 demux (Fig . 7). This clock can be generated internally by the FPGA, using its built -in PLL locked to the system 125 MHz clock. Using this architecture, however, timing margins become very critical. Some testing is thus required.

Fig. 7: Two stage demultiplexer: 1:8 GaAs chip followed by a 1:2 demux in the FPGA Costs and time schedule
Using the layout considerations done previously, it seems reasonable to fit the whole 1-bit demultiplexer in a 2x3 mm die. Costs for prototyping 10 dice (unbounded) is 7128 Euro (VAT in cl). For small scale production, costs are roughly reduced by 40%, i.e. 430E/chip. The approximate cost for the 3-bit demux, including packaging, is around 1500 Euro. The cost for the multichip module, the PCB and the control FPGA has not been evaluated yet. If the 1:8 demux architecture is used, a smaller chip (1x1.5 mm) can be used, and the cost (and power consumption) is halved. The increase in FPGA complexity is marginal, and the reduced pin count may cause a further cost reduction. The next foundry run for this process has a submission deadline at September 2002, and expected delivery around April 2003.


References:
[1.] Andersson, L.I.; Rudberg, B.G.R.; Lewin, P.T.; Reed, M.D.; Planer, S.M.; Sundaram, S.L. "Silicon bipolar chipset for SONET/SDH 10 Gb/s fiber-optic communication links" - Solid -State Circuits, IEEE Journal of , Volume: 30 Issue: 3 , March 1995 Page(s): 210 ­ 218 Bagheri, M.; Kong, D.T.; Hacker, J.; Schneider, K.; Chow, J. "10 Gb/s framer/demultiplexer IC for SONET STS -192 applications"- Custom Integrated Circuits Conference, 1995., Proceedings of the IEEE 1995 , 1995 Page(s): 427 ­ 430 Barnes Z. "Microprocessr and CAN Interface" contribution to ALMA Backend PDR meeting, Granada, April 24-25, 2002 Baudry A., Deschans D., Beguerer J.B., Deval Y. , Fouillat P., Montignac G., Gentaz O., Torres M. "Designing and Prototyping of 2-4 GHz Bandpass SiGe Digitizers and Associated Test Equipment for the ALMA Project. I " ALMA Memo no. 410 (2002) Collodi G., Comoretto G. "A MMIC Sampler for ALMA ­ Design Report" Arcetri Technical Report n. 2/2001 Deschans D., Beguerer J.B., Deval Y., Scarabello C., Fouillat P., Montignac G., Baudry A. "A High Speed, High Bandwidth, Low Resolution Flash ADC for Radio Astronomy Applications" contribution to ALMA Backend PDR meeting, Granada, April 24-25, 2002. Deschans D., Beguerer J.B., Deval Y., Fouillat P., Baudry A., Montignac G. "4 -Gsample/s, 2 bit SiGe Digitizers for the ALMA Project. Paper II" ALMA Memo no. 426 (2002) Ishida, K.; Wakimoto, H.; Yoshihara, K.; Konno, M.; Sh imizu, S.; Kitaura, Y.; Tomita, K.; Suzuki, T.; Uchitomi, N. "A 10-GHz 8-b multiplexer/demultiplexer chip set for the SONET STS -192 system"- Solid -State Circuits, IEEE Journal of , Volume: 26 Issue: 12 , Dec. 1991 Page(s): 1936 ­ 1943 Mayvial Jean Yves, "Proposal for an all-commercial devices 4 GHz triple 1:16 demux" IRAM Internal note (29/05/2002) mayvial@iram.fr

[2.]

[3.] [4.]

[5.] [6.]

[7.] [8.]

[9.]

[10.] Stephen I. Long; Steven E. Burtner "Gallium arsenide digital integrated circuit design " ­ Mc Graw Hill International Edition. [11.] Tanabe, A.; Umetani, M.; Fujiwara, I.; Ogura, T.; Kataoka, K.; Okihara, M.; Sakuraba, H.; Endoh, T.; Masuoka, F. "0.18-µm CMOS 10-Gb/s multiplexer/demultiplexer ICs using current mode logic with tolerance to threshold voltage fluctuation"- Solid -State Circuits, IEEE Journal of , Volume: 36 Issue: 6 , June 2001 Page(s): 988 ­ 996 [12.] Torres M., Gentaz. O. "A Simple Technique for disciplinig independent demultiplexers" ALMA Memo no. 383 (2001) [13.] Torres M. "Closed loop automatic adjustment of eye diagram" contribution to ALMA Backend PDR meeting, Granada, April 24-25, 2002 [14.] Yoshida, N.; Fujii, M.; Atsumo, T.; Numata, K.; Asai, S.; Kohno, M.; Oikawa, H.; Tsutsui, H.; Maeda, T. "Low-power-consumption 10-Gbps GaAs 8:1 multiplexer/1:8 demultiplexer"- Gallium Arsenide Integrated Circuit (GaAs IC) Symposium, 1997. Technical Digest 1997., 19th Annual , 1997 Page(s): 113 ­ 116